Sure, gpt-4o has a context window of 128k, but it loses a lot from the beginning/middle.
reply
nolima https://arxiv.org/abs/2502.05167