Claude beat Gemini on a 150-page document test, but not for the reason you'd think

Claude beat Gemini on a 150-page document test, but not for the reason you’d think
www.makeuseof.com/claude-beat-gemini-on-150-page-document-not-for-reason-you-think/

The real reason I decided to run this test
A recurring bottleneck with dense documents and two competing answers
How I structured the comparison without tilting the odds
The 150-page document, the exact same prompts, and what I was actually measuring
Claude retained the full context; Gemini started dropping threads partway through
The behavior difference emerged in specific ways
What a larger context window actually buys you in practice
Token limits matter less than consistency, and consistency has real limits