Claude beat Gemini on a 150-page document test, but not for the reason you’d think

www.makeuseof.com/claude-beat-gemini-on-150-page-document-not-for-reason-you-think/

  • The real reason I decided to run this test
    A recurring bottleneck with dense documents and two competing answers
  • How I structured the comparison without tilting the odds
    The 150-page document, the exact same prompts, and what I was actually measuring
  • Claude retained the full context; Gemini started dropping threads partway through
    The behavior difference emerged in specific ways
  • What a larger context window actually buys you in practice
    Token limits matter less than consistency, and consistency has real limits