model ranking

Best AI Models for Document Reasoning

Best AI Models for Document Reasoning ranked by lifecycle, evidence gates, fit scores, and source-backed policy review. Extraction, legal/finance documents, citations, structured output, and audit trails. Reviewed every 30 days.

Internal lane: High-accuracy structured document reasoning

Public candidatelane specific
Candidates
43
Evidence gates
3
Modality
documents
Review
30d
Policy
tool-policy-shadow-2026-05-07
Policy state
candidate
Live rankings

This lane is reading governed policy rows and ranked candidates from the live database.

Ranking methodology

Candidates are ranked only after separating model quality from lifecycle safety. A high benchmark score can improve rank, but deprecated, superseded, unavailable, or insufficiently evidenced models stay out of public recommendation authority.

Primary signals
  • Official lifecycle, availability, pricing, and provider documentation
  • Leaderboard and benchmark evidence for model quality and preference
  • Privacy, residency, tenancy, and security constraints
  • Enterprise readiness, governance, and support fit
  • Grounding and citation support
  • Structured-output reliability
  • Human authority and review workflow fit
Disqualifiers
  • Deprecated, superseded, retired, or blocked lifecycle status
  • Missing official availability or provider surface evidence
  • Missing required lane evidence gates
  • Provider binding conflicts for lanes that require a deployable offering
  • Context mismatch between the solution slot and the ranking lane
Refresh cadence: 30 days

Ranked candidates

Candidate rows are lane-scoped and evidence-gated; fallback references are shown separately.

43 rows
Rank 9 · model deployment

Google Gemini 3 Flash API

Google Gemini 3 Flash API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent2 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
68%
Latency
88%
Cost
80%
Privacy
56%
Ops
82%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 10 · model deployment

Google Gemini 2.5 Flash API

Google Gemini 2.5 Flash API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent2 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
68%
Latency
88%
Cost
80%
Privacy
56%
Ops
82%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 17 · model deployment

OpenAI GPT-5.5 API

OpenAI GPT-5.5 API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent1 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
82%
Latency
66%
Cost
61%
Privacy
59%
Ops
82%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 18 · model deployment

Anthropic Claude Sonnet 4.6 API

Anthropic Claude Sonnet 4.6 API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent1 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
82%
Latency
66%
Cost
61%
Privacy
59%
Ops
82%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 21 · model deployment

Google Gemini 3 Pro API

Google Gemini 3 Pro API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent2 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
82%
Latency
56%
Cost
46%
Privacy
56%
Ops
82%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 24 · model deployment

Anthropic Claude Opus 4.7 API

Anthropic Claude Opus 4.7 API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent1 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
82%
Latency
56%
Cost
49%
Privacy
59%
Ops
82%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 3 · model deployment

Llama 3.1 via Azure AI Foundry

Llama 3.1 via Azure AI Foundry is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
92%
Latency
56%
Cost
46%
Privacy
68%
Ops
76%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 12 · model deployment

Azure OpenAI GPT-4.1

Azure OpenAI GPT-4.1 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
66%
Cost
58%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 22 · model deployment

Mistral via Azure

Mistral via Azure is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
56%
Cost
46%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 28 · model deployment

OpenAI GPT-4.1

OpenAI GPT-4.1 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 31 · model deployment

GPT-4.1 via OpenAI

GPT-4.1 via OpenAI is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 35 · model deployment

gpt-4.1 (OpenAI)

gpt-4.1 (OpenAI) is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 37 · model deployment

Mistral API

Mistral API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
56%
Cost
46%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 38 · model deployment

Mistral Large via Le Chat API

Mistral Large via Le Chat API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
56%
Cost
46%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 39 · model deployment

Mistral Large via API

Mistral Large via API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
56%
Cost
46%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 40 · model deployment

Mistral AI API

Mistral AI API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
56%
Cost
46%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 41 · model deployment

Mistral Codestral

Mistral Codestral is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
62%
Latency
62%
Cost
58%
Privacy
59%
Ops
71%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 42 · model deployment

Mistral Large via APIs

Mistral Large via APIs is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
52%
Cost
46%
Privacy
59%
Ops
71%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 43 · model deployment

Mistral AI

Mistral AI is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07
Quality
66%
Latency
52%
Cost
46%
Privacy
59%
Ops
71%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 1 · model deployment

GPT-4o via Azure OpenAI

GPT-4o via Azure OpenAI is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.
OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.
Quality
91%
Latency
66%
Cost
58%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 2 · model deployment

Azure OpenAI (GPT‑4o)

Azure OpenAI (GPT‑4o) is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.
OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.
Quality
91%
Latency
66%
Cost
58%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 4 · model deployment

Google Vertex Gemini

Google Vertex Gemini is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Google Gemini 3 Pro or Gemini 2.5 Pro as the replacement.
Google Gemini 1.5 Pro is deprecated; use Google Gemini 3 Pro or Gemini 2.5 Pro.
Quality
92%
Latency
56%
Cost
46%
Privacy
68%
Ops
76%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 5 · model deployment

Anthropic Claude 3.5 Opus

Anthropic Claude 3.5 Opus is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Opus 4.7 as the replacement.
Anthropic Claude 3.5 Opus is deprecated; use Anthropic Claude Opus 4.7.
Quality
100%
Latency
56%
Cost
46%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 6 · model deployment

OpenAI GPT-4o

OpenAI GPT-4o is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.
OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.
Quality
91%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 7 · model deployment

OpenAI GPT‑4o API

OpenAI GPT‑4o API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.
OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.
Quality
91%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 8 · model deployment

OpenAI GPT-4o (via API)

OpenAI GPT-4o (via API) is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.
OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.
Quality
91%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 11 · model deployment

Anthropic Claude on AWS

Anthropic Claude on AWS is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 13 · model deployment

Anthropic Claude via AWS

Anthropic Claude via AWS is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 14 · model deployment

Anthropic Claude via AWS Bedrock

Anthropic Claude via AWS Bedrock is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 15 · model deployment

Anthropic Claude via Bedrock

Anthropic Claude via Bedrock is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
71%
Ops
76%
Enterprise
86%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 16 · model deployment

Anthropic Claude 3 Opus

Anthropic Claude 3 Opus is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Opus 4.7 as the replacement.
Anthropic Claude 3 Opus is deprecated; use Anthropic Claude Opus 4.7.
Quality
95%
Latency
56%
Cost
46%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 19 · model deployment

Google Gemini 1.5 Pro

Google Gemini 1.5 Pro is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Google Gemini 3 Pro or Gemini 2.5 Pro as the replacement.
Google Gemini 1.5 Pro is deprecated; use Google Gemini 3 Pro or Gemini 2.5 Pro.
Quality
92%
Latency
56%
Cost
46%
Privacy
56%
Ops
71%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 20 · model deployment

Google Gemini 1.5

Google Gemini 1.5 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Google Gemini 3 Pro or Gemini 2.5 Pro as the replacement.
Google Gemini 1.5 Pro is deprecated; use Google Gemini 3 Pro or Gemini 2.5 Pro.
Quality
92%
Latency
56%
Cost
46%
Privacy
56%
Ops
71%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 23 · model deployment

Google Gemini 1.5 Flash

Google Gemini 1.5 Flash is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Google Gemini 2.5 Flash or Gemini 2.5 Flash-Lite as the replacement.
Google Gemini 1.5 Flash is deprecated; use Google Gemini 2.5 Flash or Gemini 2.5 Flash-Lite.
Quality
65%
Latency
88%
Cost
80%
Privacy
56%
Ops
71%
Enterprise
78%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 25 · model deployment

OpenAI GPT-4o mini

OpenAI GPT-4o mini is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5, GPT-5.4 mini, or GPT-5.4 nano as the replacement.
OpenAI GPT-4o mini is superseded; review OpenAI GPT-5.5, GPT-5.4 mini, or GPT-5.4 nano.
Quality
52%
Latency
88%
Cost
80%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 26 · model deployment

Anthropic Claude 3.5

Anthropic Claude 3.5 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 27 · model deployment

Anthropic Claude 3.5 Sonnet

Anthropic Claude 3.5 Sonnet is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 29 · model deployment

Anthropic Claude 3

Anthropic Claude 3 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 30 · model deployment

Anthropic Claude 3 Sonnet

Anthropic Claude 3 Sonnet is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 32 · model deployment

Anthropic Claude 3.5 Sonnet API

Anthropic Claude 3.5 Sonnet API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 33 · model deployment

Claude 3.5 via Anthropic

Claude 3.5 via Anthropic is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 34 · model deployment

Anthropic Claude via API

Anthropic Claude via API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.
Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.
Quality
66%
Latency
66%
Cost
58%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Rank 36 · model deployment

Anthropic Claude 3 Haiku

Anthropic Claude 3 Haiku is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07
This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Haiku 4.5 as the replacement.
Anthropic Claude 3 Haiku is deprecated; use Anthropic Claude Haiku 4.5.
Quality
48%
Latency
88%
Cost
80%
Privacy
59%
Ops
76%
Enterprise
70%
Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

Fallback references

These are navigation aids for unresolved slots, not authority to call a tool the best option.

reference

Model leaderboard reference

Use benchmark leaderboards and provider status while lane-specific policy candidates are pending.

pending policy candidate

Provider-bound deployment choice

Provider, API surface, residency, and lifecycle state still need a component offering row before public recommendation authority.

Evidence gates

A candidate needs lane-specific evidence before it can move from comparison to public selection.

benchmark
30d cadence

Citation grounding evaluation

citation_grounding_eval

security review
30d cadence

Privacy and compliance review

privacy_review

benchmark
30d cadence

Structured output evaluation

structured_output_eval