model ranking

Best AI Models for Document Reasoning

Best AI Models for Document Reasoning ranked by lifecycle, evidence gates, fit scores, and source-backed policy review. Extraction, legal/finance documents, citations, structured output, and audit trails. Reviewed every 30 days.

Internal lane: High-accuracy structured document reasoning

Public candidatelane specific

Candidates

Evidence gates

Modality

documents

Review

30d

Policy

tool-policy-shadow-2026-05-07

Policy state

candidate

Live rankings

This lane is reading governed policy rows and ranked candidates from the live database.

Active contextSolution: automotive-automotive-operations-optimization

Ranking methodology

Candidates are ranked only after separating model quality from lifecycle safety. A high benchmark score can improve rank, but deprecated, superseded, unavailable, or insufficiently evidenced models stay out of public recommendation authority.

Primary signals

Official lifecycle, availability, pricing, and provider documentation
Leaderboard and benchmark evidence for model quality and preference
Privacy, residency, tenancy, and security constraints
Enterprise readiness, governance, and support fit
Grounding and citation support
Structured-output reliability
Human authority and review workflow fit

Disqualifiers

Deprecated, superseded, retired, or blocked lifecycle status
Missing official availability or provider surface evidence
Missing required lane evidence gates
Provider binding conflicts for lanes that require a deployable offering
Context mismatch between the solution slot and the ranking lane

Refresh cadence: 30 days

Ranked candidates

Candidate rows are lane-scoped and evidence-gated; fallback references are shown separately.

43 rows

Rank 9 · model deployment

Google Gemini 3 Flash API

Google Gemini 3 Flash API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent2 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

68%

Latency

88%

Cost

80%

Privacy

56%

Ops

82%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Google Gemini 3 Flash API base_model: Gemini 3 Flash api_surface: Gemini API provider: Google

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details

Rank 10 · model deployment

Google Gemini 2.5 Flash API

Google Gemini 2.5 Flash API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent2 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

68%

Latency

88%

Cost

80%

Privacy

56%

Ops

82%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Google Gemini 2.5 Flash API base_model: Gemini 2.5 Flash provider: Google api_surface: Gemini API

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 17 · model deployment

OpenAI GPT-5.5 API

OpenAI GPT-5.5 API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent1 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

82%

Latency

66%

Cost

61%

Privacy

59%

Ops

82%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: OpenAI GPT-5.5 API base_model: GPT-5.5 api_surface: OpenAI LLM API provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 18 · model deployment

Anthropic Claude Sonnet 4.6 API

Anthropic Claude Sonnet 4.6 API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent1 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

82%

Latency

66%

Cost

61%

Privacy

59%

Ops

82%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude Sonnet 4.6 API base_model: Claude Sonnet 4.6 api_surface: Anthropic API provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 21 · model deployment

Google Gemini 3 Pro API

Google Gemini 3 Pro API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent2 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

82%

Latency

56%

Cost

46%

Privacy

56%

Ops

82%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Google Gemini 3 Pro API base_model: Gemini 3 Pro api_surface: Gemini API provider: Google

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 24 · model deployment

Anthropic Claude Opus 4.7 API

Anthropic Claude Opus 4.7 API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidatecurrent1 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

82%

Latency

56%

Cost

49%

Privacy

59%

Ops

82%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude Opus 4.7 API base_model: Claude Opus 4.7 api_surface: Anthropic API provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 3 · model deployment

Llama 3.1 via Azure AI Foundry

Llama 3.1 via Azure AI Foundry is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

92%

Latency

56%

Cost

46%

Privacy

68%

Ops

76%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Llama 3.1 via Azure AI Foundry base_model: Llama 3.1 70B api_surface: Llama 3.1 via Azure AI Foundry provider: Microsoft

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 12 · model deployment

Azure OpenAI GPT-4.1

Azure OpenAI GPT-4.1 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

66%

Cost

58%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Azure OpenAI GPT-4.1 base_model: GPT-4.1 api_surface: Azure OpenAI GPT-4.1 provider: Microsoft

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 22 · model deployment

Mistral via Azure

Mistral via Azure is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

56%

Cost

46%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral via Azure base_model: Mistral Large api_surface: Mistral via Azure provider: Microsoft

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 28 · model deployment

OpenAI GPT-4.1

OpenAI GPT-4.1 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: OpenAI GPT-4.1 base_model: GPT-4.1 api_surface: OpenAI GPT-4.1 provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 31 · model deployment

GPT-4.1 via OpenAI

GPT-4.1 via OpenAI is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: GPT-4.1 via OpenAI base_model: GPT-4.1 api_surface: GPT-4.1 via OpenAI provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 35 · model deployment

gpt-4.1 (OpenAI)

gpt-4.1 (OpenAI) is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate1 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: gpt-4.1 (OpenAI)base_model: GPT-4.1 api_surface: gpt-4.1 (OpenAI)provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 37 · model deployment

Mistral API

Mistral API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

56%

Cost

46%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral API base_model: Mistral Large api_surface: Mistral API provider: Mistral

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 38 · model deployment

Mistral Large via Le Chat API

Mistral Large via Le Chat API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

56%

Cost

46%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral Large via Le Chat API base_model: Mistral Large api_surface: Mistral Large via Le Chat API provider: Mistral

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 39 · model deployment

Mistral Large via API

Mistral Large via API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

56%

Cost

46%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral Large via API base_model: Mistral Large api_surface: Mistral Large via API provider: Mistral

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 40 · model deployment

Mistral AI API

Mistral AI API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

56%

Cost

46%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral AI API base_model: Mistral Large api_surface: Mistral AI API provider: Mistral

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 41 · model deployment

Mistral Codestral

Mistral Codestral is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

62%

Latency

62%

Cost

58%

Privacy

59%

Ops

71%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral Codestral base_model: Codestral api_surface: Mistral Codestral provider: Mistral

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 42 · model deployment

Mistral Large via APIs

Mistral Large via APIs is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

52%

Cost

46%

Privacy

59%

Ops

71%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral Large via APIs base_model: Mistral Large api_surface: Mistral Large via APIs provider: Mistral

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 43 · model deployment

Mistral AI

Mistral AI is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

candidate3 evidence rowsPolicy tool-policy-shadow-2026-05-07

Quality

66%

Latency

52%

Cost

46%

Privacy

59%

Ops

71%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Mistral AI base_model: Mistral Large api_surface: Mistral AI provider: Mistral

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 1 · model deployment

GPT-4o via Azure OpenAI

GPT-4o via Azure OpenAI is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.

OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.

Quality

91%

Latency

66%

Cost

58%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

base_model: GPT-4o offering: GPT-4o via Azure OpenAI api_surface: GPT-4o via Azure OpenAI provider: Microsoft

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 2 · model deployment

Azure OpenAI (GPT‑4o)

Azure OpenAI (GPT‑4o) is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.

OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.

Quality

91%

Latency

66%

Cost

58%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Azure OpenAI (GPT‑4o)base_model: GPT-4o api_surface: Azure OpenAI (GPT‑4o)provider: Microsoft

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 4 · model deployment

Google Vertex Gemini

Google Vertex Gemini is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Google Gemini 3 Pro or Gemini 2.5 Pro as the replacement.

Google Gemini 1.5 Pro is deprecated; use Google Gemini 3 Pro or Gemini 2.5 Pro.

Quality

92%

Latency

56%

Cost

46%

Privacy

68%

Ops

76%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Google Vertex Gemini base_model: Gemini 1.5 Pro api_surface: Google Vertex Gemini provider: Google

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 5 · model deployment

Anthropic Claude 3.5 Opus

Anthropic Claude 3.5 Opus is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Opus 4.7 as the replacement.

Anthropic Claude 3.5 Opus is deprecated; use Anthropic Claude Opus 4.7.

Quality

100%

Latency

56%

Cost

46%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3.5 Opus base_model: Claude 3.5 Opus api_surface: Anthropic Claude 3.5 Opus provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 6 · model deployment

OpenAI GPT-4o

OpenAI GPT-4o is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.

OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.

Quality

91%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: OpenAI GPT-4o base_model: GPT-4o api_surface: OpenAI GPT-4o provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 7 · model deployment

OpenAI GPT‑4o API

OpenAI GPT‑4o API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.

OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.

Quality

91%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: OpenAI GPT‑4o API base_model: GPT-4o api_surface: OpenAI GPT‑4o API provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 8 · model deployment

OpenAI GPT-4o (via API)

OpenAI GPT-4o (via API) is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5 or GPT-5.4 mini as the replacement.

OpenAI GPT-4o is superseded; review OpenAI GPT-5.5 or GPT-5.4 mini.

Quality

91%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: OpenAI GPT-4o (via API)base_model: GPT-4o api_surface: OpenAI GPT-4o (via API)provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 11 · model deployment

Anthropic Claude on AWS

Anthropic Claude on AWS is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude on AWS base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude on AWS provider: AWS

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 13 · model deployment

Anthropic Claude via AWS

Anthropic Claude via AWS is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude via AWS base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude via AWS provider: AWS

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 14 · model deployment

Anthropic Claude via AWS Bedrock

Anthropic Claude via AWS Bedrock is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

base_model: Claude 3.5 Sonnet offering: Anthropic Claude via AWS Bedrock api_surface: Anthropic Claude via AWS Bedrock provider: AWS

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 15 · model deployment

Anthropic Claude via Bedrock

Anthropic Claude via Bedrock is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

71%

Ops

76%

Enterprise

86%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude via Bedrock base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude via Bedrock provider: AWS

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 16 · model deployment

Anthropic Claude 3 Opus

Anthropic Claude 3 Opus is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Opus 4.7 as the replacement.

Anthropic Claude 3 Opus is deprecated; use Anthropic Claude Opus 4.7.

Quality

95%

Latency

56%

Cost

46%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3 Opus base_model: Claude 3 Opus api_surface: Anthropic Claude 3 Opus provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 19 · model deployment

Google Gemini 1.5 Pro

Google Gemini 1.5 Pro is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Google Gemini 3 Pro or Gemini 2.5 Pro as the replacement.

Google Gemini 1.5 Pro is deprecated; use Google Gemini 3 Pro or Gemini 2.5 Pro.

Quality

92%

Latency

56%

Cost

46%

Privacy

56%

Ops

71%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Google Gemini 1.5 Pro base_model: Gemini 1.5 Pro api_surface: Google Gemini 1.5 Pro provider: Google

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 20 · model deployment

Google Gemini 1.5

Google Gemini 1.5 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Google Gemini 3 Pro or Gemini 2.5 Pro as the replacement.

Google Gemini 1.5 Pro is deprecated; use Google Gemini 3 Pro or Gemini 2.5 Pro.

Quality

92%

Latency

56%

Cost

46%

Privacy

56%

Ops

71%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Google Gemini 1.5 base_model: Gemini 1.5 Pro api_surface: Google Gemini 1.5 provider: Google

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 23 · model deployment

Google Gemini 1.5 Flash

Google Gemini 1.5 Flash is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Google Gemini 2.5 Flash or Gemini 2.5 Flash-Lite as the replacement.

Google Gemini 1.5 Flash is deprecated; use Google Gemini 2.5 Flash or Gemini 2.5 Flash-Lite.

Quality

65%

Latency

88%

Cost

80%

Privacy

56%

Ops

71%

Enterprise

78%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Google Gemini 1.5 Flash base_model: Gemini 1.5 Flash api_surface: Google Gemini 1.5 Flash provider: Google

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 25 · model deployment

OpenAI GPT-4o mini

OpenAI GPT-4o mini is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecatedsuperseded2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review OpenAI GPT-5.5, GPT-5.4 mini, or GPT-5.4 nano as the replacement.

OpenAI GPT-4o mini is superseded; review OpenAI GPT-5.5, GPT-5.4 mini, or GPT-5.4 nano.

Quality

52%

Latency

88%

Cost

80%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: OpenAI GPT-4o mini base_model: GPT-4o Mini api_surface: OpenAI GPT-4o mini provider: OpenAI

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 26 · model deployment

Anthropic Claude 3.5

Anthropic Claude 3.5 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3.5 base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude 3.5 provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 27 · model deployment

Anthropic Claude 3.5 Sonnet

Anthropic Claude 3.5 Sonnet is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3.5 Sonnet base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude 3.5 Sonnet provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 29 · model deployment

Anthropic Claude 3

Anthropic Claude 3 is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3 base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude 3 provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 30 · model deployment

Anthropic Claude 3 Sonnet

Anthropic Claude 3 Sonnet is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3 Sonnet base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude 3 Sonnet provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 32 · model deployment

Anthropic Claude 3.5 Sonnet API

Anthropic Claude 3.5 Sonnet API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3.5 Sonnet API base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude 3.5 Sonnet API provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 33 · model deployment

Claude 3.5 via Anthropic

Claude 3.5 via Anthropic is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Claude 3.5 via Anthropic base_model: Claude 3.5 Sonnet api_surface: Claude 3.5 via Anthropic provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 34 · model deployment

Anthropic Claude via API

Anthropic Claude via API is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated2 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Sonnet 4.6 as the replacement.

Anthropic Claude 3.5 Sonnet is deprecated; use Anthropic Claude Sonnet 4.6.

Quality

66%

Latency

66%

Cost

58%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude via API base_model: Claude 3.5 Sonnet api_surface: Anthropic Claude via API provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Rank 36 · model deployment

Anthropic Claude 3 Haiku

Anthropic Claude 3 Haiku is a candidate policy-ranked High-accuracy structured document reasoning option. Public recommendation authority still requires gates to promote this candidate.

deprecateddeprecated3 evidence rowsPolicy tool-policy-shadow-2026-05-07

This option is lifecycle-blocked for public recommendations. Review Anthropic Claude Haiku 4.5 as the replacement.

Anthropic Claude 3 Haiku is deprecated; use Anthropic Claude Haiku 4.5.

Quality

48%

Latency

88%

Cost

80%

Privacy

59%

Ops

76%

Enterprise

70%

Why it ranks here

Quality and fit scores are lane-specific, so this rank is not a global popularity score.

Lifecycle status must stay current enough for the candidate to be rendered as a safe public choice.

Evidence rows and policy state decide whether the rank is advisory, candidate, or public authority.

Canonical links

offering: Anthropic Claude 3 Haiku base_model: Claude 3 Haiku api_surface: Anthropic Claude 3 Haiku provider: Anthropic

Generated as a candidate from a candidate component_offering. Not approved for public Recommended rendering until authority gates pass.

View option details Compare with rank #1

Fallback references

These are navigation aids for unresolved slots, not authority to call a tool the best option.

reference

Model leaderboard reference

Use benchmark leaderboards and provider status while lane-specific policy candidates are pending.

pending policy candidate

Provider-bound deployment choice

Provider, API surface, residency, and lifecycle state still need a component offering row before public recommendation authority.

Evidence gates

A candidate needs lane-specific evidence before it can move from comparison to public selection.

benchmark

30d cadence

Citation grounding evaluation

citation_grounding_eval

security review

30d cadence

Privacy and compliance review

privacy_review

benchmark

30d cadence

Structured output evaluation

structured_output_eval