MultimodalText Generation

Llama 3.2 90B Vision

Meta's first multimodal Llama with vision capabilities

Released 2024-09-25
Context Window
128K
API Access
Available

Benchmark Performance

reasoning

reasoning

Massive Multitask Language Understanding

86.0%
reasoning

Massive Multi-discipline Multimodal Understanding

60.3%