Llama 4 Scout

Efficient Llama 4 variant for single-GPU inference

Context Window: 10M tokens
Parameters: 109B (MoE)
Source: Open
Modality: Text, Vision

Llama 4 Scout is the smaller, more efficient model in the Llama 4 family: a 109B-parameter mixture-of-experts (MoE) model designed for single-GPU inference, with text and vision input.
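To put the single-GPU claim in context, here is a rough weight-memory estimate at a few precisions. It is a back-of-the-envelope sketch, not a deployment guide: the 80 GB GPU capacity and the choice of precisions are assumptions, not figures from this listing, and the estimate counts weights only (no KV cache, activations, or runtime overhead). It also assumes all 109B parameters stay resident, which is the usual case for MoE models even though only some experts are active per token.

```python
# Rough weight-memory estimate for a 109B-parameter model at several precisions.
# Assumptions (not from the listing): an 80 GB accelerator, weights only,
# no KV cache, activations, or runtime overhead.

TOTAL_PARAMS = 109e9   # from the listing: 109B (MoE), all experts kept in memory
GPU_MEMORY_GB = 80     # assumed single-GPU capacity

BITS_PER_PARAM = {"bf16": 16, "int8": 8, "int4": 4}


def weight_memory_gb(params: float, bits: int) -> float:
    """Return the memory needed to hold the weights, in decimal gigabytes."""
    return params * bits / 8 / 1e9


def fits_on_gpu(params: float, bits: int, gpu_gb: float = GPU_MEMORY_GB) -> bool:
    """Return True if the weights alone fit within gpu_gb gigabytes."""
    return weight_memory_gb(params, bits) <= gpu_gb


if __name__ == "__main__":
    for name, bits in BITS_PER_PARAM.items():
        gb = weight_memory_gb(TOTAL_PARAMS, bits)
        verdict = "fits" if fits_on_gpu(TOTAL_PARAMS, bits) else "does not fit"
        print(f"{name}: {gb:.1f} GB of weights, {verdict} in {GPU_MEMORY_GB} GB")
```

Under these assumptions the weights take about 218 GB at 16 bits, 109 GB at 8 bits, and 54.5 GB at 4 bits, so only the 4-bit case fits on one 80 GB device. Long contexts add KV-cache memory on top of this, so the real headroom is smaller.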

Specifications

Technical details
Developer: Meta AI
Model Family: Llama 4
Parameters: 109B (MoE)
Context Window: 10,000,000 tokens (10M)
Modality: Text, Vision
Open Source: Yes
License: Llama 4 Community License
API Available: Yes
Release Date: April 5, 2025
Pricing: Free to self-host