Meta AI gets two new models as Meta releases Llama 4
Meta has announced the release of Llama 4 , its newest collection of AI models that now power Meta AI on the web and in WhatsApp, Messenger, and Instagram Direct. The two models, also available to download from Meta or Hugging Face now, are Llama 4 Scout, a small model capable of “fitting in a single Nvidia H100 GPU,” and Llama 4 Maverick, which is more akin to GPT-4o and Gemini 2.0 Flash. And the company says it’s in the process of training Llama 4 Behemoth, which Meta CEO Mark Zuckerberg says on Instagram is “already the highest performing base model in the world.” According to Meta, Scout has a 10-million-token context window — the working memory of an AI model — and beats Google’s Gemma 3 and Gemini 2.0 Flash-Lite models, as well as the open-source Mistral 3.1, “across a broad range of widely reported benchmarks,” while still “fitting in a single Nvidia H100 GPU.” It makes similar claims about its larger Maverick model’s performance versus OpenAI’s GPT-4o and Google’s Gemin...