Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

Context: 1M tokens
Input / 1M tokens: $0.15
Output / 1M tokens: $0.60
Pricing

Input tokens: $0.15 per 1M tokens
Output tokens: $0.60 per 1M tokens
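
To make the per-token rates concrete, here is a minimal sketch of a cost estimate for a single request. The helper name `estimateCostUsd` and the example token counts are illustrative assumptions, not part of any MegaBrain API; only the two prices come from this page.

```typescript
// Prices taken from the pricing figures on this page (USD per 1M tokens).
const INPUT_USD_PER_MILLION = 0.15
const OUTPUT_USD_PER_MILLION = 0.60

// Hypothetical helper: estimates the cost of one request in USD.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  return (
    (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION +
    (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION
  )
}

// Example: a 20,000-token prompt that produces a 1,000-token reply.
console.log(estimateCostUsd(20_000, 1_000).toFixed(4)) // prints "0.0036"
```

Actual billing may differ if the gateway counts tokens differently (for example, for image inputs), so treat the result as an approximation.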

Technical details

Model ID: meta-llama/llama-4-maverick
Context window: 1M tokens
Input modalities: text, image
Output modalities: text
Tokenizer: Llama4
Max output tokens: 16,384
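
Since the model accepts text and image input, a request can combine both in one user message. The sketch below builds such a request body in the OpenAI-style chat format (`text` and `image_url` content parts). It assumes the gateway forwards these parts unchanged, which this page does not confirm; `buildVisionRequest`, the default of 1,024 tokens, and the example image URL are hypothetical.

```typescript
// Output ceiling taken from the technical details on this page.
const MAX_OUTPUT_TOKENS = 16_384

// Content parts in the OpenAI-style chat format (text plus image URL).
type ContentPart =
  | { type: 'text'; text: string }
  | { type: 'image_url'; image_url: { url: string } }

interface VisionRequest {
  model: string
  max_tokens: number
  messages: { role: 'user'; content: ContentPart[] }[]
}

// Hypothetical helper: builds a text + image request body.
function buildVisionRequest(
  prompt: string,
  imageUrl: string,
  maxTokens: number = 1024,
): VisionRequest {
  return {
    model: 'meta-llama/llama-4-maverick',
    // Clamp the requested length to the documented output ceiling.
    max_tokens: Math.min(maxTokens, MAX_OUTPUT_TOKENS),
    messages: [
      {
        role: 'user',
        content: [
          { type: 'text', text: prompt },
          { type: 'image_url', image_url: { url: imageUrl } },
        ],
      },
    ],
  }
}

// Placeholder URL for illustration only.
const visionRequest = buildVisionRequest('Describe this image.', 'https://example.com/cat.png')
console.log(JSON.stringify(visionRequest, null, 2))
```

The resulting object has the shape that `client.chat.completions.create` expects in the OpenAI SDK, so it could be passed there directly if the gateway supports image parts.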

Use with MegaBrain

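// Uses the official OpenAI SDK, pointed at the MegaBrain gateway.
import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  // Read the key from the environment rather than hard-coding it.
  apiKey: process.env.MEGABRAIN_API_KEY,
})

// Top-level await requires an ES module (for example, a .mjs file).
const response = await client.chat.completions.create({
  model: 'meta-llama/llama-4-maverick',
  messages: [{ role: 'user', content: 'Hello!' }],
})

// Print the assistant's reply.
console.log(response.choices[0].message.content)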

Ready to use Meta: Llama 4 Maverick?

Get an API key and start making requests in minutes.