meta-llama · Vision

Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

Context: 1M tokens

Input / 1M tokens: $0.15

Output / 1M tokens: $0.60

Benchmark: none listed

Pricing

Input tokens: $0.15 per 1M tokens

Output tokens: $0.60 per 1M tokens
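
To make the per-token prices concrete, here is a minimal sketch of a cost estimate. The helper name `estimateCostUSD` is hypothetical, and the sketch assumes billing scales linearly with token counts, with no minimum charge or separate image pricing; the page does not confirm either assumption.

```typescript
// Prices quoted on this page, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.15
const OUTPUT_USD_PER_MILLION = 0.60

// Hypothetical helper (not part of any SDK): estimate the cost of one request,
// assuming cost is strictly proportional to the number of tokens.
function estimateCostUSD(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError('token counts must be non-negative')
  }
  return (
    (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION +
    (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION
  )
}

// Example: a 20,000-token prompt with a 1,000-token reply.
const cost = estimateCostUSD(20_000, 1_000)
console.log(`Estimated cost: $${cost.toFixed(4)}`)
```

Under these assumptions, the example request comes to roughly $0.0036, most of it from the input side.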

Technical details

Model ID: meta-llama/llama-4-maverick

Context window: 1M tokens

Input modalities: text, image

Output modalities: text

Tokenizer: Llama4

Max output tokens: 16,384
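
Since the model accepts image input and caps output at 16,384 tokens, a request body can be sketched as follows. This assumes the gateway accepts OpenAI-style `image_url` content parts and the `max_tokens` parameter, which this page does not state. The helper `buildVisionRequest` and the example image URL are hypothetical.

```typescript
// Values taken from the technical details on this page.
const MODEL_ID = 'meta-llama/llama-4-maverick'
const MAX_OUTPUT_TOKENS = 16_384

// Local types mirroring OpenAI-style chat content parts
// (assumption: the gateway accepts this shape for image input).
type ContentPart =
  | { type: 'text'; text: string }
  | { type: 'image_url'; image_url: { url: string } }

interface VisionRequest {
  model: string
  max_tokens: number
  messages: { role: 'user'; content: ContentPart[] }[]
}

// Hypothetical helper: build a text + image request and clamp the requested
// output length to the range 1..MAX_OUTPUT_TOKENS.
function buildVisionRequest(
  prompt: string,
  imageUrl: string,
  maxTokens: number = MAX_OUTPUT_TOKENS,
): VisionRequest {
  return {
    model: MODEL_ID,
    max_tokens: Math.max(1, Math.min(Math.floor(maxTokens), MAX_OUTPUT_TOKENS)),
    messages: [
      {
        role: 'user',
        content: [
          { type: 'text', text: prompt },
          { type: 'image_url', image_url: { url: imageUrl } },
        ],
      },
    ],
  }
}

// Asking for more than the model allows is clamped to the documented maximum.
const request = buildVisionRequest(
  'Describe this image.',
  'https://example.com/cat.png', // placeholder URL
  50_000,
)
console.log(request.max_tokens)
```

If the assumptions hold, the resulting object could be passed to `client.chat.completions.create(request)` using an OpenAI SDK client configured for the gateway.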

Use with MegaBrain

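// Uses the official OpenAI SDK pointed at the MegaBrain gateway.
// Top-level await requires running this file as an ES module.
import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY, // read the key from the environment
})

const response = await client.chat.completions.create({
  model: 'meta-llama/llama-4-maverick',
  messages: [{ role: 'user', content: 'Hello!' }],
})

// Print the text of the model's reply.
console.log(response.choices[0].message.content)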

