meta-llama · Vision
Meta: Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
Context: 131k tokens
Input: $0.34 per 1M tokens
Output: $0.34 per 1M tokens
Benchmark: not available
Pricing
Input tokens: $0.34 per 1M tokens
Output tokens: $0.34 per 1M tokens
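
To make the per-token pricing concrete, here is a minimal sketch of a cost estimate. The helper name `estimateCostUsd` is hypothetical, and the sketch assumes billing is purely per token at the listed rates; how image inputs are counted toward tokens is not stated on this page.

```typescript
// Prices copied from the pricing listed above (USD per 1M tokens).
const INPUT_USD_PER_MILLION = 0.34
const OUTPUT_USD_PER_MILLION = 0.34

// Hypothetical helper: estimates the cost of one request in USD,
// assuming simple per-token billing at the listed rates.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION
  return inputCost + outputCost
}

// Example: a 2,000-token prompt with a 500-token reply.
console.log(estimateCostUsd(2_000, 500).toFixed(6)) // prints "0.000850"
```

At these rates, a million input tokens plus a million output tokens would come to about $0.68.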
Technical details
Model ID: meta-llama/llama-3.2-11b-vision-instruct
Context window: 131k tokens
Input modalities: text, image
Output modalities: text
Tokenizer: Llama3
Max output tokens: 16,384
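
As a rough illustration of working within the output limit, the sketch below caps a requested completion length at 16,384 tokens. The helper `clampMaxTokens` is hypothetical, and passing the result as `max_tokens` assumes the gateway honors that OpenAI-style parameter.

```typescript
// Limit copied from the technical details above.
const MAX_OUTPUT_TOKENS = 16_384

// Hypothetical helper: caps a requested completion length at the model's limit.
function clampMaxTokens(requested: number): number {
  if (!Number.isFinite(requested) || requested < 1) {
    throw new RangeError('requested must be a positive number')
  }
  return Math.min(Math.floor(requested), MAX_OUTPUT_TOKENS)
}

console.log(clampMaxTokens(50_000)) // prints 16384
console.log(clampMaxTokens(1_000)) // prints 1000
```

The returned value could then be supplied as `max_tokens` in a chat completion request.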
Use with MegaBrain
import OpenAI from 'openai'

// Point the OpenAI SDK at the MegaBrain gateway.
const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'meta-llama/llama-3.2-11b-vision-instruct',
  messages: [{ role: 'user', content: 'Hello!' }],
})

// Print the model's reply.
console.log(response.choices[0].message.content)

Ready to use Meta: Llama 3.2 11B Vision Instruct?
Get an API key and start making requests in minutes.
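
Since the model accepts image input, a request that pairs a prompt with an image may be more representative than a text-only greeting. The sketch below builds such a request body using the OpenAI chat-completions content-part format (`text` and `image_url` parts). The helper `buildVisionRequest` and the example URL are hypothetical, and it is an assumption that the gateway accepts image parts in this form.

```typescript
// Content parts in the OpenAI chat-completions format.
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image_url'; image_url: { url: string } }
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

interface VisionRequest {
  model: string
  messages: UserMessage[]
}

// Hypothetical helper: builds a request body pairing a prompt with one image.
function buildVisionRequest(prompt: string, imageUrl: string): VisionRequest {
  return {
    model: 'meta-llama/llama-3.2-11b-vision-instruct',
    messages: [
      {
        role: 'user',
        content: [
          { type: 'text', text: prompt },
          { type: 'image_url', image_url: { url: imageUrl } },
        ],
      },
    ],
  }
}

// Placeholder URL; replace it with a real, publicly reachable image.
const request = buildVisionRequest('Describe this image.', 'https://example.com/cat.png')

// With the client from the usage snippet above
// (assumes the gateway accepts image parts):
// const response = await client.chat.completions.create(request)
// console.log(response.choices[0].message.content)
```

Keeping the payload construction in a pure function makes it easy to inspect or test the request before any network call is made.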