meta-llamaVision

Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

Context

131k tokens

Input / 1M tokens

$0.34

Output / 1M tokens

$0.34

Benchmark

—

Pricing

Input tokens$0.34 per 1M tokens

Output tokens$0.34 per 1M tokens

Technical details

Model IDmeta-llama/llama-3.2-11b-vision-instruct

Context window131k tokens

Input modalitiestext, image

Output modalitiestext

TokenizerLlama3

Max output tokens16,384

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'meta-llama/llama-3.2-11b-vision-instruct',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Meta: Llama 3.2 11B Vision Instruct?

Get an API key and start making requests in minutes.

Get an API key

← All models