Meta: Llama 3.1 405B (base)
meta-llama/llama-3.1-405b
Created Aug 2, 202432,768 context
$4/M input tokens$4/M output tokens
Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.