Google's latest multimodal model, supports image and video[0] in text or chat prompts.
Optimized for language tasks including:
Usage of Gemini is subject to Google's Gemini Terms of Use.
Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.