Google's latest multimodal model, supports image and video[0] in text or chat prompts.

Optimized for language tasks including:

  • Code generation
  • Text generation
  • Text editing
  • Problem solving
  • Recommendations
  • Information extraction
  • Data extraction or generation
  • AI agents

Usage of Gemini is subject to Google's Gemini Terms of Use.

  • [0]: Video input is not available through OpenRouter at this time.

Model Information

Model ID

google/gemini-pro-1.5

Context Length

2,000,000 tokens

Author

google

Capabilities