The Yi Vision is a complex visual task models provide high-performance understanding and analysis capabilities based on multiple images.

It's ideal for scenarios that require analysis and interpretation of images and charts, such as image question answering, chart understanding, OCR, visual reasoning, education, research report understanding, or multilingual document reading.

Model Information

Model ID

01-ai/yi-vision

Context Length

16,384 tokens

Author

01-ai

Capabilities