AI Glossary

Gemini

Google DeepMind's family of multimodal AI models designed to process and generate text, images, audio, and video within a single unified architecture.

Model Tiers

Gemini Ultra/1.5 Pro: Most capable, 1M+ token context. Gemini Pro: Balanced performance. Gemini Flash: Fast and efficient. Gemini Nano: On-device for mobile.

Key Features

Native multimodal understanding (not just text+vision bolted together), extremely long context window (up to 2M tokens), strong reasoning and coding capabilities, and deep integration with Google services.

← Back to AI Glossary

Gemini

Model Tiers

Key Features

Related Articles

Google Gemini: The Multimodal AI Powerhouse

Multimodal LLMs: When AI Can See, Hear, and Read

Related Concepts