Gemini
Google DeepMind's family of multimodal AI models designed to process and generate text, images, audio, and video within a single unified architecture.
Model Tiers
Gemini Ultra/1.5 Pro: Most capable, 1M+ token context. Gemini Pro: Balanced performance. Gemini Flash: Fast and efficient. Gemini Nano: On-device for mobile.
Key Features
Native multimodal understanding (not just text+vision bolted together), extremely long context window (up to 2M tokens), strong reasoning and coding capabilities, and deep integration with Google services.