Gemini 2.0: Google Knocks It Out of the Park
Published: January 19, 2025
Quick Take: Next-level tech with real-time multimodal power
Note: These are early impressions based on a few hours of testing. Gemini 2.0 launched just yesterday. This is not a detailed evaluation.
Explored AI Studio and tinkered with the React starter app, which has full API access to real-time vision, video, and audio. Came in with low expectations. Left genuinely astounded.
What Stood Out
- Real-time vision, video, and audio are remarkable
- Toolkit for Devs & Analysts: Full API access, React starter kit, and Python SDK
- Generous Free Tier: 10 RPM, 4M TPM, and 1,500 requests/day. Great for prototyping and testing
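For anyone prototyping against the free tier, here is a minimal sketch of a client-side throttle that stays inside the 10 RPM and 1,500 requests/day figures quoted above. The class name QuotaThrottle and its methods are hypothetical, not part of any Google SDK, and the 4M TPM token limit is not modeled here.

```python
import time
from collections import deque


class QuotaThrottle:
    """Hypothetical sliding-window limiter for per-minute and per-day request caps."""

    def __init__(self, per_minute=10, per_day=1500, clock=time.monotonic):
        # Defaults mirror the free-tier numbers quoted in this post (assumed current).
        self.per_minute = per_minute
        self.per_day = per_day
        self.clock = clock
        self.minute_window = deque()  # timestamps of requests in the last 60 s
        self.day_window = deque()     # timestamps of requests in the last 24 h

    def _evict(self, now):
        # Drop timestamps that have aged out of each window.
        while self.minute_window and now - self.minute_window[0] >= 60:
            self.minute_window.popleft()
        while self.day_window and now - self.day_window[0] >= 86400:
            self.day_window.popleft()

    def wait_time(self):
        """Seconds until the next request is allowed (0.0 if allowed now)."""
        now = self.clock()
        self._evict(now)
        wait = 0.0
        if len(self.minute_window) >= self.per_minute:
            wait = max(wait, 60 - (now - self.minute_window[0]))
        if len(self.day_window) >= self.per_day:
            wait = max(wait, 86400 - (now - self.day_window[0]))
        return wait

    def record(self):
        """Record that a request was just sent."""
        now = self.clock()
        self.minute_window.append(now)
        self.day_window.append(now)
```

Typical use would be to call time.sleep(throttle.wait_time()) before each API call and throttle.record() right after sending it. Worth noting: at a sustained 10 requests per minute, the 1,500 daily requests run out after 150 minutes.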
What's in it for You?
- AI Studio: Real-time multimodal power, for developers as well as non-technical users
- Build Multimodal AI Apps: Solid set of developer resources and likely affordable pricing (an assumption, if the free tier is anything to go by)
Any Minuses?
Still early days. Some latency and breaks were noticeable, but it's Day 0, so there is plenty of time to optimize.
Up Next
Integrating real-time voice with REX, my open-source decision intelligence app for natural language querying and database integration. Testing alternatives to the OpenAI Realtime API, starting with ElevenLabs, Hume, and now Gemini 2.0. Planning to share learnings, working apps, and source code soon.
Try REX for free: rex.tigzig.com
Full Breakdown: Releasing REX-2: AI Decision Intelligence
Links
- AI Studio: aistudio.google.com
- Open Source: Source code on app site