Sonnet-3.7 the best, Deepseek 2nd, Gemini excellent. 𝐓𝐫𝐲 𝐢𝐭: rex.tigzig.com (open source)

𝐓𝐨𝐩 𝐋𝐢𝐧𝐞
As an AI Co-Analyst LLM, Sonnet-3.7 is my top choice for deep, incisive analysis support. Loving Gemini-2.0-Flash for its balance of quality, reliability and cost, and it's the fastest. Deepseek-R1 quality is close to Sonnet but it is less reliable. o3-mini is the lowest cost but not too great.

𝐓𝐚𝐤𝐞 𝐢𝐭 𝐟𝐨𝐫 𝐚 𝐬𝐩𝐢𝐧
▸ Go to rex.tigzig.com → Click “Sample” to auto-upload a sample file into a temporary Postgres database. Choose your advanced analyst agent: Gemini/Sonnet/R1/o3-mini. Use the sample prompt or modify it
▸ No login, database creds, or API keys needed
▸ Option: connect your own database or upload your own files

𝐀𝐠𝐞𝐧𝐭 𝐒𝐞𝐭𝐮𝐩 - 𝐅𝐥𝐨𝐰𝐢𝐬𝐞 𝐔𝐈
Sequential Agents (LangGraph). Router agent → regular queries go to a general analyst agent and complex queries go to an advanced analysis route → Reasoning LLM → analysis plan + SQL queries → execution agent (gpt-4o) reviews, corrects, executes, and debugs before delivering results
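The routing flow above can be sketched in plain Python. This is only an illustration of the branching, not the actual Flowise/LangGraph schema: `run_pipeline`, `Result` and the `keyword_router` heuristic are hypothetical names, and in the real app the routing decision is made by an LLM router agent rather than a keyword check.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Result:
    route: str        # "general" or "advanced"
    steps: List[str]  # ordered trace of the agents that handled the query


def route_query(query: str, is_complex: Callable[[str], bool]) -> str:
    """Router agent: decide which branch handles the query."""
    return "advanced" if is_complex(query) else "general"


def run_pipeline(query: str, is_complex: Callable[[str], bool]) -> Result:
    route = route_query(query, is_complex)
    if route == "general":
        # Regular queries go straight to the general analyst agent.
        return Result(route, ["router", "general_analyst"])
    # Complex queries: the reasoning LLM drafts the plan + SQL, then the
    # execution agent reviews, corrects, executes and debugs it.
    return Result(route, ["router", "reasoning_llm", "execution_agent"])


def keyword_router(query: str) -> bool:
    """Hypothetical stand-in for the LLM router: a simple keyword heuristic."""
    return any(word in query.lower() for word in ("rank", "score", "segment"))
```

Calling `run_pipeline("Score and rank the banks", keyword_router)` takes the advanced route, while a simple lookup such as `"count the rows"` stays with the general analyst.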

𝐐𝐮𝐚𝐥𝐢𝐭𝐲
My (judgmental) ranking: reasoning & analysis
▸ Sonnet - best by far. Brilliant derived variables & approach. Score - 100 (baseline). Sometimes too deep for 4o to execute, but superb for iterative analysis
▸ R1 - close to Sonnet - 95
▸ Gemini - excellent - 85
▸ o3-mini - hmmm... - 50

𝐂𝐏𝐐 (𝐂𝐨𝐬𝐭 𝐩𝐞𝐫 𝐐𝐮𝐞𝐫𝐲)
Reasoning-based analysis (breakdown in comments)
▸ o3-mini: ~8.5c
▸ Gemini: ~11c
▸ R1: ~13.5c
▸ Sonnet: ~20.5c
𝐕𝐚𝐫𝐢𝐚𝐧𝐜𝐞: up to ±50% on the same query. Models are evolving and variances are coming down.
𝐋𝐚𝐭𝐞𝐧𝐜𝐢𝐞𝐬: mostly 1-4 mins, sometimes 10+ mins. Time of day matters: peak vs. off-peak. Gemini is the fastest.
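To make the ±50% variance concrete, here is a small sketch that turns the listed cost-per-query figures into low/high bands. The figures are the approximate ones from the list above (in cents); the helper name `cost_band` is just illustrative.

```python
from typing import Dict, Tuple

# Approximate cost per query in cents, as listed above.
CPQ_CENTS: Dict[str, float] = {
    "o3-mini": 8.5,
    "Gemini": 11.0,
    "R1": 13.5,
    "Sonnet": 20.5,
}


def cost_band(cpq: float, variance: float = 0.5) -> Tuple[float, float]:
    """Return the (low, high) cost range for a +/- variance around cpq."""
    return (round(cpq * (1 - variance), 2), round(cpq * (1 + variance), 2))


# Low/high band per model at the +/-50% variance quoted for the same query.
bands = {model: cost_band(cpq) for model, cpq in CPQ_CENTS.items()}
```

So a Sonnet query listed at ~20.5c could land anywhere from roughly 10c to 31c, which is why checking your own numbers matters.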

𝐂𝐏𝐐 - Regular Queries
▸ 4o-mini: ~0.10c
▸ 4o: ~1.5c
4o-mini is the workhorse; 4o when it stumbles. Gemini may take over
𝐕𝐚𝐫𝐢𝐚𝐧𝐜𝐞: ±20%, stable in live deployments
𝐋𝐚𝐭𝐞𝐧𝐜𝐢𝐞𝐬: 15 sec to 3 min depending on query complexity and time of day.
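The "4o-mini first, 4o when it stumbles" pattern is essentially an ordered fallback. A minimal sketch, assuming each model is wrapped in a plain callable that returns an answer string or `None`: the names `answer_with_fallback`, `fake_mini` and `fake_4o` are hypothetical stand-ins, not the app's actual code or any vendor API.

```python
from typing import Callable, Optional, Sequence, Tuple

ModelFn = Callable[[str], Optional[str]]


def answer_with_fallback(
    query: str, models: Sequence[Tuple[str, ModelFn]]
) -> Tuple[str, str]:
    """Try each (name, call) pair in order; return the first usable answer."""
    for name, call in models:
        try:
            answer = call(query)
        except Exception:
            continue  # treat an API error as a stumble and try the next model
        if answer:
            return name, answer
    raise RuntimeError("all models failed")


# Hypothetical stand-ins for real API calls.
def fake_mini(query: str) -> Optional[str]:
    return None if "hard" in query else "mini: ok"


def fake_4o(query: str) -> Optional[str]:
    return "4o: ok"


CHAIN = [("4o-mini", fake_mini), ("4o", fake_4o)]
```

With this chain, an easy query is answered by the cheaper model and only the ones it cannot handle pay the higher per-query cost.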

𝐑𝐞𝐥𝐢𝐚𝐛𝐢𝐥𝐢𝐭𝐲
▸ o3-mini & Sonnet: high reliability, negligible API failures
▸ Gemini: high nowadays, but I would like to watch it for some more time
▸ R1: low, with API failures & latency spikes. Improving, so likely temporary. Alternate hosting options available.

𝐃𝐞𝐦𝐨𝐞𝐝 𝐄𝐱𝐚𝐦𝐩𝐥𝐞
▸ Scoring & Ranking of Indian Banks - Credit Card Segment
▸ Data Mart & Profile Summary for 1M customers + 10M transactions

𝐒𝐐𝐋 𝐄𝐫𝐫𝐨𝐫𝐬 / 𝐀𝐏𝐈 𝐅𝐚𝐢𝐥𝐮𝐫𝐞𝐬 / 𝐃𝐚𝐭𝐚 𝐕𝐚𝐥𝐢𝐝𝐚𝐭𝐢𝐨𝐧𝐬?
See detailed video guide for live debugging / error catching (link in comments)

𝐒𝐨𝐮𝐫𝐜𝐞 𝐂𝐨𝐝𝐞𝐬, 𝐀𝐫𝐜𝐡𝐢𝐭𝐞𝐜𝐭𝐮𝐫𝐞 & 𝐁𝐮𝐢𝐥𝐝 𝐆𝐮𝐢𝐝𝐞
5 repos + 7 Flowise schemas + video build guide. Links in comments

𝐂𝐚𝐯𝐞𝐚𝐭𝐬 & 𝐀𝐬𝐬𝐮𝐦𝐩𝐭𝐢𝐨𝐧𝐬
Lots of them, plus tips. Check comments.

𝐂𝐚𝐯𝐞𝐚𝐭𝐬, 𝐀𝐬𝐬𝐮𝐦𝐩𝐭𝐢𝐨𝐧𝐬 & 𝐓𝐢𝐩𝐬
▸ Reasoning estimates: based on ~100 queries across 4 reasoning agents (1-3 iterations per request; 1 iteration = 1 query).
▸ Regular queries: based on months of live usage (API calls, automation, web scraping, NL-to-SQL via custom UIs).
▸ Use case-specific: estimates apply to the queries demoed in the video.
▸ High variability for the same query: expect it to come down as LLMs stabilize.
▸ Critical to estimate costs for your own use case.
▸ Check actual billing: pen-and-paper token math is unreliable.
▸ Time-based variability: for example, R1 costs were very high a few weeks ago but are now more reasonable, even though rack rate pricing is unchanged. Be mindful.
▸ Prototype app: this is a live working prototype.

𝐂𝐏𝐐 𝐁𝐫𝐞𝐚𝐤𝐝𝐨𝐰𝐧 - 𝐫𝐞𝐚𝐬𝐨𝐧𝐢𝐧𝐠 & 𝐚𝐧𝐚𝐥𝐲𝐬𝐢𝐬
▸ o3-mini: ~8.5c (reasoning + execution)
▸ gemini-2.0-flash: ~11c (reasoning = free tier, execution = 11c). Paid tier is cheaper than gpt-4o-mini (~0.10c additional).
▸ r1: ~13.5c (reasoning = 4c, execution = 9.5c)
▸ sonnet-3.7: ~20.5c (planning = 11.5c, execution = 9c)
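As a quick sanity check, the component costs above add up to the headline figures, and they show how much of each query's cost comes from the reasoning step versus execution. A small sketch using only the numbers in this breakdown (o3-mini is left out because no split is given for it; the Sonnet "planning" cost is treated as its reasoning component):

```python
# Cost components in cents, taken from the breakdown above.
BREAKDOWN_CENTS = {
    "gemini-2.0-flash": {"reasoning": 0.0, "execution": 11.0},  # reasoning on free tier
    "r1": {"reasoning": 4.0, "execution": 9.5},
    "sonnet-3.7": {"reasoning": 11.5, "execution": 9.0},  # "planning" = reasoning step
}

# Total cost per query = reasoning + execution.
totals = {model: sum(parts.values()) for model, parts in BREAKDOWN_CENTS.items()}

# Share of each query's cost that goes to the reasoning step.
reasoning_share = {
    model: round(parts["reasoning"] / totals[model], 2)
    for model, parts in BREAKDOWN_CENTS.items()
}
```

On these numbers the execution step costs roughly the same 9-11c whichever reasoning model is used, so the gap between models comes almost entirely from the reasoning step.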

𝐂𝐏𝐐 - 𝐫𝐞𝐠𝐮𝐥𝐚𝐫 𝐪𝐮𝐞𝐫𝐢𝐞𝐬
▸ gpt-4o-mini: ~0.10c (my workhorse: solid performance, solid pricing)
▸ gpt-4o: ~1.5c (I shift to gpt-4o if gpt-4o-mini stumbles)
▸ sonnet: with 3.5, I used to get ~2.5c. With 3.7, costs are now much higher despite the same token pricing, likely a temporary issue.

𝐖𝐨𝐫𝐤𝐡𝐨𝐫𝐬𝐞 𝐋𝐋𝐌: 4o-mini by default; 4o when it stumbles. Flash2 may take over: better performance, quality, and cost, with improved reliability over last year’s Gemini.

𝐃𝐞𝐭𝐚𝐢𝐥𝐞𝐝 𝐕𝐢𝐝𝐞𝐨 𝐆𝐮𝐢𝐝𝐞
Demo, build guide, architecture, API call flows, error catching, repo walkthroughs and more.

𝐆𝐢𝐭𝐇𝐮𝐛 𝐑𝐞𝐩𝐨𝐬 & 𝐒𝐜𝐡𝐞𝐦𝐚𝐬
Main Repo: with step-by-step build guide & links to other repos
Agent Schemas (Flowise): in the docs folder in the Main Repo