Try Text-to-SQL on Real Data - Multi-Million Rows & GB+ Sizes

Published: December 5, 2025

Two Clicks. Zero Setup. No Database, No Server, No Login needed. With 9 LLM options, Python & SQL to torture your data till it confesses.

App live here: app.tigzig.com/analyzer

Related Post: Large File Upload for Database AI Text-to-SQL Apps: A Practical Guide

What's New

I've enhanced the sample datasets in my Database AI app (DATS-4). Previously, the test files were tiny, 50-60 rows. Now there's a full range: 64 rows to 11.8 million rows. File sizes from 14 KB to 1.6 GB.

For the 1.6 GB file, setup takes around 9 minutes. Fully automated: database creation, file upload, agent ready.

The Datasets

Two Clicks to Analytics-Ready

Setup is two clicks Go to the app Tigzig Analyzer

  1. Go to Datasets. Pick one.
  2. Select 'Use Temporary Database'

That's it. The app creates a temporary database, uploads your data, extracts the schema, and connects it to the AI agent. You're ready to query. For small files, setup takes 20-30 seconds. For the largest file, 2-3 minutes. Backend is neon.com - which provisions a Postgres database in less than a second via an API call.

How to Explore

Once setup completes, you're in the chat interface. Use the pre-built prompts. Each dataset has a sample prompt. Hit the copy icon, paste, run. These are structured queries: ranking systems, derived metrics, comparisons. The more specific the better. Avoid generic 'analyze this'. AI can't read your mind yet.

Or explore data with:

Check Agent Reasoning

Click to see the SQL the agent generated. Useful for validation and learning.

LLM Options

9 models available for advanced analysis. Choose based on quality needs and cost tolerance.

Model Type Quality Cost
Gemini 2.0 Flash Best Value 75 Lowest
Qwen3 Max Good 80 Low
Gemini 2.5 Flash Good 85 Low
KIMI K2 Thinking High Variance 85 High
Deepseek-R1-0158 Great Quality 90 Med
GPT-4.1 Great Quality 90 Med
Gemini 3 Pro Good 95 High
GPT-5.1 Top Quality 100 High
Claude 4.5 Sonnet Topmost Quality 115 High

For detailed cost and quality comparisons based on live testing, see Gemini 3 Pro Added to Database AI Suite. Tested Against Claude Sonnet 4.5 and GPT-5.1

Summary: Claude still leads. GPT-5.1 is solid. Gemini 3 Pro lands third.

What Else Can the App Do

The sample dataset feature is just one entry point. DATS-4 is a full database AI suite. Here's what's available:

Database Connections

Two Agents

File Uploads

Working Tables & Export

Table Viewer

PDF Output

Python Charts & Stats

Logs

Technical Note on File Uploads

  1. Download: App downloads compressed file from GitHub repo.
  2. Compression (frontend): Uncompressed CSV or TXT uploads are compressed using the browser CompressionStream API without loading full file into memory.
  3. Temporary database provisioning: A temporary Postgres database is created via Neon with automatic role setup and unique credentials.
  4. File upload to backend: Compressed file is sent to the FastAPI SQL connector.
  5. Memory efficient file handling: Backend streams file to disk in 32MB chunks to prevent RAM bloat.
  6. Decompression: Backend decompresses .gz files when needed, streaming to disk in 32MB chunks.
  7. AI powered schema detection: Backend samples first 5 lines, detects delimiter, and sends data to OpenAI for schema inference.
  8. Table creation: Empty table is created using the detected schema.
  9. Smart upload path selection: Postgres uses in memory COPY for uncompressed files under 100MB and streamed COPY from temp file for larger or compressed files. MySQL always streams in 100K row batches using Polars or Pandas with executemany inserts.
  10. Agent handoff: After upload, schema plus credentials and sample rows are handed to the Database Agent.
  11. Confirmation: App confirms environment readiness and the Agent confirms schema receipt.

Open Source

All open source. Docs and source code accessible from the app (hit Docs in top nav). Guides and posts at tigzig.com. The app has 7 major components, each with its own GitHub repo:

Full build guide and architecture docs available in the Docs section.

Visual Guide: GB File Upload Process

10 slides explaining the upload architecture and data flow.

GB File Upload Process

Browse the slides or download the PDF

Slide preview
slides
Click to Browse
Download PDF
🔗
Blog Migration Notice: Some links or images in earlier posts may be broken. View the original post on the old blog site.