AutoAnalytics

AutoAnalytics is a research project on AI-native analytics design: not only translating questions into SQL, but helping determine which questions are meaningful and answerable given the data that actually exists.

Motivation

In many organizations, analytics bottlenecks happen before SQL:

teams do not know what the database can reliably answer,
KPI ideation is disconnected from real schema constraints,
analysts spend time asking impossible or low-value questions.

AutoAnalytics focuses on this earlier and harder layer:

Given role context + available data, what are the highest-value questions we should ask?

This requires explicit understanding of data availability, schema semantics, granularity, joins, and blind spots.

The Core Research Question

Traditional BI workflows assume the question is already well-formed.
AutoAnalytics studies the upstream problem:

How can a system infer the question space from raw enterprise data?
How can it map that question space to role-specific KPIs?
How can it ensure generated queries are executable and decision-useful?

Differentiation From Text2SQL

Text2SQL solves:
natural language question -> SQL query

AutoAnalytics solves a broader loop:

data understanding -> question discovery -> KPI design -> SQL generation -> validation -> insight + visualization

That means AutoAnalytics is not just a query translator; it is a question discovery and analytics orchestration framework.

Text2SQL vs AutoAnalytics

Conceptual Diagram (Hero)

The hero image above is intentionally used as the project diagram:
it communicates the transition from data chaos to question clarity to decision confidence, which is the core distinction between AutoAnalytics and classic Text2SQL systems.

Conceptual System Flow

ingest role and organization context,
inspect source schema and data samples,
construct semantic schema understanding,
generate tasks and KPI candidates aligned with role intent,
synthesize SQL implementations,
validate and repair SQL when needed,
produce insight narratives and visual outputs.

Why This Matters

If successful, this approach can reduce time-to-insight by shifting effort from manual dashboard bootstrapping to automated, data-grounded KPI and question generation. The goal is to help teams move from “What can we query?” to “What should we learn next?”

Research Directions

Question-space quality: are proposed questions relevant, novel, and actionable?
Feasibility alignment: are generated KPIs computable from available data?
SQL reliability: execution pass rate before/after repair loops.
Insight faithfulness: are narratives supported by computed results?
Human utility: does the system improve analyst productivity and decision speed?

Repository Layout

AutoAnalytics/
├─ src/                     # Core modules
├─ notebooks/               # Experimental/orchestration notebooks
├─ assets/styles/           # UI styles
├─ docs/figures/            # Gemini-generated conceptual visuals
├─ scripts/                 # Utility scripts
└─ README.md

Generate Figures (Gemini)

# PowerShell
$env:GEMINI_API_KEY="YOUR_KEY"
python scripts/generate_gemini_figures.py

Outputs:

docs/figures/hero_top.png
docs/figures/text2sql_vs_autoanalytics.png

All images are conceptual communication assets intended for research presentation.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.vscode		.vscode
assets/styles		assets/styles
docs/figures		docs/figures
notebooks		notebooks
scripts		scripts
src		src
.gitignore		.gitignore
.sesskey		.sesskey
README.md		README.md
pythonanywere.token		pythonanywere.token

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AutoAnalytics

Motivation

The Core Research Question

Differentiation From Text2SQL

Text2SQL vs AutoAnalytics

Conceptual Diagram (Hero)

Conceptual System Flow

Why This Matters

Research Directions

Repository Layout

Generate Figures (Gemini)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AutoAnalytics

Motivation

The Core Research Question

Differentiation From Text2SQL

Text2SQL vs AutoAnalytics

Conceptual Diagram (Hero)

Conceptual System Flow

Why This Matters

Research Directions

Repository Layout

Generate Figures (Gemini)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages