Natural‑language queries for your data: connect SQL databases and DataFrames, ask questions in plain English, and get back tables, plots, and explanations. Databao runs agents on top of your DataFrames and database connections, and works with both cloud and local LLMs.
- Ask questions like “list all German shows” or “plot revenue by month”.
- Works with SQLAlchemy engines and in‑memory DataFrames.
- Built‑in visualization via a Vega‑Lite chat visualizer.
- Pluggable LLMs: OpenAI/Anthropic or local models via Ollama or any OpenAI‑compatible server.
Using pip:
```bash
pip install databao
```

Connect to your database with a SQLAlchemy engine:

```python
import os
from sqlalchemy import create_engine
user = os.environ.get("DATABASE_USER")
password = os.environ.get("DATABASE_PASSWORD")
host = os.environ.get("DATABASE_HOST")
database = os.environ.get("DATABASE_NAME")
engine = create_engine(
    f"postgresql://{user}:{password}@{host}/{database}"
)
```

Create an agent and choose an LLM:

```python
import databao
from databao import LLMConfig
# Option A: local model. Install and run a compatible local LLM;
# for the list of compatible models, see "Local models" below.
# llm_config = LLMConfig(name="ollama:gpt-oss:20b", temperature=0)

# Option B: cloud model (requires an API key, e.g. OPENAI_API_KEY)
llm_config = LLMConfig(name="gpt-4o-mini", temperature=0)
agent = databao.new_agent(name="demo", llm_config=llm_config)
# Add your database to the agent
agent.add_db(engine)
```

Ask questions in a conversational thread:

```python
# Start a conversational thread
thread = agent.thread()
# Ask a question and get a DataFrame
df = thread.ask("list all german shows").df()
print(df.head())
# Get a textual answer
print(thread.text())
# Generate a visualization (Vega-Lite under the hood)
plot = thread.plot("bar chart of shows by country")
print(plot.code)  # access the generated plot code if needed
```
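Since plots are Vega‑Lite under the hood, one way to reuse a generated chart is to save its spec. A minimal sketch, assuming `plot.code` holds the spec as text (its exact type isn't documented here, so check the databao docs); the filename is just an illustration:

```python
# Assumption: plot.code is the generated Vega-Lite spec as a string.
# Saving it lets you render the chart later with any Vega-Lite viewer.
with open("shows_by_country.vl.json", "w") as f:
    f.write(plot.code)
```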
Specify your API keys in environment variables:

- `OPENAI_API_KEY` if using OpenAI models
- `ANTHROPIC_API_KEY` if using Anthropic models
- Optional for local/OpenAI‑compatible servers: `OPENAI_BASE_URL` (aka `api_base_url` in code) and `OLLAMA_HOST` (e.g., `127.0.0.1:11434`)
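A minimal sketch of setting these in-process with placeholder values; in practice you would export them in your shell or keep them in a `.env` file:

```python
import os

# Cloud providers: set whichever key matches the model you configure.
os.environ["OPENAI_API_KEY"] = "sk-..."  # placeholder value
# os.environ["ANTHROPIC_API_KEY"] = "..."

# Optional, for local/OpenAI-compatible servers (assumed hosts/ports):
# os.environ["OPENAI_BASE_URL"] = "http://127.0.0.1:1234/v1"
# os.environ["OLLAMA_HOST"] = "127.0.0.1:11434"
```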
Databao works with local LLMs via either Ollama or any OpenAI‑compatible server (LM Studio, llama.cpp, etc.).
- Install Ollama for your OS and make sure it is running.
- Use an `LLMConfig` with a `name` of the form `"ollama:<model_name>"`, for example `LLMConfig(name="ollama:gpt-oss:20b", temperature=0)`.

The model is downloaded automatically if it doesn't already exist. Alternatively, run `ollama pull <model_name>` to download it manually.
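Putting it together, a minimal sketch of a fully local agent, reusing the model name from the example above:

```python
import databao
from databao import LLMConfig

# Assumes Ollama is running on its default host (127.0.0.1:11434);
# databao pulls the model automatically if it isn't available yet.
llm_config = LLMConfig(name="ollama:gpt-oss:20b", temperature=0)
agent = databao.new_agent(name="local-demo", llm_config=llm_config)
```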
You can use any OpenAI‑compatible server by setting `api_base_url` in the `LLMConfig`. For a full config example, see `examples/configs/qwen3-8b-oai.yaml`; a code sketch follows the list below.
Examples of compatible servers:
- LM Studio (macOS‑friendly; supports the OpenAI Responses API)
- Ollama (`OLLAMA_HOST=127.0.0.1:8080 ollama serve`)
- llama.cpp (`llama-server`)
- vLLM
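A hedged sketch of pointing `LLMConfig` at such a server; the model name and URL are assumptions that depend on what your server exposes, so adjust both (the YAML config above shows a concrete setup):

```python
from databao import LLMConfig

llm_config = LLMConfig(
    name="qwen3-8b",                          # assumed: model name served by your server
    api_base_url="http://127.0.0.1:1234/v1",  # assumed: e.g. LM Studio's default endpoint
    temperature=0,
)
```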
Installation using uv (for development):
Clone this repo and run:
```bash
# Install dependencies for the library
uv sync

# Optionally include example extras (notebooks, dotenv)
uv sync --extra examples
```

We recommend using the same version of uv as the one used in GitHub Actions:

```bash
uv self update 0.9.5
```

Using Makefile targets:

```bash
# Lint and static checks (pre-commit on all files)
make check
# Run tests (loads .env if present)
make test
```

Using uv directly:

```bash
uv run pytest -v
uv run pre-commit run --all-files
```

- The test suite uses `pytest`.
- Some tests are marked `@pytest.mark.apikey` and require provider API keys.
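For illustration, a hedged sketch of how an API‑key‑gated test might look; the test name and body are hypothetical, and the real tests live in this repo's test suite:

```python
import pytest

@pytest.mark.apikey  # deselected by the -m "not apikey" filter shown below
def test_cloud_provider_answer():
    """Hypothetical test: would call a cloud LLM and need a real API key."""
    ...
```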
Run all tests:
```bash
uv run pytest -v
```

Run only tests that do NOT require API keys:

```bash
uv run pytest -v -m "not apikey"
```