Installation¶
Requirements¶
- Python 3.11 or higher
- pip, uv, or another Python package manager
Basic Installation¶
This installs the core package with HTTP-based fetching (httpx) and HTML/JSON extraction.
Optional Dependencies¶
Databrew has optional extras for additional functionality:
Browser Support¶
For JavaScript-heavy websites that require browser rendering:
This installs pydoll, a Python browser automation library that uses Chrome DevTools Protocol.
Analytics Support¶
For faster exports with large datasets (7-10x speedup):
This installs DuckDB, which enables fast JSONL, JSON, and Parquet exports by reading SQLite directly.
All Extras¶
Install everything:
Development Installation¶
To install from source for development:
git clone https://github.com/datakomari/databrew.git
cd databrew
# Using uv (recommended)
uv sync --all-extras
# Using pip
pip install -e ".[browser,analytics]"
Verifying Installation¶
Check that databrew is installed correctly:
You should see the list of available commands.
Next Steps¶
- Quick Start - Create and run your first config
- Core Concepts - Learn how databrew works