Installation¶
Requirements¶
- Python 3.11 or higher
- pip, uv, or another Python package manager
Basic Installation¶
This installs the core package with HTTP-based fetching (httpx), HTML/JSON extraction, and Parquet storage (via DuckDB).
Optional Dependencies¶
Browser Support¶
For JavaScript-heavy websites that require browser rendering:
This installs pydoll, a Python browser automation library that uses Chrome DevTools Protocol.
Development Installation¶
To install from source for development:
git clone https://github.com/datakomari/databrew.git
cd databrew
# Using uv (recommended)
uv sync --all-extras
# Using pip
pip install -e ".[browser]"
Verifying Installation¶
Check that databrew is installed correctly:
You should see the list of available commands.
Next Steps¶
- Quick Start - Create and run your first config
- Core Concepts - Learn how databrew works