Skip to main content
Skip to main content
Edit this page

chDB

chDB is a fast in-process SQL OLAP Engine powered by ClickHouse v25.8.2.1. You can use it when you want to get the power of ClickHouse in a programming language without needing to connect to a ClickHouse server.

Key features

  • In-process SQL OLAP Engine - Powered by ClickHouse, no need to install ClickHouse server
  • Multiple data formats - Input & Output support for Parquet, CSV, JSON, Arrow, ORC and 70+ more formats
  • Minimized data copy - From C++ to Python with python memoryview
  • Rich Python Ecosystem Integration - Native support for Pandas, Arrow, DB API 2.0, seamlessly fits into existing data science workflows
  • Zero dependencies - No need for external database installations
  • DataStore API - Pandas-compatible API with SQL optimization, supporting 630+ methods

DataStore: Pandas-Compatible API

NEW! DataStore provides a pandas-compatible API that combines familiar pandas syntax with ClickHouse performance.

One-Line Migration

# Just change your import - your pandas code works unchanged
- import pandas as pd
+ from chdb import datastore as pd

df = pd.read_csv("data.csv")
result = df[df['age'] > 25].groupby('city')['salary'].mean()

Performance Highlights

OperationpandasDataStoreSpeedup
GroupBy count347ms17ms19.93x
Complex pipeline2,047ms380ms5.39x
Filter+Sort+Head1,537ms350ms4.40x

Benchmarks on 10M rows

DataStore Features

  • 630+ API methods - 209 pandas DataFrame methods, 185+ accessor methods
  • Lazy evaluation - Operations compile to optimized SQL
  • SQL pushdown - Filters and aggregations run at the data source
  • Universal data sources - Read from files, S3, databases, data lakes

Learn more: DataStore Documentation

What languages are supported by chDB?

chDB has the following language bindings:

How do I get started?

For pandas Users

Start with the DataStore API for a familiar pandas experience with ClickHouse performance:

DataStore API Reference

SQL API Guides

An introductory video

Watch a brief introduction to chDB and learn how it brings ClickHouse's power to your Python environment:

Performance benchmarks

chDB delivers exceptional performance across different scenarios:

About chDB

License

chDB is available under the Apache License, Version 2.0. See LICENSE for more information.