Databend, built in Rust, is an open-source cloud data warehouse that serves as a cost-effective alternative to Snowflake. With its focus on fast query execution and data ingestion, it's designed for complex analysis of the world's largest datasets.
-
Full ControlβCloud + On-Prem: Deploy on cloud or on-prem to suit your needs and maintain full control.
-
Blazing-Fast Performance: Built with Rust for efficient, high-speed vectorized query execution. π ClickBench
-
Cost-Effective: Scalable architecture that optimizes performance while reducing storage and compute costs. π TPC-H
-
AI-Enhanced Analytics: Leverage built-in AI Functions for smarter data insights.
-
Simplified ETL: Direct data ingestion without the need for external ETL tools, streamlining the process. π Data Loading
-
Real-Time Data Updates: Keep your analytics up-to-date with real-time incremental data updates. π Stream
-
Advanced Indexing: Boost query performance with Virtual Column, Aggregating Index, and Full-Text Index.
-
ACID Compliance + Version Control: Ensure reliable transactions with full ACID compliance and Git-like versioning.
-
Schema Flexibility: Effortlessly handle semi-structured data with the flexible VARIANT data type.
-
Community-Driven Growth: Open-source and continuously evolving with contributions from a global community.
The fastest way to try Databend, Databend Cloud
Prepare the image (once) from Docker Hub (this will download about 170 MB data):
docker pull datafuselabs/databend
To run Databend quickly:
docker run --net=host datafuselabs/databend
Connecting to Databend
Data Import and Export
- How to load Parquet file into a table
- How to export a table to Parquet file
- How to load CSV file into a table
- How to export a table to CSV file
- How to load TSV file into a table
- How to export a table to TSV file
- How to load NDJSON file into a table
- How to export a table to NDJSON file
- How to load ORC file into a table
Loading Data From Other Databases
Querying Semi-structured Data
Visualize Tools with Databend
Managing Users
Managing Databases
Managing Tables
Managing Views
AI Functions
Data Management
Accessing Data Lake
Performance
Databend thrives on community contributions! Whether it's through ideas, code, or documentation, every effort helps in enhancing our project. As a token of our appreciation, once your code is merged, your name will be eternally preserved in the system.contributors table.
Here are some resources to help you get started:
For guidance on using Databend, we recommend starting with the official documentation. If you need further assistance, explore the following community channels:
- Slack (For live discussion with the Community)
- GitHub (Feature/Bug reports, Contributions)
- Twitter (Get the news fast)
- I'm feeling lucky (Pick up a good first issue now!)
Stay updated with Databend's development journey. Here are our roadmap milestones:
Databend is released under a combination of two licenses: the Apache License 2.0 and the Elastic License 2.0.
When contributing to Databend, you can find the relevant license header in each file.
For more information, see the LICENSE file and Licensing FAQs.
-
Inspiration: Databend's design draws inspiration from industry leaders ClickHouse and Snowflake.
-
Computing Model: Our computing foundation is built upon apache arrow.
-
Documentation Hosting: The Databend documentation website proudly runs on Vercel.