Top Qs
Timeline
Chat
Perspective

DuckDB

Open source column-oriented RDBMS From Wikipedia, the free encyclopedia

DuckDB
Remove ads

DuckDB is an open-source column-oriented Relational Database Management System (RDBMS).[1] It is designed to provide high performance on complex queries against large databases in embedded configuration,[2] such as combining tables with hundreds of columns and billions of rows. Unlike other embedded databases (for example, SQLite) DuckDB is not focusing on transactional (OLTP) applications and instead is specialized for online analytical processing (OLAP) workloads.[3] The project has over 6 million downloads per month.[4][5][6]

Quick facts Developer, Stable release ...
Remove ads

History

DuckDB was originally developed by Mark Raasveldt and Hannes Mühleisen [d] at the Centrum Wiskunde & Informatica (CWI) in the Netherlands.[2] The project co-founders designed DuckDB to address the need for an in-process OLAP database solution.[7] DuckDB was first released in 2019.[8] DuckDB version 1.0.0 was released on June 3, 2024, under the codename SnowDuck.[9]

Features

DuckDB uses a vectorized query processing engine.[3] DuckDB is special amongst database management systems because it does not have any external dependencies and can be built with just a C++11 compiler.[10] DuckDB also deviates from the traditional client–server model by running inside a host process (it has bindings, for example, for a Python interpreter with the ability to directly place data into NumPy arrays[2]). DuckDB's SQL parser is derived from the pg_query library developed by Lukas Fittl, which is itself derived from PostgreSQL's SQL parser that has been stripped down as much as possible.[3][11] DuckDB uses a single-file storage format to store data on disk, designed to support efficient scans and bulk updates, appends and deletes.[12] DuckDB is also compiled to WebAssembly using emscripten which enables DuckDB to run SQL in browser-based analytics tools.[13][14]

Remove ads

Comparison

DuckDB in its OLAP niche does not compete with the traditional DBMS like MSSQL, PostgreSQL and Oracle database. While using SQL for queries, DuckDB targets serverless applications and provides extremely fast responses using either Apache Parquet files or its own format for storage. These attributes make it a popular choice for large dataset analysis in interactive mode.[15]

Commercial use

DuckDB is used at Facebook, Google, and Airbnb.[16]

DuckDB co-author Mühleisen also runs a support and consultancy firm for the software, DuckDB Labs.[8] The company has chosen not to take venture capital funding, stating "We feel investment would force the project direction towards monetization, and we would much prefer keeping DuckDB open and available for as many people as possible".[6] Another company, MotherDuck, has received $100 million funding for its data platform based on DuckDB, with investors including Andreessen Horowitz.[17]

Remove ads

DuckDB Foundation

The independent non-profit DuckDB Foundation safeguards the long-term maintenance and development of DuckDB. The foundation holds much of the intellectual property of the project and is funded by charitable donations.[18] The DuckDB Foundation's statutes ensure DuckDB remains open-source under the MIT license in perpetuity.[19]

Language support

In addition to the native C and C++ APIs, DuckDB supports a range of programming languages.

More information Language, Notes ...
Remove ads

Extensions

DuckDB's architecture supports extensions, allowing additional functionality to be added dynamically.[33] Many popular extensions are maintained by the core DuckDB team, and there are over 30 community extensions maintained by third parties.[34][35][36]

References

Further reading

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.

Remove ads