Apache Parquet
Column-oriented data storage format / From Wikipedia, the free encyclopedia
Dear Wikiwand AI, let's keep it short by simply answering these key questions:
Can you list the top facts and stats about Apache Parquet?
Summarize this article for a 10 year old
SHOW ALL QUESTIONS
Apache Parquet is a free and open-source column-oriented data storage format in the Apache Hadoop ecosystem. It is similar to RCFile and ORC, the other columnar-storage file formats in Hadoop, and is compatible with most of the data processing frameworks around Hadoop. It provides efficient data compression and encoding schemes with enhanced performance to handle complex data in bulk.
Quick Facts Initial release, Stable release ...
Initial release | 13 March 2013; 11 years ago (2013-03-13) |
---|---|
Stable release | |
Repository | |
Written in | Java (reference implementation)[2] |
Operating system | Cross-platform |
Type | Column-oriented DBMS |
License | Apache License 2.0 |
Website | parquet |
Close