Apache Spark
Open-source data analytics cluster computing framework / From Wikipedia, the free encyclopedia
Dear Wikiwand AI, let's keep it short by simply answering these key questions:
Can you list the top facts and stats about Apache Spark?
Summarize this article for a 10 year old
SHOW ALL QUESTIONS
Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance. Originally developed at the University of California, Berkeley's AMPLab, the Spark codebase was later donated to the Apache Software Foundation, which has maintained it since.
Quick Facts Original author(s), Developer(s) ...
Original author(s) | Matei Zaharia |
---|---|
Developer(s) | Apache Spark |
Initial release | May 26, 2014; 9 years ago (2014-05-26) |
Stable release | |
Repository | Spark Repository |
Written in | Scala[1] |
Operating system | Microsoft Windows, macOS, Linux |
Available in | Scala, Java, SQL, Python, R, C#, F# |
Type | Data analytics, machine learning algorithms |
License | Apache License 2.0 |
Website | spark |
Close