Top Qs
Timeline
Chat
Perspective
Apache Samza
Open-source distributed stream processing From Wikipedia, the free encyclopedia
Remove ads
Apache Samza is an open-source, near-realtime, asynchronous computational framework for stream processing developed by the Apache Software Foundation in Scala and Java. It has been developed in conjunction with Apache Kafka. Both were originally developed by LinkedIn.[2]
Remove ads
Overview
Samza allows users to build stateful applications that process data in real-time from multiple sources including Apache Kafka.
Samza provides fault tolerance, isolation and stateful processing. Unlike batch systems such as Apache Hadoop or Apache Spark, it provides continuous computation and output, which result in sub-second[3] response times.
There are many players in the field of real-time stream processing and Samza is one of the mature products.[4][5][6] It was added to Apache in 2013.[7]
Samza is used by multiple companies.[8] The biggest installation is in LinkedIn.
Remove ads
See also
References
External links
Wikiwand - on
Seamless Wikipedia browsing. On steroids.
Remove ads