Real-Time Big Data with Storm, Kafka and GigaSpaces.

Building own Real-Time Google Analytics

Oleksiy Dyagilev

Kafka, not Franz

A high-throughput distributed messaging system

fast, O(1) persistence
scalable
durable
distributed by design
originally developed by LinkedIn
written in Scala

Apache Storm

Free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing

scalable
fault-tolerant
guarantees data will be processed
originally written by Nathan Marz in Java/Clojure, then adopted by Twitter
now in Apache incubator
used by dozens of companies
active community

Real-Time Big Data with Storm, Kafka and GigaSpaces. Building own Real-Time Google Analytics Oleksiy Dyagilev

storm

By Oleksiy Dyagilev

storm

11 years ago
1,747

Oleksiy Dyagilev