Guido García · @palmerabollo · 2016

Realtime processing of data streams

  • Stream: unbounded sequence of tuples.
  • Topology: network of spouts and bolts.
    • Spout: source stream.
    • Bolt: stream processing.

Architecture

Nimbus node (master): code distribution + worker allocation

Zookeeper nodes: coordination

Supervisor nodes: start and stop workers

Example 1: Basic uppercase

Example 2: Word Count

# zookeeper
brew install zookeeper
zkServer start

# storm
brew install storm
storm nimbus
storm supervisor
storm ui

Local env (OSX)

Apache Storm 101

By Guido García

Apache Storm 101

Basic introduction to Apache Storm

  • 1,094