Concept of Cluster Architecture in Apache Storm

Last updated on May 30 2022
Lalit Kolgaonkar

Apache Storm – Cluster Architecture

One of the main highlights of Apache Storm is that it is a fast, fault-tolerant, distributed application with no “Single Point of Failure” (SPOF). Apache Storm can be installed on as many systems as needed to increase the capacity of the application.

Let’s look at how an Apache Storm cluster is designed and at its internal architecture. The following diagram depicts the cluster design.

Figure: Apache Storm cluster architecture

Apache Storm has two types of nodes: Nimbus (master node) and Supervisor (worker node). Nimbus is the central component of Apache Storm. The main job of Nimbus is to run the Storm topology. Nimbus analyzes the topology and gathers the tasks to be executed. Then, it distributes the tasks to an available supervisor.
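The distribution step described above can be pictured with a small toy model. The sketch below is only an illustration, assuming a simple round-robin policy; it is not Storm's actual scheduler, and the function name `assign_tasks` and the task and supervisor names are hypothetical.

```python
from collections import defaultdict
from itertools import cycle


def assign_tasks(tasks, supervisors):
    """Spread tasks across supervisors in round-robin order (toy scheduler)."""
    if not supervisors:
        raise ValueError("at least one supervisor is required")
    assignment = defaultdict(list)
    # cycle() repeats the supervisor list so every task gets a target node.
    for task, supervisor in zip(tasks, cycle(supervisors)):
        assignment[supervisor].append(task)
    return dict(assignment)


# Hypothetical task and supervisor names, for illustration only.
tasks = ["spout-0", "bolt-0", "bolt-1", "bolt-2", "bolt-3"]
supervisors = ["supervisor-a", "supervisor-b"]
plan = assign_tasks(tasks, supervisors)
print(plan)
```

Round-robin is used here only because it is the simplest policy that spreads the load evenly across nodes.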

A supervisor has one or more worker processes and delegates tasks to them. Each worker process spawns as many executors as required to run the tasks. Apache Storm uses an internal distributed messaging system for communication between Nimbus and the supervisors.

| Components | Description |
| --- | --- |
| Nimbus | Nimbus is the master node of a Storm cluster. All other nodes in the cluster are called worker nodes. The master node is responsible for distributing code among the worker nodes, assigning tasks to them, and monitoring for failures. |
| Supervisor | The nodes that follow the instructions given by Nimbus are called supervisors. A supervisor has multiple worker processes, and it governs them to complete the tasks assigned by Nimbus. |
| Worker process | A worker process executes tasks related to a specific topology. A worker process does not run a task by itself; instead, it creates executors and asks them to perform a particular task. A worker process has multiple executors. |
| Executor | An executor is a single thread spawned by a worker process. An executor runs one or more tasks, but only for a specific spout or bolt. |
| Task | A task performs the actual data processing. It is an instance of either a spout or a bolt. |
| ZooKeeper framework | Apache ZooKeeper is a service used by a cluster (group of nodes) to coordinate among themselves and maintain shared data with robust synchronization techniques. Nimbus is stateless, so it depends on ZooKeeper to monitor the status of the worker nodes. |

ZooKeeper helps the supervisors interact with Nimbus. It is responsible for maintaining the state of Nimbus and the supervisors.
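The containment relationship between the components described above (supervisor, worker process, executor, task) can be sketched as plain data classes. This is a minimal model for illustration only; the class names mirror the component names, but they are not Storm's own classes, and the topology and component names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Executor:
    """One thread; runs one or more tasks of a single spout or bolt."""
    component: str
    task_ids: list = field(default_factory=list)


@dataclass
class WorkerProcess:
    """One process; belongs to a single topology and owns several executors."""
    topology: str
    executors: list = field(default_factory=list)

    def task_count(self):
        return sum(len(e.task_ids) for e in self.executors)


@dataclass
class Supervisor:
    """One worker node; governs its worker processes."""
    host: str
    workers: list = field(default_factory=list)

    def task_count(self):
        return sum(w.task_count() for w in self.workers)


# Hypothetical topology and component names.
worker = WorkerProcess(
    topology="word-count",
    executors=[
        Executor(component="sentence-spout", task_ids=[1]),
        Executor(component="split-bolt", task_ids=[2, 3]),
    ],
)
node = Supervisor(host="node-1", workers=[worker])
print(node.task_count())
```

Note how the second executor holds two tasks of the same bolt, which matches the rule that an executor may run several tasks but only for one spout or bolt.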

Storm is stateless in nature. Although statelessness has its own disadvantages, it actually helps Storm process real-time data in the best and quickest way possible.

Storm isn’t entirely stateless, though. It stores its state in Apache ZooKeeper. Since the state is available in Apache ZooKeeper, a failed Nimbus can be restarted and made to resume from where it left off. Usually, service monitoring tools like monit will monitor Nimbus and restart it if there is any failure.
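The idea of a stateless master that recovers from an external store can be sketched as follows. This is a toy model under stated assumptions: `CoordinationStore` is an in-memory stand-in for ZooKeeper, `Master` is a simplified stand-in for Nimbus, and neither reflects the real ZooKeeper or Storm APIs.

```python
class CoordinationStore:
    """Stand-in for ZooKeeper: a key-value store that outlives any one process."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        self._data[key] = value

    def get(self, key, default=None):
        return self._data.get(key, default)


class Master:
    """Toy master that keeps no state of its own; everything lives in the store."""

    def __init__(self, store):
        self.store = store

    def assign(self, task, supervisor):
        # Read, update, and write back so the store always holds the full picture.
        assignments = dict(self.store.get("assignments", {}))
        assignments[task] = supervisor
        self.store.put("assignments", assignments)

    def assignments(self):
        return dict(self.store.get("assignments", {}))


store = CoordinationStore()
first = Master(store)
first.assign("bolt-0", "supervisor-a")
first.assign("bolt-1", "supervisor-b")

del first                  # simulate the master process failing
restarted = Master(store)  # e.g. brought back by a monitoring tool
print(restarted.assignments())
```

Because the master object holds nothing but a reference to the store, the restarted instance sees exactly the assignments made before the failure.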

Apache Storm also has an advanced topology called Trident Topology, which supports state maintenance and provides a high-level API similar to Pig.

Apache Storm – Applications

The Apache Storm framework supports many of today’s best industrial applications. This blog gives a very brief overview of some of the most notable applications of Storm.

Klout

Klout is an application that uses social media analytics to rank its users based on online social influence through the Klout Score, which is a numerical value between 1 and 100. Klout uses Apache Storm’s built-in Trident abstraction to create complex topologies that stream data.

The Weather Channel

The Weather Channel uses Storm topologies to ingest weather data. It has partnered with Twitter to enable weather-informed advertising on Twitter and mobile applications. OpenSignal is a company that specializes in wireless coverage mapping. StormTag and WeatherSignal are weather-based projects created by OpenSignal. StormTag is a Bluetooth weather station that attaches to a keychain. The weather data collected by the device is sent to the WeatherSignal app and OpenSignal servers.

Telecom Industry

Telecommunication providers process many phone calls per second. They perform forensics on dropped calls and poor sound quality. Call detail records flow in at a rate of millions per second, and Apache Storm processes them in real time and identifies any troubling patterns. Storm analysis can be used to continuously improve call quality.
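As a rough illustration of what "identifying troubling patterns" in call detail records might look like, the sketch below flags towers with a high dropped-call ratio. Everything here is an assumption made for the example: the record layout `(tower id, dropped flag)`, the function name `flag_towers`, and the thresholds. It processes a small in-memory list, whereas a real Storm topology would compute such counts incrementally inside bolts.

```python
from collections import defaultdict


def flag_towers(records, threshold=0.5, min_calls=3):
    """Return towers whose dropped-call ratio exceeds the threshold."""
    totals = defaultdict(int)
    dropped = defaultdict(int)
    for tower, was_dropped in records:
        totals[tower] += 1
        if was_dropped:
            dropped[tower] += 1
    # Ignore towers with too few calls to judge reliably.
    return sorted(
        tower
        for tower, total in totals.items()
        if total >= min_calls and dropped[tower] / total > threshold
    )


# Hypothetical call detail records: (tower id, call was dropped)
records = [
    ("tower-1", False), ("tower-1", False), ("tower-1", True),
    ("tower-2", True), ("tower-2", True), ("tower-2", False),
    ("tower-3", True),
]
flagged = flag_towers(records)
print(flagged)
```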

So, this brings us to the end of the blog. This Tecklearn ‘Concept of Cluster Architecture in Apache Storm’ blog helps you with commonly asked questions if you are looking for a job in the Apache Storm and Big Data domain.

If you wish to learn Apache Storm and build a career in the Apache Storm or Big Data domain, then check out our interactive Apache Storm Training, which comes with 24*7 support to guide you throughout your learning period. Please find the link for course details:

https://www.tecklearn.com/course/apache-strom-training/

Apache Storm Training

About the Course

Tecklearn Apache Storm training will give you a working knowledge of the open-source computational engine, Apache Storm. You will be able to do distributed real-time data processing and come up with valuable insights. You will learn about the deployment and development of Apache Storm applications in real world for handling Big Data and implementing various analytical tools for powerful enterprise-grade solutions. Upon completion of this online training, you will hold a solid understanding and hands-on experience with Apache Storm.

Why Should You Take Apache Storm Training?

  • The average pay of an Apache Storm professional stands at $90,167 P.A. – Indeed.com
  • Groupon, Twitter and many other companies use Apache Storm for business purposes like real-time analytics and micro-batch processing.
  • Apache Storm is a free and open-source distributed real-time computation system for processing fast, large streams of data.

What Will You Learn in This Course?

Introduction to Apache Storm

  • Apache Storm
  • Apache Storm Data Model

Architecture of Storm

  • Apache Storm Architecture
  • Hadoop distributed computing
  • Apache Storm features

Installation and Configuration

  • Pre-requisites for Installation
  • Installation and Configuration

Storm UI

  • Zookeeper
  • Storm UI

Storm Topology Patterns
