Deep dive into HBase architecture

Last updated on May 30 2022
Sonali Singh

Table of Contents

Deep dive into HBase architecture

HBase – Architecture

In HBase, tables are split into regions and are served by the region servers. Regions are vertically divided by column families into “Stores”. Stores are saved as files in HDFS. Shown below is the architecture of HBase.

Note: The term ‘store’ is employed for regions to explain the storage structure.

image001 24
Architecture

HBase has three major components: the client library, a master server, and region servers. Region servers can be added or removed as per requirement.

MasterServer

The master server –

  • Assigns regions to the region servers and takes the assistance of Apache ZooKeeper for this task.
  • Handles load balancing of the regions across region servers. It unloads the busy servers and shifts the regions to less occupied servers.
  • Maintains the state of the cluster by negotiating the load balancing.
  • Is liable for schema changes and other metadata operations such as creation of tables and column families.

Regions

Regions are nothing but tables that are split up and spread across the region servers.

Region server

The region servers have regions that –

  • Communicate with the client and handle data-related operations.
  • Handle read and write requests for all the regions under it.
  • Decide the dimensions of the region by following the region size thresholds.

When we take a deeper check out the region server, it contain regions and stores as shown below:

image002 29
Region server

The store contains memory store and HFiles. Memstore is just like a cache memory. Anything that’s entered into the HBase is stored here initially. Later, the data is transferred and saved in Hfiles as blocks and therefore the memstore is flushed.

Zookeeper

  • Zookeeper is an open-source project that gives services like maintaining configuration information, naming, providing distributed synchronization, etc.
  • Zookeeper has ephemeral nodes representing different region servers. Master servers use these nodes to get available servers.
  • In addition to availability, the nodes are also employed to track server failures or network partitions.
  • Clients communicate with region servers via zookeeper.
  • In pseudo and standalone modes, HBase itself will take care of zookeeper.

So, this brings us to the end of blog. This Tecklearn ‘Deep dive into HBase architecture’ helps you with commonly asked questions if you are looking out for a job in HBase and No-SQL Database Domain.

If you wish to learn HBase and build a career in HBase or No-SQL Database domain, then check out our interactive, Apache HBase Training, that comes with 24*7 support to guide you throughout your learning period. Please find the link for course details:

https://www.tecklearn.com/course/apache-hbase-training/

Apache HBase Training

About the Course

Tecklearn Apache HBase training will master the powerful NoSQL distributed database. You will learn HBase architecture, data analytics using HBase, integration with Hive, monitoring cluster using ZooKeeper and working on real-life industry projects. Build your career as a certified HBase professional through our hands-on training with real-world examples. Upon completion of this online training, you will hold a solid understanding and hands-on experience with Apache HBase.

Why Should you take Apache HBase Training?

  • HBase is now the largest data-driven service serving top websites including Facebook Messaging Platform.
  • There is Strong demand for HBase qualified professionals and they are paid big bucks for the right skills.
  • According to indeed.com, the average pay of an HBase developer stands at $81,422 per annum.

What you will Learn in this Course?

Introduction to HBase and NoSQL

  • Introduction to HBase
  • Fundamentals of HBase
  • What is NoSQL
  • NoSQL Vs RDBMS
  • Why HBase
  • Where to use HBase

HBase Data Modelling

  • Data Modelling
  • HDFS vs. HBase
  • HBase Use Cases

HBase Architecture and Components

  • HBase Architecture
  • Components of HBase Cluster

HBase Installation

  • Prerequisites for HBase Installation
  • Installation Steps

Programming in HBase

  • Create an Eclipse Project for HBase
  • Simple Table Creation from Java in HBase
  • HBase API
  • HBase Shell
  • Primary operations and advanced operations

Integration of Hive with HBase

  • Create a table and insert data into it
  • Integration of Hive with HBase
  • HBase Mapping

Deep Dive into HBase

  • Input Data into HBase
  • File Loading
  • HDFS File
  • HBase handling files in File System
  • WAL
  • Seek Vs Transfer
  • HBase ACID Properties

Got a question for us? Please mention it in the comments section and we will get back to you.

 

 

0 responses on "Deep dive into HBase architecture"

Leave a Message

Your email address will not be published. Required fields are marked *