How to Set Up Amazon Redshift

Last updated on Dec 10 2021
Padmanabham Suresh

Table of Contents

How to Set Up Amazon Redshift

Amazon Redshift is a fully managed data warehouse service in the cloud. Its datasets range from 100s of gigabytes to a petabyte. The initial process to create a data warehouse is to launch a set of compute resources called nodes, which are organized into groups called cluster. After that you can process your queries.

Following are the steps to set up Amazon Redshift.

Step 1 − Sign in and launch a Redshift Cluster using the following steps.

  • Sign in to AWS Management console and use the following link to open Amazon Redshift console − https://console.aws.amazon.com/redshift/
  • Select the region where the cluster is to be created using the Region menu on the top right side corner of the screen.
  • Click the Launch Cluster button.

1 10

  • The Cluster Details page opens. Provide the required details and click the Continue button till the review page.

2 9

  • A confirmation page opens. Click the Close button to finish so that cluster is visible in the Clusters list.

3 7

  • Select the cluster in the list and review the Cluster Status information. The page will show Cluster status.

4 7

Step 2 − Configure security group to authorize client connections to the cluster. The authorizing access to Redshift depends on whether the client authorizes an EC2 instance or not.

Follow these steps to security group on EC2-VPC platform.

  • Open Amazon Redshift Console and click Clusters on the navigation pane.
  • Select the desired Cluster. Its Configuration tab opens.

5 7

  • Click the Security group.
  • Once the Security group page opens, click the Inbound tab.

6 6

  • Click the Edit button. Set the fields as shown below and click the Save button.
    • Type − Custom TCP Rule.
    • Protocol − TCP.
    • Port Range − Type the same port number used while launching the cluster. By-default port for Amazon Redshift is 5439.
    • Source − Select Custom IP, then type 0.0.0.0/0.

7 5

Step 3 − Connect to Redshift Cluster.

There are two ways to connect to Redshift Cluster − Directly or via SSL.

Following are the steps to connect directly.

  • Connect the cluster by using a SQL client tool. It supports SQL client tools that are compatible with PostgreSQL JDBC or ODBC drivers.

Use the following links to download − JDBC

https://jdbc.postgresql.org/download/postgresql-8.4-703.jdbc4.jar

ODBC https://ftp.postgresql.org/pub/odbc/versions/msi/psqlodbc_08_04_0200.zip or http://ftp.postgresql.org/pub/odbc/versions/msi/psqlodbc_09_00_0101x64.zip for 64 bit machines

  • Use the following steps to get the Connection String.
    • Open Amazon Redshift Console and select Cluster in the Navigation pane.
    • Select the cluster of choice and click the Configuration tab.
    • A page opens as shown in the following screenshot with JDBC URL under Cluster Database Properties. Copy the URL.

8 4

  • Use the following steps to connect the Cluster with SQL Workbench/J.
    • Open SQL Workbench/J.
    • Select the File and click the Connect window.
    • Select Create a new connection profile and fill the required details like name, etc.
    • Click Manage Drivers and Manage Drivers dialog box opens.
    • Click the Create a new entry button and fill the required details.

9 4

  • Click the folder icon and navigate to the driver location. Finally, click the Open button.
  • Leave the Classname box and Sample URL box blank. Click OK.
  • Choose the Driver from the list.
  • In the URL field, paste the JDBC URL copied.
  • Enter the username and password to their respective fields.
  • Select the Autocommit box and click Save profile list.

11 3

Features of Amazon Redshift

Following are the features of Amazon Redshift −

  • Supports VPC − The users can launch Redshift within VPC and control access to the cluster through the virtual networking environment.
  • Encryption − Data stored in Redshift can be encrypted and configured while creating tables in Redshift.
  • SSL − SSL encryption is used to encrypt connections between clients and Redshift.
  • Scalable − With a few simple clicks, the number of nodes can be easily scaled in your Redshift data warehouse as per requirement. It also allows to scale over storage capacity without any loss in performance.
  • Cost-effective − Amazon Redshift is a cost-effective alternative to traditional data warehousing practices. There are no up-front costs, no long-term commitments and on-demand pricing structure.

So, this brings us to the end of blog. This Tecklearn ‘How to Set Up Amazon RedShift’ helps you with commonly asked questions if you are looking out for a job in AWS and Cloud Computing. If you wish to learn AWS and build a career in Cloud Computing domain, then check out our interactive, AWS Solutions Architect Training, that comes with 24*7 support to guide you throughout your learning period. Please find the link for course details:

https://www.tecklearn.com/course/aws-solutions-architect-certification-training/

AWS Solutions Architect Certification Training

About the Course

Tecklearn’s AWS Architect Certification Training is curated by industry professionals as per the industry requirements and demands. The entire AWS training course is in line with the AWS Certified Solutions Architect exam. You will learn various aspects of AWS like Elastic Cloud Compute, Simple Storage Service, Virtual Private Cloud, Aurora database service, Load Balancing, Auto Scaling and more by working on hands-on projects and case studies. You will master AWS architectural principles and services such as IAM, VPC, EC2, EBS and elevate your career to the cloud, and beyond with this AWS solutions architect course.

Why Should you take AWS Architect Certification Training?

  • The Average salary of an AWS Certified Solutions Architect is $129k per annum – Indeed.com
  • AWS market is expected to reach $236 Billion by 2020 at a CAGR of 22% – Forrester
  • Netflix, Twitter, LinkedIn, Facebook, BBC, Baidu, ESPN & other MNCs worldwide use Amazon AWS Cloud

What you will Learn in this Course?

Overview of Cloud Computing and AWS

  • What is Cloud Computing
  • Definition of Cloud Computing
  • On Premises Vs Service Models
  • Advantages and Disadvantages of Cloud Computing
  • Cloud Computing Providers
  • Why AWS
  • What is AWS
  • AWS Benefits
  • AWS Services
  • Traditional Vs AWS Components
  • AWS Global Infrastructure
  • AWS Availability Zone
  • AWS Edge Locations
  • How to Access the AWS Services
  • AWS architecture
  • AWS Management Console
  • AWS offerings Listing (EC2, VPC, AMI, EBS, ELB, Backup)

Amazon Elastic Compute Cloud (EC2)

  • Overview of EC2
  • Elastic IP Vs Public IP
  • Launching of AWS EC2 instance demo
  • How to access EC2
  • EC2 Purchasing Options
  • Amazon Machine Images (AMI)
  • EC2 Storage for the Root Device
  • EC2 Creating AMI
  • EC2 Instance Types
  • Auto Scaling
  • Cost of EC2
  • Best Practices of EC2
  • EC2 Resizing
  • Placement Groups
  • Amazon Backup and various Concepts
  • EC2 Demo
  • Hands On

Networking and Monitoring Services: Amazon Virtual Public Cloud

  • Virtual Private Cloud (VPC) and its benefits
  • Default and Non-Default VPC
  • IP Address
  • CIDR – Classless Inter-domain Routing
  • Subnet: Subnet Mask and Subnet Mask Classes
  • Private and Public Subnet
  • IPv4 v/s IPv6 – As in AWS Infrastructure
  • Internet Gateway and Route Tables
  • Security Group with VPC
  • Access Control List, NACL and Security Group
  • NAT Devices: NAT Gateway and NAT Instance
  • Flow Logs
  • VPC Peering and its working
  • VPN and Direct Connect
  • VPC Limitations
  • Need for Monitoring Services
  • AWS CloudWatch and it’s working
  • AWS Command Line Interface
  • Use Cases
  • Hands On

Amazon Storage Services: Elastic Block Storage

  • What is Storage Services
  • What is Elastic Block Storage (EBS)
  • Persistent Storage
  • EBC Features
  • EBS Benefits
  • EBS Types
  • EBS Pricing
  • EBS Life Cycle
  • EBS Snapshot
  • EBS General Purposed SSD
  • EBS Provisioned IOPS SSD
  • EBS Throughput Optimized HDD
  • EBS Cold HDD
  • EBS Comparison
  • EBS Previous Generation Volumes
  • EBS How Incremental Snapshots Work
  • EBS Deleting an Amazon EBS Snapshot
  • EBS Summary
  • Hands On

Amazon Storage Services: Simple Storage Services (S3)

  • What is Amazon AWS S3
  • Simple Storage Services (S3) Advantages
  • S3 Buckets, Objects, Keys and Endpoints
  • S3 Data Consistency Model
  • S3 Transfer Acceleration
  • S3 Storage Types
  • S3 Versioning
  • S3 Life Cycle Management
  • S3 Data Protection
  • S3 Cross-Region Replication
  • S3 Hosting a Static Website
  • Hands On

Amazon Storage Services

  • Amazon Glacier Storage
  • Amazon Storage Gateway
  • Amazon Snowball (Data Import /Export)
  • Billing with Amazon CloudWatch
  • Hands On

AWS Database Services: Relational Database Service (RDS)

  • Overview of Databases and Relational Database Service (RDS)
  • What is Amazon RDS
  • AWS RDS Components
  • AWS RDS: Interface
  • AWS RDS: Charges
  • AWS RDS Multi-AZ: Benefits
  • AWS RDS Multi-AZ: Failover Process
  • NoSQL Database: Amazon DynamoDB
  • Overview of DynamoDB
  • DynamoDB Benefits
  • Hands On

AWS Database Services Continued

  • Data Warehouse: Amazon Redshift
  • Overview of Amazon Redshift
  • Redshift Architecture
  • Amazon Redshift features
  • In Memory Cache: Amazon ElasticCache
  • Redis Vs MemCache
  • Amazon ElasticCache Cluster
  • Database Migration: AWS Database Migration Service

Load Balancing in AWS

  • What is Fault Tolerant System
  • Features of Elastic Load Balancing
  • What is AWS ELB (Elastic Load Balancer)
  • Types of Load Balancer: Classic, Application and Network
  • Classic Load Balancer: Features, Health Check Configuration, Cross-Zone, Connection Draining, Sticky Sessions, Access Logs, Limitation
  • Application Load Balancer: Features, Application Flow, Limitation
  • Network Load Balancer
  • Access Elastic Load Balancing: AWS Management Console, AWS CLI, AWS SDKs, HTTPS Query API

Amazon Route 53

  • What is Amazon Route 53
  • Domain Name Registration
  • Routing Internet Traffic to Resources
  • Automated check of the health of Resources + Data Pipeline

AWS Identity and Access Management (IAM) – Control user access

  • Authentication (Who can use) and Authorization (Level of Access)
  • IAM Policies – JSON Structure
  • Users, Groups and their Roles
  • AWS IAM Features
  • User Sign-in to Account
  • Switch Role
  • Role to EC2 Instance
  • Password Policy
  • How to Access AWS
  • Multi-Factor Authentication (MFA)
  • Permissions and Permission Types
  • Policies Structure
  • User Based Policies
  • Resource Based Policies
  • Resource Based Permission
  • Policies Types
  • Request Flow
  • Limitations
  • IAM HTTPS API
  • Logging IAM Events with AWS CloudTail
  • Hands On

Amazon CloudWatch

  • What is Amazon CloudWatch
  • Features and Benefits
  • CloudWatch Architecture
  • Hands On

AWS Auto Scaling

  • What is AWS Auto Scaling
  • Auto Scaling Components
  • Auto Scaling Group
  • Auto Scaling Launch Configuration
  • Auto Scaling Benefits
  • Auto Scaling Lifecycle
  • Auto Scaling Plans
  • Manual Scaling
  • Schedule Scaling
  • Dynamic Scaling
  • Auto Scaling Step Adjustment
  • Auto Scaling Termination Policy
  • Default Termination Policy
  • Health Check
  • Hands On

Amazon Application Services

  • Elastic BeanStalk
  • Simple Email Services (SES)
  • Simple Queue Service (SQS)
  • Simple Notification Services (SNS)
  • AWS Lambda
  • Introduction to Elastic OpWorks
  • Hands On

About AWS Solution Architect Associate Exam

Got a question for us? Please mention it in the comments section and we will get back to you.

 

0 responses on "How to Set Up Amazon Redshift"

Leave a Message

Your email address will not be published. Required fields are marked *