Hadoop Architecture & Administration Training for Big Data Solutions

Level: Intermediate
Rating: 4.61/5, based on 71 reviews

In this Hadoop Architecture and Administration training course, you gain the skills to install, configure, and manage the Apache Hadoop platform and its associated ecosystem, and build a Hadoop big data solution that satisfies your business requirements. You will learn to install and build a Hadoop cluster capable of processing very large data sets, then configure and tune the Hadoop environment to ensure high throughput and availability.

You will also learn to allocate, distribute, and manage resources; monitor the Hadoop file system, job progress, and overall cluster performance; and exchange information with relational databases.

Key Features of this Hadoop Administration for Big Data Training

  • After-course instructor coaching benefit
  • Learning Tree end-of-course exam included
  • After-course computing sandbox included

You Will Learn How To

  • Architect a Hadoop solution to satisfy your business requirements
  • Install and build a Hadoop cluster capable of processing very large data sets
  • Configure and tune the Hadoop environment to ensure high throughput and availability
  • Allocate, distribute, and manage resources
  • Monitor the file system, job progress, and overall cluster performance

Choose the Training Solution That Best Fits Your Individual Needs or Organisational Goals

LIVE, INSTRUCTOR-LED

In Class & Live, Online Training

  • 4-day instructor-led training course
  • After-course instructor coaching benefit
  • Learning Tree end-of-course exam included

Standard £2095

PRODUCT #1252

TRAINING AT YOUR SITE

Team Training

  • Bring this or any training to your organisation
  • Full-scale program development
  • Delivered when, where, and how you want it
  • Blended learning models
  • Tailored content
  • Expert team coaching

Save More on Training with Learning Tree Training Vouchers!

Our flexible, easy-to-redeem training vouchers are available to any employee within your organisation. For details, please call 0800 282 353 or chat live.

In Class & Live, Online Training

Note: This course runs for 4 Days

  • 21 - 24 Jan, 2:00 PM - 9:30 PM GMT, Rockville, MD / Online (AnyWare)

  • 18 - 21 Feb, 2:00 PM - 9:30 PM GMT, New York / Online (AnyWare)

  • 21 - 24 Jul, 2:00 PM - 9:30 PM BST, Rockville, MD / Online (AnyWare)

Guaranteed to Run

When you see the "Guaranteed to Run" icon next to a course event, you can rest assured that your course event — date, time, location — will run. Guaranteed.

Hadoop Administration Course Information

  • Recommended Experience

    • Knowledge of Linux at the level of:
    • Knowledge of Java at the level of:

Hadoop Administration Course Outline

  • Introduction to Data Storage and Processing

    Installing the Hadoop Distributed File System (HDFS)

    • Defining key design assumptions and architecture
    • Configuring and setting up the file system
    • Issuing commands from the console
    • Reading and writing files

    Setting the stage for MapReduce

    • Reviewing the MapReduce approach
    • Introducing the computing daemons
    • Dissecting a MapReduce job
  • Defining Hadoop Cluster Requirements

    Planning the architecture

    • Selecting appropriate hardware
    • Designing a scalable cluster

    Building the cluster

    • Installing Hadoop daemons
    • Optimising the network architecture
  • Configuring a Cluster

    Preparing HDFS

    • Setting basic configuration parameters
    • Configuring block allocation, redundancy and replication

    Deploying MapReduce

    • Installing and setting up the MapReduce environment
    • Delivering redundant load balancing via Rack Awareness
  • Maximising HDFS Robustness

    Creating a fault-tolerant file system

    • Isolating single points of failure
    • Maintaining High Availability
    • Triggering manual failover
    • Automating failover with ZooKeeper

    Leveraging NameNode Federation

    • Extending HDFS resources
    • Managing the namespace volumes

    Introducing YARN

    • Critiquing the YARN architecture
    • Identifying the new daemons
  • Managing Resources and Cluster Health

    Allocating resources

    • Setting quotas to constrain HDFS utilisation
    • Prioritising access to MapReduce using schedulers

    Maintaining HDFS

    • Starting and stopping Hadoop daemons
    • Monitoring HDFS status
    • Adding and removing data nodes

    Administering MapReduce

    • Managing MapReduce jobs
    • Tracking progress with monitoring tools
    • Commissioning and decommissioning compute nodes
  • Maintaining a Cluster

    Employing the standard built-in tools

    • Managing and debugging processes using JVM metrics
    • Performing Hadoop status checks

    Tuning with supplementary tools

    • Assessing performance with Ganglia
    • Benchmarking to ensure continued performance
  • Extending Hadoop

    Simplifying information access

    • Enabling SQL-like querying with Hive
    • Installing Pig to create MapReduce jobs

    Integrating additional elements of the ecosystem

    • Imposing a tabular view on HDFS with HBase
    • Configuring Oozie to schedule workflows
  • Implementing Data Ingress and Egress

    Facilitating generic input/output

    • Moving bulk data into and out of Hadoop
    • Transmitting HDFS data over HTTP with WebHDFS

    Acquiring application-specific data

    • Collecting multi-sourced log files with Flume
    • Importing and exporting relational information with Sqoop
  • Planning for Backup, Recovery and Security

    • Coping with inevitable hardware failures
    • Securing your Hadoop cluster

Hadoop Administration Training FAQs

  • Can I learn Hadoop Architecture and Administration online?

    Yes! We know your busy work schedule may prevent you from getting to one of our classrooms, which is why we offer convenient online training to meet your needs wherever you are.

Questions about which training is right for you?

Call 0800 282 353 or chat live.

100% Satisfaction Guaranteed

Your Training Comes with a 100% Satisfaction Guarantee!*

  • If you are not 100% satisfied, you pay no tuition fee!
  • No advance payment required for most products.
  • Tuition fee can be paid later by invoice or at the time of checkout by credit card.

*Partner-delivered courses may have different terms that apply. Ask for details.
