Hadoopexpress - Big Data Training, Consulting and Development
  • Login
  • Sign up

Hadoop Professional Certificate (Batch 6)

Fri,Sat, Sun (2 Weeks)
EST:[19:30-23:30]
New York Time
US $499

The course is scheduled to be held three days a week and lasts for two weeks. Schedule is as follows:
USA Students : Friday, Saturday, Sunday for two weeks
Time: 7.30 pm to 11.30 pm EST (New York time)

This course is designed for programmers and architects wishing to learn Hadoop from scratch or to improve their understanding of Hadoop and its ecosystem. All essential topics of core Hadoop framework as well as popular tools such as PIG, HIVE, Sqoop and OOZIE are included as essential components of the course modules. The course aims to convert new Hadoop learners into practitioners.

About this Course

Fri,Sat, Sun (2 Weeks)
EST:[19:30-23:30]
New York Time
US $499

The course is scheduled to be held three days a week and lasts for two weeks. Schedule is as follows:
USA Students : Friday, Saturday, Sunday for two weeks
Time: 7.30 pm to 11.30 pm EST (New York time)

This course is designed for programmers and architects wishing to learn Hadoop from scratch or to improve their understanding of Hadoop and its ecosystem. All essential topics of core Hadoop framework as well as popular tools such as PIG, HIVE, Sqoop and OOZIE are included as essential components of the course modules. The course aims to convert new Hadoop learners into practitioners.

Course Syllabus

The course contains detailed explanation of Hadoop and Big Data technical aspects such as creation of Hadoop clusters, writing Map Reduce programs, using File Systems commands, Streaming,  data movement and storage in Hadoop clusters, practical demonstration of Pig and Hive as well as essential concepts of Sqoop and Oozie.


It’s an ideal course for getting up to speed quickly on Big Data and Hadoop in order to start writing useful Programs using the Hadoop framework.


  • Topic 1 : Introduction to Hadoop and Big Data

  • Conceptual understanding of Hadoop and Big Data and their relevance in the industry. Uses of Big Data. Architecture of Hadoop and an explanation of Hadoop ecosystem.


  • Topic 2 : Installing and Configuring a Hadoop Cluster

  • Hadoop single and multi-node cluster installation. Live demo of an actual installation. Commands for cluster startup, shutdown and monitoring.


  • Topic 3 : File System Commands

  • Common commands of Unix and Hadoop file system. Differences between local file system and the Hadoop Distributed file system.


  • Topic 4 : Distcp and Archive

  • Usage of distcp and har commands in detail for copying files across Hadoop file systems and archiving of files in Hadoop


  • Topic 5 : Map Reduce Introduction

  • MapReduce framework with emphasis on key-value pair concept. Essential concepts of MapReduce such as input splits, combiners, mappers, reducers, shuffle and sort. The word count example with relevance to MapReduce.


  • Topic 6 : MapReduce: I/O

  • Detailed explanation of input and output types used with mappers and reducers. Reading and writing to files using java APIs within Hadoop. Sample programs for programmatic reading of files, writing to files and querying file contents as well as file metadata.


  • Topic 7 : MapReduce: Advanced Concepts

  • Definitions of default mappers and reducers such as Identity Mapper, Identity Reducer, Inverse Mapper, Chain Mapper, Token Counter Mapper and Regex Mapper. Using Configuration API, Tool, ToolRunner and GenericOptionsParser for command line options. Writing programs that interact with the file system and its metadata. Understanding Sequence and AVRO files


  • Topic 8 : Streaming

  • Hadoop streaming is explained in detail with a demo of streaming example using Python. The lesson deals with detailed explanation of how streaming is executed in Hadoop, the mechanism followed by Hadoop and practical demo of command options.


  • Topic 9 : PIG

  • PIG language demonstrated with commands and examples. Demos on PIG usage for loading unstructured data into structured and formatted forms into Hadoop clusters.


  • Topic 10 : HIVE

  • Demos and examples of Hive usage for querying the data using SQL commands.


  • Topic 11 : SQOOP and OOZIE

  • Sqoop examples and demos for importing data residing in external source like MySQL databases into Hadoop clusters as well as exporting data out of Hadoop to external sources. Live Demo of Oozie to schedule running of scripts and jobs in Hadoop.


  • Topic 12 : Introduction to NoSQL databases and HBASE

Course Structure


  • Topic 1 : Introduction to Hadoop and Big Data

  • Topic 2 : Installing and Configuring a Hadoop Cluster

  • Topic 3 : File System Commands

  • Topic 4 : Distcp and Archive

  • Topic 5 : Map Reduce Introduction

  • Topic 6 : MapReduce: I/O

  • Topic 7 : MapReduce: Advanced Concepts

  • Topic 8 : Streaming

  • Topic 9 : PIG

  • Topic 10 : HIVE

  • Topic 11 : SQOOP and OOZIE

  • Topic 12 : Introduction to NoSQL databases and HBASE

Course Logistics

How the course is delivered:

An instructor delivers the course live over the Internet. Students have two choices to join the lectures:

  1. Join the lecture from home
  2. Join the lecture at our facility at Parsippany, New Jersey

Additional Charges may apply for the classes at our facility.

If you prefer joining the lecture from our facility, you must book a spot at the facility two weeks before the start of the course. You may do so by using the email or phone or live chat provided on our home page. Make sure you have a confirmation email from us for your booking before you arrive at the facility. After receiving a confirmation, you may arrive at the facility with or without a laptop. Ask for Net Serpents education center at the front desk.

If your course is not scheduled between 8 am and 5 PM EST on weekdays or falls on a weekend, a member from our staff will meet you at the building entrance and escort you in as special permission is required outside regular hours of operation.


Steps to join the lecture from home:

  1. If you haven’t done so already, create an account by clicking on Register on top right of home page
  2. Login with your user-id and password and click Enroll Now on the course card in the home page. Click Enroll Now again in the pop-up window. You will navigate to the course order page. Apply a discount code if you have one and then click on Place Order. Fill in the requested credit card and personal details. These are not saved to our database. Your payment is safe and authorized by a secure payment gateway authorize.net.
  3. On successful payment you will receive a confirmation email
  4. On the scheduled date and time of the course, go to hadoopexpress.com and login with your user-id and password
  5. You will see your username on top right. Click on it and go to your dashboard by selecting My Dashboard
  6. In your dashboard page click the Go to course button
  7. Click on Go to live class on right hand side of page
  8. You will land on a Zoom meeting page where you will be able to download zoom and join the meeting. The download is required only the first time
  9. You will be able to see the instructor screen and pick the option to use your phone or computer for sound. Make sure you have a microphone and speaker on your laptop or a headset connected to it.


Steps to join the lecture from our facility:

  1. Create an account and enroll by paying for course
  2. If you haven’t done so already, create an account by clicking on Register on top right of home page
  3. Login with your user-id and password and click Enroll Now on the course card in the home page. Click Enroll Now again in the pop-up window. You will navigate to the course order page. Apply a discount code if you have one and then click on Place Order. Fill in the requested credit card and personal details. Note: we don’t store these details in our database. Your payment is safe and authorized by a secure payment gateway authorize.net.
  4. On successful payment you will receive a confirmation email.
  5. Call or email or use live chat at our home page at least two weeks in advance of the start date to request and confirm your reservation at the facility.
  6. On the scheduled date and time of the course, arrive at our sponsoring facility Net Serpents LLC, 2001 Route 46, Suite 310, Waterview Plaza, Parsippany, NJ 07054. You will be provided a seating space with all necessary equipment to attend the lecture. You may bring your laptop or request a computer from us. Please call for additional details.

The course is delivered over six live sessions of 4 hours each. Each live session is also recorded and made available in your dashboard over the internet for reviewing the lecture afterwards at your convenience. Further, you may download student guides, examples, exercises and videos to your laptop for personal use.

If the software does not require purchasing a license, you may install it on your laptop with guidance from our instructor. If you are unable to do so for any reason, you may request accessing the software provided by us on the cloud.


Discussion Forum:

A discussion forum is available on-line to allow students to post any queries or discuss any topic with other students or the instructor.


Course Material and Videos

Each live session is also recorded and made available in your dashboard over the internet for reviewing the lecture afterwards at your convenience.Further, you may download student guides, examples, exercises and videos to your laptop for personal use. Course material and discussion forum may be disabled anytime one month after the delivery of the last lecture.

Opportunities after the course

Hadoop is an emerging technology that has made rapid progress. It has already been adopted by a majority of Fortune 100 companies and is considered as the technology of the future for dealing with storage, retrieval and analysis of massive amounts of data. Naturally, the careers on this technology are at an upswing and the demand for professionals has started outweighing the supply of knowledgeable professionals.


The salaries of programmers of Hadoop are in the top bracket in IT. Career opportunities exist in large companies across industry segments i.e., social media, banking, pharmaceuticals, energy, insurance, airlines, railways and many others.


The demand for Hadoop professionals is growing at over 20% a year and is expected to peak over the next one or to years. Large companies like Yahoo, Google, Facebook and Twitter are leading users of this relatively new technology.


Opportunities exist in IT Consulting companies as well as Fortune 500 companies.


Delivery Method
Self Placed $ !i(i,r

Additional Batches
Course at a Glance
  • English
  • Skill Level: Intermediate
Online Classes
Assignments: 6
Project: 1
Lifetime Access
Certificates
System Requirements

High speed internet connection, laptop or PC with good screen resolution and ability to connect to internet, Headset with microphone or built-in speaker and microphone on laptop or PC.

Prerequisites
  • Knowledge of an object oriented programming language is highly recommended, though not essential. However, prior programming knowledge or background is required. Such background may be on any programming language.Java knowledge is recommended and would be very useful for the course though it is not essential.

Testimonials

" The course was very interactive and easy to understand even for a beginner like me! It helped me prepare and pass my certification soon after completing the course!! "

- Priyam

" I really loved this course. It was fast paced, very hands on with fun filled exercises. Not only do I have lifetime access to lectures and notes, I can also email the instructor any time for help! Awesome!! "

- Samuel Adlekha

" Loved the the course. The instructor was patient and provided great demos and examples. I am new to programming but felt so comfortable since it was well explained. Awesome! "

- Shveta

" It was a pleasure and great learning experience with Net Serpents under the guidance of Mr. Shashi Prakash. "

- Aijaz

Contact Us:

Hadoop is a registered trademark of the Apache Software Foundation(ASF) and Hadoop is a product owned by Apache. Hadoop Express is not affiliated in any way to ASF . All educational material, resources, videos and other content available on this site is created and owned by Net Serpents and is intended only to provide training. This website does not own any of the products on which it provides training, many of which are owned by Apache while others are owned companies such as SAS, Python and Oracle. Net Serpents LLC is committed to education and online learning. All recognizable terms, names of software, tools, programming languages that appear on this site belong to the respective copyright and/or trademark owners.