Mô tả

What is ETL?

The ETL (extract, transform, load) process is the most popular method of collecting data from multiple sources and loading it into a centralized data warehouse. ETL is an essential component of data warehousing and analytics.

Why Pentaho for ETL?

Pentaho has phenomenal ETL, data analysis, metadata management and reporting capabilities. Pentaho is faster than other ETL tools (including Talend). Pentaho has a user-friendly GUI which is easier and takes less time to learn. Pentaho is great for beginners. Also, Pentaho Data Integration (PDI) is an important skill in data analytics field.

How much can I earn?

In the US, median salary of an ETL developer is $74,835 and in India average salary is Rs. 7,06,902 per year. Accenture, Tata Consultancy Services, Cognizant Technology Solutions, Capgemini, IBM, Infosys etc. are major recruiters for people skilled in ETL tools; Pentaho ETL is one of the most sought-after skills that recruiters look for. Demand for Pentaho Data Integration (PDI) techniques is increasing day after day.

What makes us qualified to teach you?

The course is taught by Abhishek and Pukhraj. Instructors of the course have been teaching Data Science and Machine Learning for over a decade. We have experience in teaching and implementing Pentaho ETL, Pentaho Data Integration (PDI) for data mining and data analysis purposes.

We are also the creators of some of the most popular online courses - with over 150,000 enrollments and thousands of 5-star reviews like these ones:

I had an awesome moment taking this course. It broaden my knowledge more on the power use of Excel as an analytical tools. Kudos to the instructor! - Sikiru

Very insightful, learning very nifty tricks and enough detail to make it stick in your mind. - Armand

Our Promise

Teaching our students is our job and we are committed to it. If you have any questions about the course content on Pentaho, ETL, practice sheet or anything related to any topic, you can always post a question in the course or send us a direct message.

Download Practice files, take Quizzes, and complete Assignments

With each lecture, there is a practice sheet attached for you to follow along. You can also take quizzes to check your understanding of concepts on Pentaho, ETL, Pentaho Data Integration, Pentaho ETL. Each section contains a practice assignment for you to practically implement your learning on Pentaho, ETL, Pentaho Data Integration, Pentaho ETL. Solution to Assignment is also shared so that you can review your performance.

By the end of this course, your confidence in using Pentaho ETL and Pentaho Data Integration (PDI) will soar. You'll have a thorough understanding of how to use Pentaho for ETL and Pentaho Data Integration (PDI) techniques for study or as a career opportunity.

Go ahead and click the enroll button, and I'll see you in lesson 1 of this Pentaho ETL course!

Cheers

Start-Tech Academy

Bạn sẽ học được gì

Understanding of the entire data integration process using PDI

Extracting data from all popular data sources including Excel, JSON, Zipped files, TXT files and even cloud storage

Cleaning the data using Pentaho Data Integration

Applying business rules on the data in PDI

Different types of Data transformations

Loading the data into different formats

Managing SQL database using PDI

Metadata Injection - a powerful tool offered by PDI

Understanding of the concepts of data marts and data warehouse

Yêu cầu

  • Basic understanding of the data storage concepts will be helpful. Coding background is NOT required for this course

Nội dung khoá học

23 sections

Introduction

2 lectures
Welcome to the course
04:08
Course resources
00:04

Pentaho Data Integration (PDI) Installation and Setup

3 lectures
Setting up environment and installing PDI
05:16
This is a milestone!
03:31
Opening Spoon - The Graphical UI
07:19

A Simple ETL Demonstration

4 lectures
The example problem statement
06:46
Demonstration of a PDI transformation
22:45
Demonstration of a PDI Job
17:40
Quizzes
2 questions

Basic concepts - Theory for foundational understanding

5 lectures
What is ETL?
03:30
Check your understanding
1 question
Data Warehouse, Ops Database and Data mart
05:22
Inmon vs Kimball Architecture
03:33
ETL vs ELT
03:11

The ETL process: The practical part begins here

2 lectures
Data and the ETL process
10:10
Quizzes
2 questions

DATA EXTRACTION: Extracting tabular data

6 lectures
Manually entering data into PDI
12:45
Inputting Data from a TXT (text) file
13:15
Input from multiple CSV files at the same time
15:41
Inputting Data from an Excel file
07:00
Extracting Data from Zipped files
07:54
Quizzes
2 questions

DATA EXTRACTION: Extracting non-tabular data

2 lectures
Extracting from XML
06:47
Extracting from JSON
05:20

Extracting from an SQL table

4 lectures
Plan for importing sales data
03:23
Installing PostgreSQL and pgAdmin in your PC
10:44
Creating Sales table in SQL
09:14
Extracting from an SQL table
05:40

Storing and Retrieving Data from Cloud storage

2 lectures
Storing Data on AWS S3
06:13
Reading data from AWS S3
07:58

Merging Data Streams

6 lectures
Concepts: Merging Data Streams
08:03
Sorted Merge Step - Merging customer data
19:35
Merging product data
11:18
Time to check your understanding
1 question
Append data stream - merging sales data
15:55
Time to check your understanding
1 question

Data Cleansing

11 lectures
Introduction to Data Cleansing
07:25
Value Mapper Step
06:50
Replace in String Step
05:35
Time to check your understanding
1 question
Fuzzy Match concepts
03:14
Fuzzy Match Step in PDI
11:25
Fuzzy Match Algorithms
09:01
Time to check your understanding
1 question
Formula Step and changing data format
09:38
Common Data Cleaning Steps
03:31
Quiz
3 questions

Data Validation

6 lectures
Introduction to Data validation
04:50
Data_validation 1 - String-to-Int and integer range validations
11:27
Data validation 2 - Checking Reference Values using stream look-up
08:24
Data validation 3 - Order date < shipping date using calculator step
06:00
Common Data Validation steps
03:52
Quiz
3 questions

Error Handling

6 lectures
Correcting the errors and merging with main stream
11:47
Time to check your understanding
1 question
Writing the errors to the log
03:34
Time to check your understanding
1 question
Writing the errors to a separate file
05:30
Time to check your understanding
1 question

Transformation and Analytics steps

4 lectures
Concatenating Address Fields
05:41
Data Aggregation using Group-by
09:08
Normalization and Denormalization
07:05
Number Range Step
04:57

PDI SQL Connection

4 lectures
Introduction to PDI - SQL connection
06:04
Reading and filtering data from DB into PDI
05:01
Updating and Inserting data into DB from PDI
08:10
Deleting data from SQL DB using PDI
03:46

Conceptual understanding for Loading Data

7 lectures
Facts and Dimensions tables
03:29
Time to check your understanding
1 question
Surrogate Keys in Dimension tables
03:52
Type 1 & 2 Slowly Changing Dimensions
03:17
Time to check your understanding
1 question
Schemas
03:32
Quiz
1 question

Loading the data into a Data Mart

4 lectures
Creating tables in DB
07:37
Loading Customer Data using combination lookup/ update step
13:48
Loading product data using dimension lookup step
12:03
Loading sales data after database lookup steps
09:57

Running Java and Javascript

1 lectures
Scripting Steps
07:15

PDI Jobs

7 lectures
PDI Jobs vs Transformation
02:49
Controlling the flow of execution
05:57
Setting variables using set variables step
07:55
File and Folder Management
07:22
Sending Email Step
10:21
Abort Job Step
03:48
Time to check your understanding
1 question

Scheduling a job for production environment

1 lectures
Running using command prompt and scheduling
06:33

Metadata injection

1 lectures
Metadata injection
13:14

Regex Notation

1 lectures
Regular Expressions for advanced String Matching
15:21

Congratulations and about your certificate

3 lectures
Alternative to Pentaho
00:24
The final milestone!
01:33
Bonus Lecture
00:57

Đánh giá của học viên

Chưa có đánh giá
Course Rating
5
0%
4
0%
3
0%
2
0%
1
0%

Bình luận khách hàng

Viết Bình Luận

Bạn đánh giá khoá học này thế nào?

image

Đăng ký get khoá học Udemy - Unica - Gitiho giá chỉ 50k!

Get khoá học giá rẻ ngay trước khi bị fix.