Mô tả

In this Course you will learn the Fundamentals of XPath, Selenium and the Web Scraping Process. We will cover the Fundamentals and afterwards we are going to scrape Data from real Websites. The first Real Life Project will be the extraction of Data from Yelp and the next Project will cover the scraping process of tables. But before we start with this Real Life Projects, you will get familiar to all the basic knowledge which is required to complete it. Whenever you have a question, don't hesitate to ask in the forum section. Either me or the other students will reply to your question as soon as possbile.

After completing this course you will be confident using Selenium for Web Scraping in your personal Projects. Especially for Data Scientists it is important to be able to extract the data they need to analyze and work with. You will get downloadable files so that you can refer to all topics which we have covered through this course. This course will be updated on a reglular basis. My goal is that all my students understand the Concepts of Selenium, XPath and the whole Web Scraping Process. For this course it's good to know the very basics of Python Programming.


Disclaimer : I teach web scraping as a tutor for educational purposes. That's it.

The first rule of scraping the web is: do not harm a certain website. The second rule of web crawling is: do NOT harm a certain website.

Bạn sẽ học được gì

Web Scraping with Selenium

Most Important Concepts of XPath

Scraping Tables

Data Extraction for Data Science

Combination of Python, Selenium, Pandas

Yêu cầu

  • Basic Understanding of Python

Nội dung khoá học

6 sections

Course Overview and Goals

2 lectures
Course Introduction
04:05
Before you start
00:33

Basics of XPath Expressions

6 lectures
Basics of XPath - Topic Overview
02:05
Importance of XPath
01:58
XPath - Syntax
09:16
Absolute and Relative XPath
07:23
Difference between Single Slash and Double Slash
03:24
Parents and Siblings
11:34

Basics of Selenium for Web Scraping

11 lectures
Set Up
09:10
Overview
04:10
Initialization
09:20
XPath - Locator
11:10
Class Name - Locator
05:02
ID - Locator
06:42
Name - Locator
03:44
(Partial)- Link Text - Locator
07:07
Alternative XPath - Syntax
05:23
Selenium in Action
03:57
(Optional) - Selenium in Headless Mode
06:10

Real Life Project No.1 - Scrape Yelp.com

4 lectures
Intro and Set Up
06:27
Locators for the Data
21:39
Output Data in Dataframes
13:50
Cleaning Data and Save in Excel
07:06

Real Life Project #2 - Scrape Pokemon Table

5 lectures
Set Up and Initialization
07:40
Locators for the necessary Data
09:27
Store and Prepare the Data
14:34
Cleaning the Data
06:05
Save our cleaned Data in the Excel File
02:27

Next Steps

1 lectures
Resources
00:13

Đánh giá của học viên

Chưa có đánh giá
Course Rating
5
0%
4
0%
3
0%
2
0%
1
0%

Bình luận khách hàng

Viết Bình Luận

Bạn đánh giá khoá học này thế nào?

image

Đăng ký get khoá học Udemy - Unica - Gitiho giá chỉ 50k!

Get khoá học giá rẻ ngay trước khi bị fix.