Mô tả

Have you ever thought about how important is data in our everyday life?

It doesn't matter if you're a data scientist, data analyst, or programmer, getting access to the data you need is important! This is why some of the top Fortune 500 companies such as Walmart, Target, and CNN use web scraping to stay ahead with data. We can say this is one of their best secret weapons.

Welcome to Web Scraping in Python with BeautiuflSoup, Selenium, and Scrapy!

The most complete and project-oriented course. In this course, we're going to learn web scraping solving cool projects that will help us scrape data off some of the most well-known websites. To name a few:


  • Amazon Audible

  • Twitter

  • Google

  • Football Stats

  • Worldometers

  • Movie pages


This is the most complete web scraping course in Python and is split into 3 parts: BeautiuflSoup, Selenium, and Scrapy. My goal is that you become an expert in web scraping. This is why only 10% of the content of this course is focused on Beautiful Soup, while 40% is dedicated to Selenium and 50% to Scrapy.


Why I built the course this way?

Beautiful Soup has a lot of limitations, but it's still useful for a gentle introduction to web scraping. In contrast, if you learn Selenium you will be able to scrape any website. and if you add Scrapy to your toolkit you will be able to scrape any website faster than with any other web scraping library. Throughout this course, you will see the power of web scraping with each Python library.


What makes this course different from the others, and why you should enroll?

  • This is the most updated and complete web scraping course in Python. You will be using Python 3.7, Scrapy 2.4, Selenium 3.141.0 and BS4

  • This is the most project-based course you will find. We will scrape many of the most well-known websites

  • You will have an in-depth step by step guide on how to become a professional web scraper.

  • You will learn how to use Selenium, Scrapy, and Splash to scrape JavaScript-driven websites. You won't find any tutorials out there that teach how to use these web scraping tools the way I do in this course.

  • You will learn how to use ChatGPT for web scraping

  • 30 days money back guarantee by Udemy


At the end of the course, you'll know how to build your own web scraper to obtain new data, learn the best techniques for web scraping and more.


Join me now and let's start scraping the web together!

Bạn sẽ học được gì

Yêu cầu

Nội dung khoá học

17 sections

Introduction

7 lectures
Update 2024: Read this before you start
00:28
What is the best web scraping library? BS4 vs Selenium vs Scrapy
03:30
Installing Python and PyCharm
04:57
Python Basics for Web Scraping
30:31
Web Scraping Cheat Sheet for The Course (Don't skip it)
00:16
HTML for Web Scraping: Tags and Elements
07:49
HTML for Web Scraping: Tree Structure
03:52

Introduction to Beautiful Soup

6 lectures
Important: Versions of libraries that work for this section
00:31
Installing Beautiful Soup
03:13
Beautiful Soup Basics
04:01
How To Get The HTML from a Website
04:41
How to Scrape a Single Page
10:34
Exporting Data to a TXT File
03:20

Project #1: Scraping Multiple Pages with Beautiful Soup

3 lectures
Update Scripts 2024
00:35
Scraping Multiple Links within The Same Page
16:28
Pagination
18:27

XPath

4 lectures
Important Note (Don't skip it)
00:11
XPath Syntax, Functions and Operators
03:41
Test Your XPath
05:29
Special Characters and Syntax
04:37

Introduction to Selenium

9 lectures
Important: Versions of libraries that work for this section
00:42
How to Identify Java Script Driven Websites
03:25
Installing Selenium and ChromeDriver
02:19
Importing Libraries and Creating The Driver
03:37
How to Find Elements with Selenium (Theory)
03:41
Clicking on a Button
06:36
Extracting Data from a Table
12:26
Exporting Data to a CSV File with Pandas
05:15
Selecting Elements within a Dropdown
07:41

Project #1: Dealing with Multiple Pages using Selenium (Amazon Audible Bot)

5 lectures
Update Scripts 2024
00:34
Building the bot
20:42
Headless Mode
05:04
Handling Pagination
17:05
Implicit vs Explicit Waits
09:03

Introduction to Scrapy

12 lectures
Important: Versions of libraries that work for this section
00:22
Scrapy Installation
02:23
Setting up Scrapy with Pycharm
02:40
Note: How to Show Hidden Folders on Mac
01:07
Scrapy's Commands
04:13
Creating our First Project and Spider
06:00
Scrapy Templates & How to Find Elements with Scrapy
03:21
Scrapy's Shell Command
04:08
Building a Spider
02:59
Getting links listed in a website
06:19
Relative vs Absolute Links
05:10
Scraping Data from Multiple Links
10:53

Dealing with Multiple Pages using Scrapy

4 lectures
Update Scripts 2024
00:10
Building The Spider
14:24
Pagination
06:22
Changing User-Agent
07:03

Project #3: Building Crawlers with Scrapy

4 lectures
Update Scripts 2024
00:17
Crawler - Rules
11:25
Crawler - Pagination
05:11
Crawler - Changing User-Agent
04:30

Exporting Data to a Database: MongoDB and SQLite

4 lectures
Update Scripts 2024 (and MongoDB)
00:21
Pipelines
06:00
MongoDB
09:29
SQLite3
09:37

Scraping APIs & Login to website with Scrapy

3 lectures
Scraping APIs - Building the Spider
11:34
Scraping APIs - Dealing with Pagination
05:31
Login to website - Building the Spider
10:04

Splash Basics

4 lectures
Section Overview
00:14
Setting up Splash with Docker
04:30
Introduction to Splash
06:02
How to Find Elements with Splash
05:10

Project #4: Scraping JavaScript websites using Scrapy/Splash

4 lectures
Building the Script in Splash
08:15
Using Splash with Scrapy
06:45
Parsing
05:45
Changing User-Agent
04:28

How to Make Money Using Web Scraping

4 lectures
Section Overview
00:18
How I Monetized My Web Scraping Skills Over The Past 2 Years
14:45
Getting Started - Part 1
13:20
Getting Started - Part 2
20:17

Prize: ChatGPT for Web Scraping

7 lectures
Section Overview
00:14
Web Scraping with ChatGPT - Part 1
11:48
Web Scraping with ChatGPT - Part 2
14:57
Web Scraping with ChatGPT Plugin
09:21
Web Scraping with ChatGPT Code Interpreter
12:30
How to create a GPT that scrapes websites
09:59
Congratulations!
00:18

Appendix: Infinite Scrolling & Login (Twitter Bot)

5 lectures
Important Note
00:48
How to Login to Websites
11:51
Building a Function
22:17
Infinite Scrolling
08:33
Scraping the Website with Infinite Scrolling
12:28

Bonus: Python for Data Science Bootcamp 2022: From Zero to Hero

10 lectures
Section Overview (+New Cheat Sheet)
00:26
Dataset Overview and Making Pivot Table
09:35
Lineplot
04:25
Barplot
07:45
Piechart
03:41
Boxplot
03:34
Histogram
01:29
Scatterplot
04:44
Save Plot and Export Pivot Table
03:06
Interactive Visualization with Pandas
27:12

Đánh giá của học viên

Chưa có đánh giá
Course Rating
5
0%
4
0%
3
0%
2
0%
1
0%

Bình luận khách hàng

Viết Bình Luận

Bạn đánh giá khoá học này thế nào?

image

Đăng ký get khoá học Udemy - Unica - Gitiho giá chỉ 50k!

Get khoá học giá rẻ ngay trước khi bị fix.