Mô tả

In this course you will build MULTIPLE practical systems using natural language processing, or NLP - the branch of machine learning and data science that deals with text and speech. This course is not part of my deep learning series, so it doesn't contain any hard math - just straight up coding in Python. All the materials for this course are FREE.

After a brief discussion about what NLP is and what it can do, we will begin building very useful stuff. The first thing we'll build is a cipher decryption algorithm. These have applications in warfare and espionage. We will learn how to build and apply several useful NLP tools in this section, namely, character-level language models (using the Markov principle), and genetic algorithms.

The second project, where we begin to use more traditional "machine learning", is to build a spam detector. You likely get very little spam these days, compared to say, the early 2000s, because of systems like these.

Next we'll build a model for sentiment analysis in Python. This is something that allows us to assign a score to a block of text that tells us how positive or negative it is. People have used sentiment analysis on Twitter to predict the stock market.

We'll go over some practical tools and techniques like the NLTK (natural language toolkit) library and latent semantic analysis or LSA.

Finally, we end the course by building an article spinner. This is a very hard problem and even the most popular products out there these days don't get it right. These lectures are designed to just get you started and to give you ideas for how you might improve on them yourself. Once mastered, you can use it as an SEO, or search engine optimization tool. Internet marketers everywhere will love you if you can do this for them!

This course focuses on "how to build and understand", not just "how to use". Anyone can learn to use an API in 15 minutes after reading some documentation. It's not about "remembering facts", it's about "seeing for yourself" via experimentation. It will teach you how to visualize what's happening in the model internally. If you want more than just a superficial look at machine learning models, this course is for you.

"If you can't implement it, you don't understand it"

  • Or as the great physicist Richard Feynman said: "What I cannot create, I do not understand".

  • My courses are the ONLY courses where you will learn how to implement machine learning algorithms from scratch

  • Other courses will teach you how to plug in your data into a library, but do you really need help with 3 lines of code?

  • After doing the same thing with 10 datasets, you realize you didn't learn 10 things. You learned 1 thing, and just repeated the same 3 lines of code 10 times...


Bạn sẽ học được gì

Write your own cipher decryption algorithm using genetic algorithms and language modeling with Markov models

Write your own spam detection code in Python

Write your own sentiment analysis code in Python

Perform latent semantic analysis or latent semantic indexing in Python

Have an idea of how to write your own article spinner in Python

Yêu cầu

  • Install Python, it's free!
  • You should be at least somewhat comfortable writing Python code
  • Know how to install numerical libraries for Python such as Numpy, Scipy, Scikit-learn, Matplotlib, and BeautifulSoup
  • Take my free Numpy prerequisites course (it's FREE, no excuses!) to learn about Numpy, Matplotlib, Pandas, and Scikit-Learn, as well as Machine Learning basics
  • Optional: If you want to understand the math parts, linear algebra and probability are helpful

Nội dung khoá học

15 sections

Natural Language Processing - What is it used for?

3 lectures
Introduction and Outline
07:48
Why Learn NLP?
05:59
The Central Message of this Course (Big Picture Perspective)
08:12

Course Preparation

3 lectures
How to Succeed in this Course
03:04
Where to get the code and data
12:03
How to Open Files for Windows Users
02:18

Machine Learning Basics Review

11 lectures
Machine Learning: Section Introduction
16:07
What is Classification?
12:22
Classification in Code
14:38
What is Regression?
12:13
Regression in Code
08:29
What is a Feature Vector?
06:48
Machine Learning is Nothing but Geometry
04:50
All Data is the Same
05:23
Comparing Different Machine Learning Models
09:46
Machine Learning and Deep Learning: Future Topics
05:55
Section Summary
05:47

Markov Models

13 lectures
Markov Models Section Introduction
02:42
The Markov Property
07:34
The Markov Model
12:30
Probability Smoothing and Log-Probabilities
07:50
Building a Text Classifier (Theory)
07:29
Building a Text Classifier (Exercise Prompt)
06:33
Building a Text Classifier (Code pt 1)
10:32
Building a Text Classifier (Code pt 2)
12:06
Language Model (Theory)
10:15
Language Model (Exercise Prompt)
06:52
Language Model (Code pt 1)
10:45
Language Model (Code pt 2)
09:25
Markov Models Section Summary
03:00

Decrypting Ciphers

12 lectures
Section Introduction
07:11
Ciphers
03:59
Language Models
16:06
Genetic Algorithms
21:23
Code Preparation
04:46
Code pt 1
03:06
Code pt 2
07:20
Code pt 3
04:52
Code pt 4
04:03
Code pt 5
07:12
Code pt 6
05:25
Section Conclusion
06:00

Build your own spam detector

11 lectures
Build your own spam detector - description of data
02:08
Build your own spam detector using Naive Bayes and AdaBoost - the code
05:14
Key Takeaway from Spam Detection Exercise
05:56
Naive Bayes Concepts
09:56
AdaBoost Concepts
05:11
Other types of features
01:30
Spam Detection FAQ (Remedial #1)
08:45
What is a Vector? (Remedial #2)
06:04
SMS Spam Example
06:23
SMS Spam in Code
10:17
Suggestion Box
03:10

Build your own sentiment analyzer

7 lectures
Description of Sentiment Analyzer
03:12
Logistic Regression Review
07:32
Preprocessing: Tokenization
04:48
Preprocessing: Tokens to Vectors
06:20
Sentiment Analysis in Python using Logistic Regression
19:48
Sentiment Analysis Extension
06:01
How to Improve Sentiment Analysis & FAQ
12:19

NLTK Exploration

4 lectures
NLTK Exploration: POS Tagging
02:00
NLTK Exploration: Stemming and Lemmatization
02:06
NLTK Exploration: Named Entity Recognition
03:13
Want more NLTK?
01:59

Latent Semantic Analysis

5 lectures
Latent Semantic Analysis - What does it do?
02:30
SVD - The underlying math behind LSA
15:49
Latent Semantic Analysis in Python
10:08
What is Latent Semantic Analysis Used For?
09:40
Extending LSA
06:16

Write your own article spinner

6 lectures
Article Spinning Introduction and Markov Models
02:43
Trigram Model
02:11
More about Language Models
09:53
Precode Exercises
05:05
Writing an article spinner in Python
11:33
Article Spinner Extension Exercises
05:42

How to learn more about NLP

1 lectures
What we didn't talk about
02:45

Setting Up Your Environment (FAQ by Student Request)

3 lectures
Pre-Installation Check
04:12
Anaconda Environment Setup
20:20
How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow
17:32

Extra Help With Python Coding for Beginners (FAQ by Student Request)

4 lectures
How to Code by Yourself (part 1)
15:54
How to Code by Yourself (part 2)
09:23
Proof that using Jupyter Notebook is the same as not using it
12:29
Python 2 vs Python 3
04:38

Effective Learning Strategies for Machine Learning (FAQ by Student Request)

4 lectures
How to Succeed in this Course (Long Version)
10:24
Is this for Beginners or Experts? Academic or Practical? Fast or slow-paced?
22:04
Machine Learning and AI Prerequisite Roadmap (pt 1)
11:18
Machine Learning and AI Prerequisite Roadmap (pt 2)
16:07

Appendix / FAQ Finale

2 lectures
What is the Appendix?
02:48
BONUS
05:48

Đánh giá của học viên

Chưa có đánh giá
Course Rating
5
0%
4
0%
3
0%
2
0%
1
0%

Bình luận khách hàng

Viết Bình Luận

Bạn đánh giá khoá học này thế nào?

image

Đăng ký get khoá học Udemy - Unica - Gitiho giá chỉ 50k!

Get khoá học giá rẻ ngay trước khi bị fix.