Mô tả

Why should I learn DuckDB?


  1. + 1200% of searches in the last 2 years
    Its popularity is growing RAPIDLY!


  2. Data lakes and bulky Big Data Infrastructure (like Apache Hadoop & Spark) are not optimal solution to every Data problem

    DuckDB is an awesome solution for running a database very similar to PostgreSQL, but with HUGE Analytical Capabilities, locally without any fuss


  3. 100% free & supports dozens of various integrations

    duckdb Python, duckdb dbt, duckdb Streamlit, duckdb s3 & wasm & Docker + many more: you can almost anything with it. Additionally, you can easily do data exports: duckdb csv, duckdb parquet, duckdb json are all ways to share your analysis results in no time! Python integration is as easy as doing "pip install duckdb" & you're ready to go! We will dive deep into duckdb Python integration in one of the cases.


  4. Ease of use
    Rather than having a PostgreSQL/Mariadb for each developer on the team, you can setup configuration to spawn an in memory instance of DuckDB. If you need to fetch data from the Internet, it's no problem either: Duckdb Httpfs is a package that we'll also study.

  5. Local Analysis of BigData
    If you want to run a columnar database locally on pretty big data, there isn't really anything else like it. You could instead run PySpark locally but that would be much more of a headache. Duckdb Pivot can even help you create Spreadsheet-like tables.

  6. Easy to learn after SQLite
    It's a step forward to Analytics field from SQLite. DuckDB performs great when running aggregate queries on limited columns whereas SQLite works great when fetching one or more rows using filters. In the Course we will compare and contrast duckdb vs Sqlite and duckdb vs Clickhouse.

  7. 300%+ faster than Pandas
    Pandas loads all data into memory and runs on a single thread. Hence it can't operate on larger than memory datasets and also doesn't use all of your CPU  cores. Whereas DuckDB can operate on datasets larger than memory. Moreover, it can distribute load across all the CPU cores. All that using SQL language by default!


This Course is not just a duckdb tutorial: it's a packaged solution to master this new & rapidly growing technology.


Expected Outcomes

After this Course:

  • You will learn how to Architect & Implement Analytics Solutions that use duck db as the database

  • You will learn the underlying principles that make DuckDB so fast on any machine (Theory)

  • You will understand how DuckDB is different from other data bases: both Analytical (Clickhouse, Redshift, Cassandra) and OLTP (PostgreSQL, SQLite)

  • You will learn to work with DuckDB from Python environment (Practice)

  • You will learn to work with DuckDB from CLI (command line) environment (Practice)

  • Use DuckDB as a backend database for your Streamlit Python Analytics Apps (Practice)

  • Use a DuckDB dbt (Data Build Tool) combo to streamline Analytics Data Warehouse development (Practice)

  • You will learn to work in MotherDuck: a Cloud-native environment (SaaS) for duck db (Practice). You can think of it as DuckDB GUI that you might miss in CLI

  • Learn to interact with DuckDB inside Docker environment

  • Understand how DuckDB fits into Micro-service architecture of Analytical services

  • Use Rill: a DuckDB-powered BI-as-Code "last-mile ETL" platform for blazing fast Dashboarding


What's inside


  • Video lectures (with interactive annotations)

  • PDFs with Practice Cases Outlines

  • Demo Resources

  • Fully packaged code base for Practice Projects

  • Full lifetime access with all future updates

  • Certificate of course completion

  • 30-Day Money-Back Guarantee

The course isn't static! I collect students' feedback and work on improving it


[Course Updates]:

01.2024: + Bonus Section: Let's build a DuckDB-powered Recommender Micro-service

02.2024: + "Rill Data" Section: DuckDB-powered BI-as-Code "last-mile ETL" platform


Digital assets used:

-Image from freepik with free licence from freepik dot com "Free vector gradient dynamic blue lines background"

Bạn sẽ học được gì

Architect & Implement Analytics Solutions that use DuckDB as the database

You will learn the underlying principles that make DuckDB so fast on any machine (Theory)

You will learn to work with DuckDB from Python environment (Practice)

You will learn to work with DuckDB from CLI (command line) environment (Practice)

Use DuckDB as a backend database for your Streamlit Python Analytics Apps (Practice)

Combine DuckDB with dbt (Data Build Tool) to streamline Analytics Data Warehouse development (Practice)

You will learn to work in MotherDuck: a Cloud-native environment (SaaS) for DuckDB (Practice)

You will understand how DuckDB is different from other data bases: both Analytical (Clickhouse, Redshift, Cassandra) and OLTP (PostgreSQL, SQLITE)

Yêu cầu

  • Basic SQL is helpful but not necessary (we'll use guides provided)
  • Basic Python
  • Laptop or PC

Nội dung khoá học

10 sections

Course Introduction

3 lectures
Welcome!
02:18
What will You Learn in this Course?
02:39
What is DuckDB & Why is it SO COOL?
02:39

DuckDB Introduction

6 lectures
What is DuckDB? (detailed)
06:12
Why use DuckBD?
06:07
What role does DuckDB play in modern Analytics World?
04:15
DuckDB's competition & market niche
06:57
When should you use DuckDB? (typical use cases)
06:26
Who Should Use DuckDB?
02:41

Environment Setup & Demo

4 lectures
DuckDB Installation
07:19
Environment configuration
07:56
Getting started with DuckDB's SQL
05:48
Outputting SQL's results into files
08:33

CLI usage: DuckDB's Innovations in SQL

7 lectures
Practice Case Description
03:02
Importing Data
03:15
DuckDB SQL Innovations: SUMMARIZE & REPLACE
05:10
DuckDB SQL Innovations: EXCLUDE & COLUMNS & GROUP BY ALL
05:26
Window Functions: the DuckDB way
04:07
PIVOTing in DuckDB
03:13
TABLE Functions in DuckDB
01:54

Duckdb Python

5 lectures
Practice Case Description
01:09
Downloading Data
03:31
Duckdb and Python: Analytics workflow - part1
06:34
Duckdb and Python: Analytics workflow - part2
05:01
Duckdb and Python: Analytics workflow - part3
04:21

Streamlit + Duckdb

5 lectures
Streamlit Introduction
01:28
Practice Case Description
04:39
Fetching Data - part1
02:32
Fetching Data - part2
04:59
Launching the App
05:38

Duckdb + DBT

9 lectures
Data Build Tool (dbt) Introduction
01:44
Practice Case Description
02:45
Data Walkthrough
03:26
Fetching Data - part1
07:21
Fetching Data - part2
02:32
Running dbt Pipeline
07:17
DBeaver: Amazing Database Management Tool
02:46
DuckDB Backward Compatibility Issue: SOLVED
06:52
Exploring End Result: duckdb DataWarehouse
02:59

MotherDuck: Cloud offering of DuckDB as a SaaS

5 lectures
What is MotherDuck?
00:56
MotherDuck's Features
07:43
Attaching a Remote Database
05:54
Detaching a Remote Database
01:53
Automating Authentication to MotherDuck Platform
03:47

Rill: a DuckDB-powered BI-as-Code platform

14 lectures
Rill Intro
01:01
Case End Product DEMO
03:03
What is Rill?
03:54
Case Data
04:36
Data Sources
05:06
Data Models
03:43
Dashboard Outlining
02:41
UI: Part 1
05:01
UI: Part 2
04:33
Setting up a Github repo
05:05
Connecting Rill Cloud to Github
02:36
Sharing access to Dashboard
04:02
Scheduling Data Refresh
05:18
Deleting Rill Project
01:31

Bonus: Spinning up a DuckDB-powered Recommender Microservice

14 lectures
Case Intro
01:28
Business Case Architecture
06:01
Disclaimer: Fast Forward if needed
01:09
Movies Data base API
04:24
Dockerfile: Packaging the Project
03:46
Managing Python Dependencies with Poetry
03:41
Fetching Data from API
05:53
Understanding Data
03:50
Recommender System Codebase
05:59
FastAPI Microservice
02:15
Building Docker Image
05:54
Exploring Data inside DuckDB
06:24
Getting Recommendations
05:21
Wrapping Up the Case
01:11

Đánh giá của học viên

Chưa có đánh giá
Course Rating
5
0%
4
0%
3
0%
2
0%
1
0%

Bình luận khách hàng

Viết Bình Luận

Bạn đánh giá khoá học này thế nào?

image

Đăng ký get khoá học Udemy - Unica - Gitiho giá chỉ 50k!

Get khoá học giá rẻ ngay trước khi bị fix.