The Data Engineering Cookbook

数据工程手册。「The Data Engineering Cookbook」

Github星跟蹤圖

The Data Engineering Cookbook

I get asked super often how to become a Data Engineer.
That's why I decided to start this cookbook with all the topics you need to look into.

It's not only useful for beginners, professionals will definitely like the case study section.

Here's the download shortcut:
Data Engineering Cookbook PDF

How to use the cookbook

I split this cookbook into five parts

  • Part one is the introduction to the book
  • In part two you will learn the basic data engineering skills
  • Part three contains a real world data engineering example we currently work on
  • The fourth part contains over 30 case studies with links from companies like Netflix, Twitter, Spotify
  • Part five is a collection of one thousand and one interview questions (currently approx. 150)

How to contribute

If you have some cool links or topics for the cookbook, please become a contributor.
Simply open an issue and add your links. Or pull the repo, add them and create a pull request.

Please pull only the "working-branch" branch.
This way we keep the master branch clean and I don't have to mess around resolving conflicts. You just need to change the .tex file. I'll recompile it later when I merge the branch with the master

For comments please also use the "Issues" function.

Support

Everything is free, but please support what you like!
Join my Patreon and become a plumber yourself:
Link to my Patreon

Subscribe to my Plumbers of data science YouTube channel:
Link to YouTube

Check out my personal blog. Get updated via mail and get on my mailing list:
andreaskretz.com

I have a Medium publication where you can publish your data engineer articles:
Medium publication

主要指標

概覽
名稱與所有者andkret/Cookbook
主編程語言Python
編程語言TeX (語言數: 1)
平台
許可證Apache License 2.0
所有者活动
創建於2019-03-10 22:20:03
推送於2025-03-25 15:06:11
最后一次提交2025-03-25 16:05:03
發布數0
用户参与
星數14.3k
關注者數546
派生數2.6k
提交數398
已啟用問題?
問題數135
打開的問題數114
拉請求數72
打開的拉請求數8
關閉的拉請求數13
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?