People of Data Engineering
A list of the people of Data Engineering (sorted alphabetically). This list is not meant to be 100% complete, and descriptions are minimal. It’s a personal list that I am sharing with you.
- Adi Polak: Author of Scaling Machine Learning with Spark
- Alexey Grigorev: Blog, Zoomcamps and GitHub tutorials
- Ananth Packkildurai: Functional Data Engineering, Creator of dataengineeringweekly.com.
- Andreas Kretz: Creator of The Data Engineering Cookbook
- Andy Grove: Apache Arrow PMC. Creator of the DataFusion and Ballista (Arrow) query engines
- Andy Petrella: Writing on Data Observability
- Barr Moses: Great articles on Data Observability.
- Bartosz Konieczny: Author of the Waiting for Code blog, a course called Becoming a Data Engineer, and a book called Data Engineering Patterns on the Cloud
- Ben Rogojan: The Seattle Data Guy, with a popular and early YouTube channel.
- Benn Stancil: Prolific writer on his blog; his posts usually start with “It’s Friday, let’s fight…”
- Cássio Bolba: Amazing modeling blog posts on Medium
- Chad Sanderson: Data contracts, and captivating articles
- Chris Riccomini: Essays on tech, data, and streaming
- Christophe Blefari: A combination of aggregate newsletters and one-off articles on data engineering
- Daniel Beach: Broad range of data engineering topics
- Darshil Parmar: Popular YouTube channel
- Denny Lee: Delta Lake, Rust, OSS
- Dipankar Mazumdar: Specialized in Open Table Formats and lakehouses.
- Ergest Xheblati: Explores data patterns in data engineering, data science and analytics
- Erik Bernhardsson: Building a simple version of Kubernetes (Modal)
- Itai Yaffe: Druid expert working with Apache Spark and Databricks, active in Women in Big Data, and a frequent public speaker.
- Jacek Laskowski: Apache Spark, Delta Lake, Databricks, Apache Kafka, Kafka Streams, ksqlDB
- James Trunk: Clojure and Functional Programming.
- Jérémy Ravenel: Naas, turning Jupyter Notebooks into powerful automation, analytics, and AI
- Joe Reis: Co-author of Fundamentals of Data Engineering
- Jonathan Neo: Creator of Data Engineering Bootcamp
- Jorrit Sandbrink: Strong understanding of the field and deep dive articles on LinkedIn.
- Joseph Machado: Lots of great how-tos and projects on Start Data Engineering
- Kyle Weller: Specialist in Apache Hudi and comparisons among different Data Lake Table Formats
- Long Bui: Great Data Engineering Handbook (DEH)
- Marc Lamberti: Airflow
- Martin Kleppmann: Author of Designing Data-Intensive Applications, researcher at TU Munich.
- Matei Zaharia: Chief Technologist at Databricks
- Matt Housley: Co-author of Fundamentals of Data Engineering
- Matt Turck: Creator of MAD landscape
- Matt Weingarten: Data Engineer at Disney Streaming Services. Previously at Facebook and Nielsen.
- Maxime Beauchemin: Father of Data Engineering and Functional Data Engineering.
- Mehdi Ouazza: Awesome written content, also now on YouTube. Creator of Data Creators Club.
- Michael Armbrust: Creator of Delta Lake; distributed databases, query languages, Scala, and other nerdy stuff…
- Michael Kahan: Popular YouTube channel and content on data engineering
- Mike Driscoll: Founded Rill Data, Metamarkets and CustomInk.com. Founding partner at DCVC.
- Nick Schrock: Dagster, data orchestration
- Peter Marshall: Druid Advocate
- Petr Janda: Awesome blog posts on petr@substack; now working on Synq
- Robert Sahlin: Data Platform with Google Cloud
- Ryan Yackel: MAD Data Podcast and genuine news of data engineering.
- Sandy Ryza: Dagster and passionate about Partitioning and Backfill.
- Sarah Krasnik: Great for infra and solutions insights
- Shane Gibson: Data modeling, in data for 30 years. Not technical, but about agile and data modeling.
- Simon Whiteley: Databricks, data engineering, popular YouTube channel
- Stephen Bailey: Exploring the world of data and its adjacencies at Data People Etc.
- Thalia Barrera: Excellent post on data engineering
- ThePrimeagen: Rust, Netflix, programming, Neovim
- Tobias Macey: Data engineering podcast
- Vu Trinh: Author of VuTrinh Substack with deep dives on common DE tools
- Wayne Eckerson: Author, keynote speaker, and consultant at Eckerson Group
- Wes McKinney: Pandas / Arrow
- Xinran Waibel: Personalization Data Engineering at Netflix
- Zach Wilson: Data Engineering Challenges at Hyperscale
- and Simon Späti (myself :): Lots of open-source data engineering
Find more on:
- Data Creators Club
- The Top Data Engineering Influencers on LinkedIn
- RSS feeds for Data Engineering
- Open-Source Data Engineering Projects
- Books of Data Engineering
- and Data Engineering Twitter Lists
Created 2023-04-17