Search

Search IconIcon to open search

Data Engineering Vault


Welcome to the Data Engineering Vault, an integral part of my larger Second Brain. This curated network of data engineering knowledge is designed to facilitate exploration, discovery, and deep learning in the field of data engineering. Here, you'll find a rich ecosystem of 1000+ interconnected terms and concepts, each serving as a gateway to deeper insights. Functioning like a Digital Garden for data engineering, this network allows you to organically explore and connect ideas.

Key Topics & Concepts

As you navigate through the concepts, you'll uncover hidden relationships, expanding your understanding and providing a unique, immersive learning experience whether you're a seasoned data engineer or just starting your journey.

Data engineering is a term that has shifted over the years from a Database Admins (DBA), ETL Developer, and Business Intelligence Specialist and merged with Software Engineers to a Data Engineer with the growth of data made his title.

It’s still not well defined, the latest book on Fundamentals of Data Engineering (Joe Reis, Matt Housley) tries and does probably best as of today; it’s getting clearer. Besides several boot camps, universities are also starting to get a degree in data engineering like Data Science did before. Let’s start by defining what data engineering is.

# What is Data Engineering

Data engineering is the less famous sibling of data science. Data science is growing like no tomorrow, as is data engineering, but it is much less heard of. Compared to existing roles, it would be a software engineering plus business intelligence engineer, including big data capabilities across the Hadoop ecosystem, streaming, and computation at scale.

Find the full story at What is Data Engineering.

# Evolution of Data Engineering

The History of Data Engineering is long and here are some articles that explain the evolution we went through:

# Getting Started with Data Engineering

Additional resources that can further enhance your understanding of data engineering. Whether you’re just starting out or looking to deepen your expertise, these resources are handpicked for their clarity, depth, and practical insights.

Essential Toolkit

Start with the Data Engineering Toolkit - a comprehensive guide to 70+ technologies and tools that form the foundation of modern data engineering practice.

# Must-Read Articles

Begin your journey with the “holy trinity” from Maxime Beauchemin, defining the essence of data engineering:

# Community and Learning

Don’t miss out on these foundational reads and thought leaders in the field:

Feel free to explore, learn, and contribute to this ever-growing field. Your journey in data engineering is just beginning.


Origin: Data Engineering, the future of Data Warehousing?

🔶 This note was on Hacker News. Read the comments