🧠 Second Brain

Search

Search IconIcon to open search

Entity-Centric Data Modeling (ECM)

Last updated Feb 9, 2024

A new term was introduced by Maxime Beauchemin . Entity-centric data modeling (ECM), is a new approach to data modeling for analytics that elevates the core idea of an “entity” (i.e.: user, customer, product, business unit, ad campaign, etc.) at the very top of things.

Essentially, the entity-centric model is to push it further by enriching those dimensions with metrics and data structures, combining Dimensional Modeling with Feature Engineering.

Max recommends Snapshotting over SCD2 (for several reasons, see RW Introducing Entity-Centric Data Modeling for Analytics), but mainly for simplicity and for the tradeoff of more storage of dimensions which are usually small. But with ECM, he wants to add metrics directly to the dimensions, e.g. customers with weekly_active_users can be queried with visits.7d >= 3. These columns like visits.7d are getting updated on each snapshot, making it easy to query.

It’s not a Semantic Layer, as Max wrote about:

The Case for Dataset-Centric visualization”, the idea being that visualizations/BI tools work best with simple tabular datasets as opposed to complex semantic layers or sets of more normalized datasets.

# Highlights from the Article

Key Points from RW Introducing Entity-Centric Data Modeling for Analytics by Max:


Origin: Introducing Entity-Centric Data Modeling for Analytics
References:
Created 2023-04-17