What is the Medallion Data Lakehouse Architecture all about?

A Design Principle for Building a Modern Data Platform

By Christianlauer · Published in CodeX · 3 min read · Oct 1, 2022

Photo by Fineas Anton on Unsplash

Building a Data Lakehouse is not that different from designing the architecture of a classical Data Warehouse. For the Lakehouse, the Medallion approach has established itself as the design pattern.

Data Lakehouse Recap

Building a Data Lakehouse is not just about integrating a Data Lake with a Data Warehouse. It means integrating a Data Lake, a Data Warehouse, and purpose-built storage to enable unified governance and easy data movement [1]. In my experience, a Data Lake can be set up much faster than a Data Warehouse. Once all data is available in the lake, a Data Warehouse can still be built on top of it as a hybrid solution.

Hybrid Data Lake Concept — Image from Author

The Medallion Approach

The Medallion approach does not question this principle; it describes how data is managed in the layers underneath. The architecture guarantees atomicity, consistency, isolation, and durability (ACID) as data passes through multiple layers of validation and transformation before being stored in a layout optimized for efficient analysis. The terms bronze (raw), silver…
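To make the idea of data passing through successive layers more concrete, here is a minimal sketch in plain Python. It is only an illustration, not how a real Lakehouse engine works: the records, the field names, and the functions `to_silver` and `to_gold` are hypothetical, and the third, aggregated tier is called gold here as an assumption based on common usage of the pattern. It also says nothing about ACID guarantees, which come from the storage layer rather than from code like this.

```python
from collections import defaultdict

# Bronze: raw records exactly as they arrived (hypothetical sample data).
bronze = [
    {"order_id": "1", "customer": "alice", "amount": "19.90"},
    {"order_id": "2", "customer": "bob", "amount": "not-a-number"},
    {"order_id": "3", "customer": "alice", "amount": "5.10"},
    {"order_id": "3", "customer": "alice", "amount": "5.10"},  # duplicate
]


def to_silver(rows):
    """Validate and clean bronze rows: parse types, drop bad rows and duplicates."""
    seen = set()
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # row fails validation, so it stays in bronze only
        if row["order_id"] in seen:
            continue  # skip duplicate order ids
        seen.add(row["order_id"])
        clean.append(
            {
                "order_id": int(row["order_id"]),
                "customer": row["customer"],
                "amount": amount,
            }
        )
    return clean


def to_gold(rows):
    """Aggregate silver rows into an analysis-ready view: revenue per customer."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["customer"]] += row["amount"]
    return {customer: round(total, 2) for customer, total in totals.items()}


silver = to_silver(bronze)
gold = to_gold(silver)
print(silver)
print(gold)
```

The raw records are never modified; each layer is derived from the one before it, so a faulty validation rule can be corrected and the downstream layers rebuilt from bronze.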

