What is Dark Data?

How it occurs and how to make it usable

Christianlauer
3 min read · Jul 20, 2022
Photo by Breno Machado on Unsplash

Today, in the era of Big Data, organizations are constantly collecting masses of data. Often, however, the collected data is stored without ever being analyzed. Data that exists but is not used is referred to as Dark Data.

What is Dark Data? — A Definition

Dark Data is data that is not accessible to an organization. This can be data that is incomplete, has not been evaluated, exists in secret, or has not been recorded at all. Essential to our understanding of the term is that it is relative. Dark Data is particularly evident in Big Data fields such as IoT and social media. Often, so much data is generated continuously that it cannot be processed and analyzed in a timely manner [1][2].

Dark Data illustration — Source: Devopedia

Reasons for Dark Data

Dark Data can emerge for various reasons, and sometimes an organization deliberately decides to allow it. Examples include [1][2]:

Legal Reasons and/or Archiving

You may back up or archive data (or be required to) without regard to how often it is used, for example due to…

Written by Christianlauer

Big Data Enthusiast based in Hamburg and Kiel. Thankful if you would support my writing via: https://christianlauer90.medium.com/membership