Member-only story
What is Dark Data?
How it occurs and how to make it usable
Today in the era of Big Data, organizations are constantly collecting masses of data. But often the collected data is stored without being analyzed. This data, which exists but is not used is referred to as Dark Data.
What is Dark Data? — A Definition
Dark Data is data that is not accessible to an organization. This can be data that is incomplete, has not been evaluated, exists in secret or has not been recorded at all. Essential to our understanding of the term is that it is relative. Dark Data is particularly evident in the context of Big Data fields like IoT, social media and so. Often, so much data is continuously generated that it cannot be processed and analyzed in a timely manner [1][2].
Reasons for Dark Data
There are various reasons for the emergence of Dark Data or the decision to allow Dark Data. These can be, for example [1][2]:
Legal Reasons and/or Archiving
Maybe you (have to) back up data or/and archive it without taking into account how often it is used for example due to…