Intrinsic indicators for numerical data quality

  • Milen S. Marev
  • , Ernesto Compatangelo
  • , Wamberto W. Vasconcelos

Research output: Chapter in Book/Report/Conference proceedingPublished conference contribution

2 Citations (Scopus)

Abstract

This paper focuses on data quality indicators conceived to measure the quality of numerical datasets. We have devised a set of three different indicators, namely Intrinsic Quality, Distance-based Quality Factor and Information Entropy. The results of quality measures based on these indicators can be used in further data processing, helping to support actual data quality improvements. We argue that the proposed indicators can adequately capture in a quantitative way the impact of different numerical data quality issues including (but not limited to) gaps, noise or outliers.

Original languageEnglish
Title of host publicationIoTBDS 2020 - Proceedings of the 5th International Conference on Internet of Things, Big Data and Security
EditorsGary Wills, Peter Kacsuk, Victor Chang
PublisherSciTePress
Pages341-348
Number of pages8
ISBN (Electronic)9789897584268
DOIs
Publication statusPublished - 2020
Event5th International Conference on Internet of Things, Big Data and Security, IoTBDS 2020 - Virtual, Online
Duration: 7 May 20209 May 2020

Conference

Conference5th International Conference on Internet of Things, Big Data and Security, IoTBDS 2020
CityVirtual, Online
Period7/05/209/05/20

Funding

This research is funded by EPSRC Doctoral Training Partnership 2016-2017 University of Aberdeen with award number: EP/N509814/1

Keywords

  • Data Quality
  • Data Quality Indicators
  • Intrinsic Data Quality
  • Numerical Data Quality
  • Pre-processing

Fingerprint

Dive into the research topics of 'Intrinsic indicators for numerical data quality'. Together they form a unique fingerprint.

Cite this