de

Welcome Gast


  • Login
Full load

Quality of Uncertain Data
Abstract
Many real-life applications, for example data integration, data extraction, risk-management or sensor systems, naturally produce uncertain data. One of the most important goals in these applications is to produce data of high quality. This leads to the following open questions:
  • What does high quality exactly mean with respect to uncertainty and impreciseness?
  • What metrics are most qualified for quantifying the quality with respect to these means?
Currently, the most quality metrics have been defined for appropriately scoring the fineness of certain data and hence only insufficiently capture what is intuitively the quality of uncertain data. As we think, for adequately scoring quality of uncertain data new metrics for existing quality criteria as well as new quality criteria themselves are required. Moreover, as one of the most important methods for improving quality, we consider the integration of uncertain data. In this context, we focus on three elementary questions:
  • How to efficiently and effectively detect duplicates, if data are not only unclean but also imprecise and uncertain?
  • How to combine the uncertain information given by multiple duplicates so that a tuple of higher quality results?
  • How can the expressive modeling power of uncertain data models be used to capture uncertainty coming up during the integration process?
The QloUD project aims to develop techniques for properly scoring the quality of uncertain data as well as to develop techniques for properly integrating uncertain data.
Participating members
Publications of Project
2013
pdflogo
bibTexLogo
Fabian Panse, Maurice van Keulen, Norbert Ritter
In: ACM Journal of Data and Information Quality
2012
pdflogo
bibTexLogo
In: The 17th International Conference on Information Quality
2011
pdflogo
bibTexLogo
In: 5th International Conference on Scalable Uncertainty Management
pdflogo
bibTexLogo
In: Ingénierie des Systèmes d'Information
2009
pdflogo
bibTexLogo
Fabian Panse, Maurice van Keulen, Ander de Keijzer, Norbert Ritter
In: Centre for Telematics and Information Technology (CTIT), University of Twente, Technical Report Series
Student theses within the project
2020
bibTexLogo
Bachelorarbeit of Daniel Kötter
bibTexLogo
Bachelorarbeit of Heiko Eckmann
bibTexLogo
Bachelorarbeit of Johannes Bolduan
2019
bibTexLogo
Master Thesis of David Zschocke
bibTexLogo
Bachelorarbeit of Jan Synwoldt
2018
bibTexLogo
Master Thesis of Jennifer Soltau
bibTexLogo
Bachelorarbeit of Manuela Buchholz
bibTexLogo
Master Thesis of Timm Holler
Supervisors: Fabian Panse, Wolfgang Menzel
2016
bibTexLogo
Bachelorarbeit of Alexander Keck
2014
bibTexLogo
Master Thesis of Erik Meyer
2013
bibTexLogo
Bachelorarbeit of Kai Hildebrandt
Tutor: Fabian Panse     Supervisors: Norbert Ritter, Wolfgang Menzel
2012
pdflogo
bibTexLogo
Tutor: Fabian Panse     Supervisors: Norbert Ritter, Wolfgang Menzel
bibTexLogo
Diploma Thesis of Lennart Helm
Tutor: Fabian Panse     Supervisors: Norbert Ritter, Matthias Rarey
bibTexLogo
Diploma Thesis of David Haasenleder
Tutor: Fabian Panse     Supervisors: Norbert Ritter, Guido Gryczan
2011
bibTexLogo
Bachelorarbeit of Lars Grote
Tutor: Fabian Panse     Supervisors: Norbert Ritter, Axel Schmolitzky
2010
pdflogo
bibTexLogo
Tutor: Fabian Panse     Supervisors: Norbert Ritter, Wolfgang Menzel