Repository logo
 
Publication

Ontologies for reusing data cleaning knowledge

dc.contributor.authorAlmeida, Ricardo
dc.contributor.authorOliveira, Paulo
dc.contributor.authorBraga, Luís
dc.contributor.authorBarroso, João
dc.date.accessioned2013-05-14T11:00:02Z
dc.date.available2013-05-14T11:00:02Z
dc.date.issued2012
dc.description.abstractThe emergence of new business models, namely, the establishment of partnerships between organizations, the chance that companies have of adding existing data on the web, especially in the semantic web, to their information, led to the emphasis on some problems existing in databases, particularly related to data quality. Poor data can result in loss of competitiveness of the organizations holding these data, and may even lead to their disappearance, since many of their decision-making processes are based on these data. For this reason, data cleaning is essential. Current approaches to solve these problems are closely linked to database schemas and specific domains. In order that data cleaning can be used in different repositories, it is necessary for computer systems to understand these data, i.e., an associated semantic is needed. The solution presented in this paper includes the use of ontologies: (i) for the specification of data cleaning operations and, (ii) as a way of solving the semantic heterogeneity problems of data stored in different sources. With data cleaning operations defined at a conceptual level and existing mappings between domain ontologies and an ontology that results from a database, they may be instantiated and proposed to the expert/specialist to be executed over that database, thus enabling their interoperability.por
dc.identifierDOI 10.1109/ICSC.2012.19
dc.identifier.isbn978-1-4673-4433-3
dc.identifier.urihttp://hdl.handle.net/10400.22/1567
dc.language.isoengpor
dc.peerreviewedyespor
dc.publisherIEEEpor
dc.relation.publisherversionhttp://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6337110por
dc.subjectData cleaningpor
dc.subjectOntologiespor
dc.subjectOWLpor
dc.subjectData qualitypor
dc.titleOntologies for reusing data cleaning knowledgepor
dc.typeconference object
dspace.entity.typePublication
oaire.citation.conferencePlacePalermo, Itáliapor
oaire.citation.endPage241por
oaire.citation.startPage238por
oaire.citation.titleIEEE Sixth International Conference on Semantic Computingpor
rcaap.rightsclosedAccesspor
rcaap.typeconferenceObjectpor

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
COM_RicardoAlmeida_2012_GECAD.pdf
Size:
231.54 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: