Utilize este identificador para referenciar este registo: http://hdl.handle.net/10400.22/1583
Título: SmartClean: an incremental data cleaning tool
Autor: Oliveira, Paulo
Rodrigues, Fátima
Henriques, Pedro
Palavras-chave: Limpeza de dados
Problemas de qualidade de dados
Data cleaning
Detection
Correction
Data quality problems
Architecture
Tool
Data: 2009
Editora: IEEE
Relatório da Série N.º: Quality Software
Resumo: This paper presents the SmartClean tool. The purpose of this tool is to detect and correct the data quality problems (DQPs). Compared with existing tools, SmartClean has the following main advantage: the user does not need to specify the execution sequence of the data cleaning operations. For that, an execution sequence was developed. The problems are manipulated (i.e., detected and corrected) following that sequence. The sequence also supports the incremental execution of the operations. In this paper, the underlying architecture of the tool is presented and its components are described in detail. The tool's validity and, consequently, of the architecture is demonstrated through the presentation of a case study. Although SmartClean has cleaning capabilities in all other levels, in this paper are only described those related with the attribute value level.
Peer review: yes
URI: http://hdl.handle.net/10400.22/1583
ISBN: 978-1-4244-5912-4
ISSN: 1550-6002
Versão do Editor: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5381543
Aparece nas colecções:ISEP – GECAD – Comunicações em eventos científicos

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
COM_PauloOliveira_2009_GECAD.pdf115,47 kBAdobe PDFVer/Abrir    Acesso Restrito. Solicitar cópia ao autor!


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote Degois 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.