Oliveira, AlexandraGaio, RitaBaylina, PilarRebelo, CarlosReis, Luís Paulo2019-11-212019-11-212019Oliveira, A., Gaio, R., Baylina, P., Rebelo, C., & Reis, L. P. (2019). Data quality mining. Em New Knowledge in Information Systems and Technologies. WorldCIST’19 (Vol. 930, pp. 361–372). https://recipp.ipp.pt/handle/10400.22/14895http://hdl.handle.net/10400.22/14895We are living in a world of information abundance, surplus, and access. We have technologies to acquire any type of information but we still face the challenge of extracting the underlying valuable knowledge. Data analyses and mining processes may be severely impaired whenever data are corrupted by noise, ambiguity and distortions. This paper aims to provide a systematic procedure for data cleaning in single files data sources without schema that may be corrupted by the most common data problems. The methodology is guided by the dimensions of data quality standards and focuses on the goal of performing reasonable posterior statistical analyses.engData qualityData MiningValidationImprove qualityData quality miningbook part10.1007/978-3-030-16181-1_34