Repository logo
 
No Thumbnail Available
Publication

Applying Data Mining Techniques to Improve Breast Cancer Diagnosis

Use this identifier to reference this record.
Name:Description:Size:Format: 
ART_GoretiMarreiros_GECAD_2016.pdf314.11 KBAdobe PDF Download

Advisor(s)

Abstract(s)

In the field of breast cancer research, and more than ever, new computer aided diagnosis based systems have been developed aiming to reduce diagnostic tests false-positives. Within this work, we present a data mining based approach which might support oncologists in the process of breast cancer classification and diagnosis. The present study aims to compare two breast cancer datasets and find the best methods in predicting benign/malignant lesions, breast density classification, and even for finding identification (mass / microcalcification distinction). To carry out these tasks, two matrices of texture features extraction were implemented using Matlab, and classified using data mining algorithms, on WEKA. Results revealed good percentages of accuracy for each class: 89.3 to 64.7 % - benign/malignant; 75.8 to 78.3 % - dense/fatty tissue; 71.0 to 83.1 % - finding identification. Among the different tests classifiers, Naive Bayes was the best to identify masses texture, and Random Forests was the first or second best classifier for the majority of tested groups.

Description

Keywords

Breast cancer diagnosis Features extraction Data mining techniques

Citation

Research Projects

Organizational Units

Journal Issue

Publisher

Springer Verlag

CC License

Altmetrics