Publication
Balancing Plug-In for Stream-Based Classification
dc.contributor.author | de Arriba-Pérez, Francisco | |
dc.contributor.author | García-Méndez, Silvia | |
dc.contributor.author | Leal, Fátima | |
dc.contributor.author | Malheiro, Benedita | |
dc.contributor.author | Burguillo-Rial, Juan Carlos | |
dc.date.accessioned | 2024-03-11T10:55:56Z | |
dc.date.available | 2024-03-11T10:55:56Z | |
dc.date.issued | 2024-02-16 | |
dc.description.abstract | The latest technological advances drive the emergence of countless real-time data streams fed by users, sensors, and devices. These data sources can be mined with the help of predictive and classification techniques to support decision-making in fields like e-commerce, industry or health. In particular, stream-based classification is widely used to categorise incoming samples on the fly. However, the distribution of samples per class is often imbalanced, affecting the performance and fairness of machine learning models. To overcome this drawback, this paper proposes Bplug, a balancing plug-in for stream-based classification, to minimise the bias introduced by data imbalance. First, the plug-in determines the class imbalance degree and then synthesises data statistically through non-parametric kernel density estimation. The experiments, performed with real data from Wikivoyage and Metro of Porto, show that Bplug maintains inter-feature correlation and improves classification accuracy. Moreover, it works both online and offline. | pt_PT |
dc.description.version | info:eu-repo/semantics/publishedVersion | pt_PT |
dc.identifier.citation | de Arriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., Burguillo-Rial, J.C. (2024). Balancing Plug-In for Stream-Based Classification. In: Rocha, A., Adeli, H., Dzemyda, G., Moreira, F., Colla, V. (eds) Information Systems and Technologies. WorldCIST 2023. Lecture Notes in Networks and Systems, vol 799. Springer, Cham. https://doi.org/10.1007/978-3-031-45642-8_6 | pt_PT |
dc.identifier.doi | 10.1007/978-3-031-45642-8_6 | pt_PT |
dc.identifier.isbn | 978-3-031-45641-1 | |
dc.identifier.uri | http://hdl.handle.net/10400.22/25140 | |
dc.language.iso | eng | pt_PT |
dc.peerreviewed | yes | pt_PT |
dc.publisher | Springer | pt_PT |
dc.relation | INESC TEC - Institute for Systems and Computer Engineering, Technology and Science | |
dc.relation.ispartofseries | Lecture Notes in Networks and Systems; 799 | |
dc.relation.publisherversion | https://link.springer.com/chapter/10.1007/978-3-031-45642-8_6 | pt_PT |
dc.subject | Data bias | pt_PT |
dc.subject | Fairness | pt_PT |
dc.subject | Imbalanced data sets | pt_PT |
dc.subject | Machine learning algorithm | pt_PT |
dc.subject | Stream classification | pt_PT |
dc.title | Balancing Plug-In for Stream-Based Classification | pt_PT |
dc.type | conference object | |
dspace.entity.type | Publication | |
oaire.awardTitle | INESC TEC - Institute for Systems and Computer Engineering, Technology and Science | |
oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F50014%2F2020/PT | |
oaire.citation.conferencePlace | Cham, Switzerland | pt_PT |
oaire.citation.endPage | 74 | pt_PT |
oaire.citation.startPage | 65 | pt_PT |
oaire.citation.title | WorldCIST 2023: Information Systems and Technologies | pt_PT |
oaire.citation.volume | 799 | pt_PT |
oaire.fundingStream | 6817 - DCRRNI ID | |
person.familyName | Leal | |
person.familyName | BENEDITA CAMPOS NEVES MALHEIRO | |
person.givenName | Fátima | |
person.givenName | MARIA | |
person.identifier.ciencia-id | 2211-3EC7-B4B6 | |
person.identifier.ciencia-id | 7A15-08FC-4430 | |
person.identifier.orcid | 0000-0003-4418-2590 | |
person.identifier.orcid | 0000-0001-9083-4292 | |
person.identifier.rid | Y-3460-2019 | |
person.identifier.scopus-author-id | 57190765181 | |
project.funder.identifier | http://doi.org/10.13039/501100001871 | |
project.funder.name | Fundação para a Ciência e a Tecnologia | |
rcaap.rights | closedAccess | pt_PT |
rcaap.type | conferenceObject | pt_PT |
relation.isAuthorOfPublication | 8e77ca2d-3cb2-4346-927b-a706a5580c9e | |
relation.isAuthorOfPublication | babd4fda-654a-4b59-952d-6113eebbb308 | |
relation.isAuthorOfPublication.latestForDiscovery | babd4fda-654a-4b59-952d-6113eebbb308 | |
relation.isProjectOfPublication | 7a2d9a82-ee07-4c57-bbbf-2d88b942688d | |
relation.isProjectOfPublication.latestForDiscovery | 7a2d9a82-ee07-4c57-bbbf-2d88b942688d |