Explainable Classification of Wiki Streams

García-Méndez, Silvia; Leal, Fátima; de Arriba-Pérez, Francisco; Malheiro, Benedita; Burguillo-Rial, Juan Carlos

http://hdl.handle.net/10400.22/25141

Use this identifier to reference this record.

Name:	Description:	Size:	Format:
WorldCist_2023_Wiki.pdf		761.81 KB	Adobe PDF	Download

Send Feedback

Authors

García-Méndez, Silvia

Leal, Fátima

de Arriba-Pérez, Francisco

Malheiro, Benedita

Burguillo-Rial, Juan Carlos

Abstract(s)

Web 2.0 platforms, like wikis and social networks, rely on crowdsourced data and, as such, are prone to data manipulation by ill-intended contributors. This research proposes the transparent identification of wiki manipulators through the classification of contributors as benevolent or malevolent humans or bots, together with the explanation of the attributed class labels. The system comprises: (i) stream-based data pre-processing; (ii) incremental profiling; and (iii) online classification, evaluation and explanation. Particularly, the system profiles contributors and contributions by combining features directly collected with content- and side-based engineered features. The experimental results obtained with a real data set collected from Wikivoyage – a popular travel wiki – attained a 98.52 % classification accuracy and 91.34 % macro F-measure. In the end, this work seeks to address data reliability to prevent information detrimental and manipulation.

Keywords

Classification Data modelling Intelligent decision support system Natural language processing Stream processing

URI

http://hdl.handle.net/10400.22/25141

Citation

García-Méndez, S., Leal, F., de Arriba-Pérez, F., Malheiro, B., Burguillo-Rial, J.C. (2024). Explainable Classification of Wiki Streams. In: Rocha, A., Adeli, H., Dzemyda, G., Moreira, F., Colla, V. (eds) Information Systems and Technologies. WorldCIST 2023. Lecture Notes in Networks and Systems, vol 799. Springer, Cham. https://doi.org/10.1007/978-3-031-45642-8_7