Repository logo
 
No Thumbnail Available
Publication

Explainable Classification of Wiki Streams

Use this identifier to reference this record.
Name:Description:Size:Format: 
WorldCist_2023_Wiki.pdf761.81 KBAdobe PDF Download

Advisor(s)

Abstract(s)

Web 2.0 platforms, like wikis and social networks, rely on crowdsourced data and, as such, are prone to data manipulation by ill-intended contributors. This research proposes the transparent identification of wiki manipulators through the classification of contributors as benevolent or malevolent humans or bots, together with the explanation of the attributed class labels. The system comprises: (i) stream-based data pre-processing; (ii) incremental profiling; and (iii) online classification, evaluation and explanation. Particularly, the system profiles contributors and contributions by combining features directly collected with content- and side-based engineered features. The experimental results obtained with a real data set collected from Wikivoyage – a popular travel wiki – attained a 98.52 % classification accuracy and 91.34 % macro F-measure. In the end, this work seeks to address data reliability to prevent information detrimental and manipulation.

Description

Keywords

Classification Data modelling Intelligent decision support system Natural language processing Stream processing

Citation

García-Méndez, S., Leal, F., de Arriba-Pérez, F., Malheiro, B., Burguillo-Rial, J.C. (2024). Explainable Classification of Wiki Streams. In: Rocha, A., Adeli, H., Dzemyda, G., Moreira, F., Colla, V. (eds) Information Systems and Technologies. WorldCIST 2023. Lecture Notes in Networks and Systems, vol 799. Springer, Cham. https://doi.org/10.1007/978-3-031-45642-8_7

Organizational Units

Journal Issue