Name: | Description: | Size: | Format: | |
---|---|---|---|---|
761.81 KB | Adobe PDF |
Advisor(s)
Abstract(s)
Web 2.0 platforms, like wikis and social networks, rely on crowdsourced data and, as such, are prone to data manipulation by ill-intended contributors. This research proposes the transparent identification of wiki manipulators through the classification of contributors as benevolent or malevolent humans or bots, together with the explanation of the attributed class labels. The system comprises: (i) stream-based data pre-processing; (ii) incremental profiling; and (iii) online classification, evaluation and explanation. Particularly, the system profiles contributors and contributions by combining features directly collected with content- and side-based engineered features. The experimental results obtained with a real data set collected from Wikivoyage – a popular travel wiki – attained a 98.52 % classification accuracy and 91.34 % macro F-measure. In the end, this work seeks to address data reliability to prevent information detrimental and manipulation.
Description
Keywords
Classification Data modelling Intelligent decision support system Natural language processing Stream processing
Citation
García-Méndez, S., Leal, F., de Arriba-Pérez, F., Malheiro, B., Burguillo-Rial, J.C. (2024). Explainable Classification of Wiki Streams. In: Rocha, A., Adeli, H., Dzemyda, G., Moreira, F., Colla, V. (eds) Information Systems and Technologies. WorldCIST 2023. Lecture Notes in Networks and Systems, vol 799. Springer, Cham. https://doi.org/10.1007/978-3-031-45642-8_7
Publisher
Springer