Repository logo
 
Publication

Interpretable Classification of Wiki-Review Streams

dc.contributor.authorGarcía-Méndez, Silvia
dc.contributor.authorLeal, Fátima
dc.contributor.authorMalheiro, Benedita
dc.contributor.authorBurguillo-Rial, Juan Carlos
dc.date.accessioned2024-03-11T11:58:32Z
dc.date.available2024-03-11T11:58:32Z
dc.date.issued2023-12-13
dc.description.abstractWiki articles are created and maintained by a crowd of editors, producing a continuous stream of reviews. Reviews can take the form of additions, reverts, or both. This crowdsourcing model is exposed to manipulation since neither reviews nor editors are automatically screened and purged. To protect articles against vandalism or damage, the stream of reviews can be mined to classify reviews and profile editors in real-time. The goal of this work is to anticipate and explain which reviews to revert. This way, editors are informed why their edits will be reverted. The proposed method employs stream-based processing, updating the profiling and classification models on each incoming event. The profiling uses side and content-based features employing Natural Language Processing, and editor profiles are incrementally updated based on their reviews. Since the proposed method relies on self-explainable classification algorithms, it is possible to understand why a review has been classified as a revert or a non-revert. In addition, this work contributes an algorithm for generating synthetic data for class balancing, making the final classification fairer. The proposed online method was tested with a real data set from Wikivoyage, which was balanced through the aforementioned synthetic data generation. The results attained near-90% values for all evaluation metrics (accuracy, precision, recall, and F-measure).pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationS. García-Méndez, F. Leal, B. Malheiro and J. C. Burguillo-Rial, "Interpretable Classification of Wiki-Review Streams," in IEEE Access, vol. 11, pp. 141137-141151, 2023, doi: 10.1109/ACCESS.2023.3342472pt_PT
dc.identifier.doi10.1109/ACCESS.2023.3342472pt_PT
dc.identifier.eissn2169-3536
dc.identifier.urihttp://hdl.handle.net/10400.22/25145
dc.language.isoengpt_PT
dc.peerreviewedyespt_PT
dc.publisherIEEEpt_PT
dc.relationINESC TEC- Institute for Systems and Computer Engineering, Technology and Science
dc.relation.publisherversionhttps://ieeexplore.ieee.org/document/10356073pt_PT
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/pt_PT
dc.subjectData reliability and fairnesspt_PT
dc.subjectData-stream processing and classificationpt_PT
dc.subjectSynthetic datapt_PT
dc.subjectTransparencypt_PT
dc.subjectVandalismpt_PT
dc.subjectWikispt_PT
dc.titleInterpretable Classification of Wiki-Review Streamspt_PT
dc.typejournal article
dspace.entity.typePublication
oaire.awardTitleINESC TEC- Institute for Systems and Computer Engineering, Technology and Science
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDP%2F50014%2F2020/PT
oaire.citation.conferencePlacePiscataway, New Jerseypt_PT
oaire.citation.endPage141151pt_PT
oaire.citation.startPage141137pt_PT
oaire.citation.titleIEEE Accesspt_PT
oaire.citation.volume11pt_PT
oaire.fundingStream6817 - DCRRNI ID
person.familyNameLeal
person.familyNameBENEDITA CAMPOS NEVES MALHEIRO
person.givenNameFátima
person.givenNameMARIA
person.identifier.ciencia-id2211-3EC7-B4B6
person.identifier.ciencia-id7A15-08FC-4430
person.identifier.orcid0000-0003-4418-2590
person.identifier.orcid0000-0001-9083-4292
person.identifier.ridY-3460-2019
person.identifier.scopus-author-id57190765181
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccesspt_PT
rcaap.typearticlept_PT
relation.isAuthorOfPublication8e77ca2d-3cb2-4346-927b-a706a5580c9e
relation.isAuthorOfPublicationbabd4fda-654a-4b59-952d-6113eebbb308
relation.isAuthorOfPublication.latestForDiscoverybabd4fda-654a-4b59-952d-6113eebbb308
relation.isProjectOfPublication5efdbedb-4666-4d5b-94f0-0a938b0d5ce4
relation.isProjectOfPublication.latestForDiscovery5efdbedb-4666-4d5b-94f0-0a938b0d5ce4

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Interpretable_Classification_of_Wiki-Review_Streams.pdf
Size:
2.12 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: