Repository logo
 
Loading...
Project Logo
Research Project

INESC TEC- Institute for Systems and Computer Engineering, Technology and Science

Authors

Publications

Online detection and infographic explanation of spam reviews with data drift adaptation
Publication . de Arriba Pérez, Francisco; García Méndez, Silvia; Leal, Fátima; Malheiro, Benedita; Burguillo, Juan C.
Spam reviews are a pervasive problem on online platforms due to its significant impact on reputation. However, research into spam detection in data streams is scarce. Another concern lies in their need for transparency. Consequently, this paper addresses those problems by proposing an online solution for identifying and explaining spam reviews, incorporating data drift adaptation. It integrates (i) incremental profiling, (ii) data drift detection & adaptation, and (iii) identification of spam reviews employing Machine Learning. The explainable mechanism displays a visual and textual prediction explanation in a dashboard. The best results obtained reached up to 87 % spam F-measure.
Exposing and explaining fake news on-the-fly
Publication . de Arriba Pérez, Francisco; García Méndez, Silvia; Leal, Fátima; Malheiro, Benedita; Burguillo, Juan C.
Social media platforms enable the rapid dissemination and consumption of information. However, users instantly consume such content regardless of the reliability of the shared data. Consequently, the latter crowdsourcing model is exposed to manipulation. This work contributes with an explainable and online classification method to recognize fake news in real-time. The proposed method combines both unsupervised and supervised Machine Learning approaches with online created lexica. The profiling is built using creator-, content- and context-based features using Natural Language Processing techniques. The explainable classification mechanism displays in a dashboard the features selected for classification and the prediction confidence. The performance of the proposed solution has been validated with real data sets from Twitter and the results attain 80 % accuracy and macro F-measure. This proposal is the first to jointly provide data stream processing, profiling, classification and explainability. Ultimately, the proposed early detection, isolation and explanation of fake news contribute to increase the quality and trustworthiness of social media contents.
Emotional evaluation of open-ended responses with transformer models
Publication . Pajón-Sanmartín, Alejandro; De Arriba Pérez, Francisco; García Méndez, Silvia; Burguillo, Juan C.; Leal, Fátima; Malheiro, Benedita
This work applies Natural Language Processing (NLP) techniques, specifically transformer models, for the emotional evaluation of open-ended responses. Today’s powerful advances in transformer architecture, such as ChatGPT, make it possible to capture complex emotional patterns in language. The proposed transformer-based system identifies the emotional features of various texts. The research employs an innovative approach, using prompt engineering and existing context, to enhance the emotional expressiveness of the model. It also investigates spaCy’s capabilities for linguistic analysis and the synergy between transformer models and this technology. The results show a significant improvement in emotional detection compared to traditional methods and tools, highlighting the potential of transformer models in this domain. The method can be implemented in various areas, such as emotional research or mental health monitoring, creating a much richer and complete user profile.
Interpretable Classification of Wiki-Review Streams
Publication . García-Méndez, Silvia; Leal, Fátima; Malheiro, Benedita; Burguillo-Rial, Juan Carlos
Wiki articles are created and maintained by a crowd of editors, producing a continuous stream of reviews. Reviews can take the form of additions, reverts, or both. This crowdsourcing model is exposed to manipulation since neither reviews nor editors are automatically screened and purged. To protect articles against vandalism or damage, the stream of reviews can be mined to classify reviews and profile editors in real-time. The goal of this work is to anticipate and explain which reviews to revert. This way, editors are informed why their edits will be reverted. The proposed method employs stream-based processing, updating the profiling and classification models on each incoming event. The profiling uses side and content-based features employing Natural Language Processing, and editor profiles are incrementally updated based on their reviews. Since the proposed method relies on self-explainable classification algorithms, it is possible to understand why a review has been classified as a revert or a non-revert. In addition, this work contributes an algorithm for generating synthetic data for class balancing, making the final classification fairer. The proposed online method was tested with a real data set from Wikivoyage, which was balanced through the aforementioned synthetic data generation. The results attained near-90% values for all evaluation metrics (accuracy, precision, recall, and F-measure).
Citizen Engagement in Urban Planning – An EPS@ISEP 2022 Project
Publication . Cardani, Carla G.; Couzyn, Carmen; Degouilles, Eliott; Benner, Jan M.; Engst, Julia A.; Duarte, Abel J.; Malheiro, Benedita; Ribeiro, Cristina; Justo, Jorge; Silva, Manuel F.; Ferreira, Paulo; Guedes, Pedro
Involving people in urban planning offers many benefits, but current methods are failing to get a large number of citizens to participate. People have a high participation barrier when it comes to public participation in urban planning – as it requires a lot of time and initiative, only a small non-diverse group of citizens take part in governmental initiatives. In this paper, a product is developed to make it as easy as possible for citizens to get involved in construction projects in their community at an early stage. As a solution, a public screen is proposed, which offers citizens the opportunity to receive information, view 3D models, vote and comment at the site of the construction project via smartphone – the solution was named Parcitypate. To explain the functions of the product, a prototype was created and tested. In addition, concepts for branding, marketing, ethics, and sustainability are presented.

Organizational Units

Description

Keywords

Contributors

Funders

Funding agency

Fundação para a Ciência e a Tecnologia

Funding programme

6817 - DCRRNI ID

Funding Award Number

UIDP/50014/2020

ID