Repository logo
 

ISEP - DM – Engenharia de Inteligência Artificial

Permanent URI for this collection

Browse

Recent Submissions

Now showing 1 - 10 of 68
  • Estratégias de aprendizagem por reforço para configuração dinâmica de meta-heurísticas
    Publication . Oliveira, Vítor José Henriques; Pinto, Tiago Manuel Campelos Ferreira; Ramos, Carlos Fernando da Silva
    A eficácia da otimização de problemas complexos está intimamente ligada à configuração de parâmetros em algoritmos meta-heurísticos. Embora já tenham sido propostos métodos automatizados para a escolha dos parâmetros de algoritmos para reduzir a necessidade de ajuste manual, existe ainda um potencial significativo, não explorado, de ajuste dinâmico de parâmetros de algoritmos durante a execução, o que pode melhorar o seu desempenho. Este estudo visa aferir a eficácia da definição manual de parâmetros em comparação com uma abordagem dinâmica baseada em aprendizagem por reforço, reduzindo a necessidade de intervenção humana e aumentando a eficiência operacional dos algoritmos. Para alcançar este objetivo, adaptaram-se os métodos SARSA (State-Action-Reward-State-Action) e Deep SARSA para regular os parâmetros de algoritmos meta-heurísticos, em especial, o algoritmo genético. O modelo adotado é independente do problema a ser otimizado ou do algoritmo meta-heurístico selecionado, por isso, oferece a flexibilidade necessária, sendo apenas crucial escolher os parâmetros a ajustar durante o decorrer do processo de otimização de qualquer problema estudado. Estas metodologias foram testadas em funções benchmark, amplamente reconhecidas na literatura, e aplicadas nesta investigação nos seguintes cenários práticos: a otimização de portfólios de investimentos, na qual um participante possui ou pretende adquirir energia elétrica num mercado de eletricidade e a melhoria relacionada com a alocação de pacientes em Unidades de Cirurgia (UC) e em Unidades de Cuidados Intensivos (UCI), com o intuito de melhorar a eficiência da utilização de recursos limitados. Os resultados demonstram que o algoritmo Deep SARSA, baseado em aprendizagem por reforço e redes neuronais, obtém frequentemente um melhor desempenho em comparação com a configuração manual, de cariz completamente aleatório. Este facto pode ser comprovado pela análise dos resultados das médias do número de execuções, nomeadamente, no problema das UC, onde o valor do teste ANOVA apresentou um 𝑝-value significativo igual a 0.014. Este desfecho sugere que abordagens dinâmicas de ajuste de parâmetros podem ser mais eficazes e oferecer uma alternativa viável a métodos estáticos de configuração, que possam potenciar soluções propostas para enfrentar os desafios em ambientes dinâmicos e incertos.
  • AI-driven information retrieval system for candidate screening
    Publication . Silva, Vasco Reid Ferreira da; Conceição, Luís Manuel Silva
    Efficient screening and evaluation in the recruitment process are tasks that demand substantial time and effort from Human Resources professionals. These processes often suffer from long waiting periods, inconsistent candidate evaluation, and the potential to overlook qualified candidates. In this context, leveraging state-of-the-art natural language processing architectures, specifically large language models (LLMs), holds significant promise. LLMs can generate evaluations using advanced prompt techniques to improve the accuracy and reliability of the output. This thesis researches the feasibility of employing 7 billion parameter LLMs in candidate screening to reduce response times, decrease workload, and improve evaluation consistency. The study involves a comparative analysis of various state-of-the-art large language models to identify those most suitable for this application. Additionally, it examines different prompt engineering techniques to optimize the performance of these models. A comprehensive analysis of the results is conducted to determine the most effective combinations of LLMs and prompt engineering techniques. This includes a two-way validation process, utilizing both the state-of-the-art GPT-4 model and manual human resources validation, to ensure the robustness and reliability of the findings. The outcomes of this thesis aim to enhance the quality of candidate screening by integrating LLMs into the process. Furthermore, this work aspires to provide valuable insights into the capabilities of 7 billion parameter large language models in the field of human resources and their application in real-world scenarios.
  • Development and evaluation of a complex-valued neural network library: the Renplex open-source project
    Publication . Alves, Pedro Manuel Ferreira; Faria, Luiz Felipe Rocha de
    Complex-Valued Neural Networks (CVNN) have shown to be a promising type of Artificial Neural Networks (ANN) when compared to its real-valued counter-parts. However, it has been a research field where authors autonomously developed and tested CVNN with no common tools or library to module them. This Master Thesis presents a library called Renplex capable of modulating CVNN as an open-source project for research and even for small scale applications. Although not suitable for beginners in the field of ANN or programming, the library provides a low-level interactive with Machine Learning (ML) pipeline, in order to accurately control CVNN evaluation. To test the library’s core functionalities, architectures such as Complex-Valued Multi-Layer Perceptron, Auto-encoder and Convolutional Neural Network were trained. These achieved test results that outperformed their real-valued counterparts for the MNIST dataset and a synthetically generated dataset for signal reconstruction. Such improvement on performance, has been previously stated throughout literature. It consisted in greater test accuracy (or lower loss values), more stability in training, faster convergence in terms of epochs needed, greater capability of generalization, and subsequently less prone to over-fitting. This work will introduce a new tool for exploring CVNN, capable of scaling and potentially uncovering many of their hidden potentials for ML-related tasks.
  • Intelligent recommendation system to enhance youth football athletes development
    Publication . Fraga, Pedro Filipe Monteiro; Martins, António Constantino Lopes; Matos, Paulo Sérgio dos Santos
    With the constant evolution of sports, technological advancements are constantly being made with the objective of maximizing the performance of teams and players. Looking specifically at football, which is the most played sport across the world, teams are constantly looking to develop the best players, and that process starts when they are young. However, only the top football clubs in the world have the funding necessary to create the best conditions to improve the best players. Consequently, for most of the football academies, it is hard to maximize the potential of their young players and enhance their growth. Therefore, this thesis pretends to solve this issue by proposing a football recommendation system for young athletes improvement, where the main objectives were to help the coaching staff by recommending the most relevant skills for a player to improve, using different approaches to achieve these recommendations, such as the input of experts, test them in a real life football academy environment, with the aid of recommendation system evaluation metrics, and discuss the results obtained. In the State of the Art chapter, a systematic literature review with PRISMA methodology was used, to assess the existing recommendation systems in football and their algorithms, as well as the aspects typically included in player modeling. The system was tested in a U14 and a U17 men’s football team, and the results obtained are very good indicators of the effectiveness of the system, showcasing 93.1% of accuracy in the recommendations while maintaining recall, precision and F1-measure values above 80%. The results obtained, combined with an interview with a coach that tested the system, show evidence that the system enhances the growth of the players, by aiding the coaching staff at suggesting the most relevant skills to a player. It also indicates that the system can be implemented in football academies, to complement the coaching staff. There are some limitations to the system, notably not having an interface built, the goalkeeper position not being addressed by the system and the experimentation tests having a relatively small sample size. Future work includes improving the performance of the algorithms, testing on a bigger set of players, adding the goalkeeper position, reanalyzing the skills used to modulate a player and the addition of an interface, which will aid the utilization of the system and enable the customization of the parameters used in the models.
  • Metrix analyzer: automatic insights generator for teams performance
    Publication . Panda, Miguel Francisco Coutinho; Martins, António Constantino Lopes
    The African continent is witnessing a technological renaissance, with its market competitiveness escalating at an unprecedented pace. In this rapidly evolving landscape, software companies must swiftly adapt to not only survive but thrive. Central to this endeavour is the imperative to enhance team performance, as it is vital for sustaining profitability and driving innovation. As demand for comprehensive performance analysis grows across teams, Agile Coaches are increasingly stretched thin, balancing the urgent need for insights with their ongoing projects. This challenge underscores the critical necessity for advanced automation—a solution that can streamline the generation of actionable insights while requiring fewer human resources. It was within this context that the Metrix project was born. This dissertation examines the detailed and step-by-step process of developing a hybrid expert system solution designed to meet the growing needs of the company. It covers the technical research carried out, the careful analysis of requirements, and the practical application design that laid the groundwork for the project. Additionally, it describes the data preparation and the experimentation phase that led to accuracy results of 85%. By the conclusion of this thesis, the results and insights resultant from this journey will be presented, illuminating the transformative potential of the Metrix project in revolutionizing team performance analysis and paving the way for future advancements.
  • Sistema de vigilância inteligente para ambientes domésticos com visão computacional
    Publication . Costa, Miguel Ferreira da; Martins, António Constantino Lopes
    Nos dias de hoje, num mundo onde a segurança privada é cada vez mais crucial, a Inteligência Artificial emerge como uma aliada poderosa para proteger os nossos lares. No entanto, os sistemas de vigilância para ambientes domésticos ainda enfrentam desafios significativos, nomeadamente, na falta de personalização e na dependência de tecnologias tradicionais, que não conseguem atender às exigências modernas de segurança. Esta dissertação apresenta o sistema iDetect, um sistema de vigilância inteligente com deteção e reconhecimento facial voltado para ambientes domésticos. O sistema utiliza uma combinação de tecnologias tradicionais e deep learning, alcançando uma precisão de 93% na deteção e uma acurácia de 97% para o reconhecimento facial, com base numa seleção criteriosa de algoritmos, utilizando uma versão adaptada da metodologia PRISMA. Devido à crescente utilização de smartphones, o sistema possui a sua aplicação móvel, totalmente configurável e intuitiva, que permite ao utilizador personalizar o sistema e receber notificações e alertas em tempo real. Os resultados mostram que o sistema implementado oferece uma solução eficaz e acessível, que une o poder da Inteligência Artificial à portabilidade dos dispositivos móveis, proporcionando uma resposta mais rápida e eficiente às ameaças domésticas. Foi possível implementar um sistema que supera os atuais sistemas de videovigilância doméstica, fornecendo respostas mais ágeis e precisas, reduzindo significativamente o número de falsos positivos. O sistema desenvolvido representa um avanço no campo da segurança doméstica, ao disponibilizar uma solução inovadora e de fácil utilização que pode ser adaptada às necessidades individuais de cada utilizador.
  • Application of active learning on medical images to enhance machine learning models
    Publication . Santos, Maria Inês Salvador dos; Marreiros, Maria Goreti Carvalho
    Artificial intelligence has made some huge advancements in the healthcare field, particularly in medical imaging. However, data and annotations in this area are often scarce and expensive to obtain. Labeling images, although essential for machine learning models, is a tedious and time-consuming task. Active learning addresses this challenge by selecting informative samples to try and create a subset of unlabeled data where the model could have more difficulty predicting the labels which are then given to experts to annotate. The goal is to try to use less amount of annotated data, whilst still getting a good model performance. Breast cancer is one of the most common cancers in women. The proposed solution uses the Patch- Camelyon dataset, a variation of the Camelyon16 dataset with patches from histopathologic scans of sentinel lymph node sections for the detection of metastatic tissue of breast cancer patients. This work proposes an active learning approach that includes the division of the unlabeled data into clusters which are then classified based on their level of informativeness (based on Shannon Entropy). Then, from each cluster several samples are selected based on the previously defined informativeness level and each sample is scored based on a formula that includes both entropy and Euclidean distance to the cluster centroid. Finally, samples with the lowest uncertainty score are added to the training dataset with the model’s prediction. The proposed method includes both model uncertainty and data distribution. The solution showed promising results when compared with a random sampling approach. To evaluate the proposed solution, greyscale and Macenko normalization techniques were used in all different approaches (random sampling approach, a variation of the proposed solution with no pseudo label task and the proposed solution). In some iterations, the difference between the F1 score in the proposed active learning solution and random sampling was more than 0,20. With the application of this method, experts can spend less time annotating images while still achieving a high-performance model.
  • Ajuste dinâmico de dificuldade em videojogos usando aprendizagem automática
    Publication . Felício, Jorge Emanuel Coelho Mendonça de Anciães; Faria, Luiz Felipe Rocha de
    In the constantly evolving field of video games, traditional difficulty settings fail to accommodate the wide range of skill levels among players. The resulting mismatch between the player’s skill and the game’s challenge can make the game boring for skilled players or frustrating for less experienced ones, negatively affecting player engagement. Dynamic Difficulty Adjustment (DDA) seeks to resolve this issue by adapting the game’s difficulty in real time in response to the player’s performance. While advancements in artificial intelligence (AI), particularly machine learning (ML), have enabled more adaptive DDA systems, the full potential of certain advanced techniques or tools has yet to be explored. This thesis thus explores possible innovations in the integration of AI in DDA systems for video games. The research begins by reviewing the techniques used for DDA, focusing on methodologies such as player modeling, rule-based systems, and ML. Based on this research, potential areas for innovation were identified and the application of Deep Reinforcement Learning (DRL) in the Unity game development platform through the usage of the MLAgents toolkit was chosen as a promising approach for this research. Using this methodology, this research aims to implement a DDA system that adjusts a game’s difficulty based on the player’s skills, enhancing their engagement and maintaining a consistent challenge. This project has several critical phases of development, including the creation of a game prototype, data collection for model training, development and integration of the DDA system into the game prototype, and conducting an experiment comparing the prototype with DDA integrated with a version of the prototype that used traditional static difficulty scaling. The experiment conducted was done with 20 participants of varying skill levels and used a combination of collected gameplay metrics and a modified Game Experience Questionnaire (GEQ) survey to evaluate the DDA system’s effectiveness. The results showed that the DDA system demonstrated a statistically significant increase in the player engagement component and appropriately adjusted the difficulty to be harder for participants of higher skill. However, the system sometimes exhibited some issues with drastic adjustments in difficulty between levels, which led to a slightly lower Post-Game positive experience score compared to the static difficulty scaling system. Despite these fluctuations, the proposed system demonstrates the potential of the ML-Agents toolkit in implementing DDA with DRL in games made on the Unity platform. By identifying underexplored areas in the current literature and applying advanced techniques like DRL, this thesis aims to contribute to both academic research and game development regarding the approach to DDA in video games.
  • NERdy: enhancing information discovery through named entity recognition
    Publication . Magalhães, João Vilas Boas da Silva; Faria, Luiz Felipe Rocha de
    Education is essential for individual and societal progress, playing a pivotal role in economic development, creativity, and social mobility. However, a significant challenge remains in ensuring equitable access to quality education, particularly in diverse classrooms where personalized learning is increasingly critical. Research highlights the benefits of tailored learning approaches, but current educational tools often lack the ability to organize raw information into structured formats suitable for individualized learning. This gap underscores the need for advancements in Natural Language Processing (NLP) to enhance educational tools. In response to this challenge, this project focuses on developing a Named Entity Recognition (NER) model to improve the organization and extraction of information from raw text. NER, a key task in NLP, identifies and classifies entities such as people, organizations, and locations, providing the groundwork for future tools designed to structure educational content. The primary objective of this study is to construct an entity extraction tool, with the ultimate goal of enhancing personalized learning by facilitating the automatic organization of educational materials. To achieve this, a model combining a pretrained BERT encoder, a BiLSTM layer, and a Conditional Random Field (CRF) correction layer was developed. The model was trained on curated datasets to ensure both performance and fairness. Through extensive testing and finetuning, the model demonstrated strong results, achieving an F1 score of 87.22%, comparing favorably to state-of-the-art models. Key techniques such as class balancing, weight decay, and dropout were used to prevent overfitting, while validation and training losses were monitored to assess the model’s performance. The findings of this project not only confirm the effectiveness of the developed NER model but also highlight its potential in addressing educational challenges. The model shows promise for future expansion, including the development of relation extraction techniques and knowledge graph generation to further enhance learning tools. Ethical considerations, including data privacy, fairness, and transparency, were prioritized throughout the project. Future work will focus on refining the model and expanding its capabilities to better serve the educational sector, contributing to the broader goal of improving access to quality, personalized education.
  • Self-Checkout System for product recognition
    Publication . Azevedo, João Nuno Silva; Ramos, Carlos Fernando da Silva
    Recently, retail environments have increasingly turned to self-checkout systems as a way to simplify operations, providing a seamless customer experience. These systems reduce the need for manual labor, speeding up checkout times to offer customers more autonomy. There is a growing need for technologies to address the current challenges related to product identification. The integration of deep learning and computer vision into self-checkout systems has the potential to revolutionize product identification. With real-time classification of products, there is no need for manual input from costumers or employees. However, a significant challenge in product identification is the classification of fruits and vegetables, being a hard challenge due to many similarities between them. These technologies offer more efficiency and convenience to customers, enhancing customer satisfaction, reducing employee-related expenses while reducing transaction errors and optimizing the overall efficiency of the retail checkout process. This dissertation explores the development of two neural network models for the task of fruit classification, one of the current challenges related to product identification. The objective of this research is to assess the effectiveness of these architectures in fruit classification. Both networks were trained and evaluated on a fruit dataset under the same conditions. Then the experiments were conducted to compare the classification accuracy and model efficiency of both approaches. This study gives valuable insights into the application of deep learning techniques for image recognition, with potential for broader classification tasks and future work in fine-tuning model architectures for optimized performance.