Repository logo
 
Publication

Automatic FoodEx2 classification system for food description

dc.contributor.advisorFaria, Brígida Mónica
dc.contributor.advisorReis, Luís Paulo
dc.contributor.advisorPimenta, Rui
dc.contributor.authorFonseca, João Emanuel Sousa
dc.date.accessioned2023-01-12T11:51:24Z
dc.date.available2023-11-28T01:34:22Z
dc.date.issued2022-11-28
dc.description.abstractFood is an impacting factor in human health. Food security protects the consumers by offering a safety net from which they can trust the quality of the product. In Europe, entities such as the European Food and Safety Authority (EFSA) are risk assessors. They provide information used to shape laws around food security. To collect data regarding food safety the EFSA developed a comprehensive food classification and description system, called FoodEx2. The FoodEx2 coding system uses manual process to map food descriptions to FoodEx2 codes. The motivation for this work comes from the reduced time that could be obtained by using an algorithm to automate the code generation. It is already known that the application of Knowledge Discovery in Databases is a fundamental area to automatically produce patterns from large quantities of data. The main objective of this project is to explore automatic approaches to classify food descriptions with FoodEx2 codes. In this work several classic classifiers are compared in the prediction of FoodEx2 base codes, a multiclass classification task. The performances were explored in distinct datasets along with different levels of text preprocessing using the metrics exact match ratio and the f1-score and document representation Bag-Of-Words with TF IDF weighting. All the datasets contain imbalanced data distributions. The documents are composed of short texts describing ingredients, dishes, and animal sample details. The performances varied mainly between datasets and classifiers. The best performing classifiers were Random Forests, Decision Trees, and Linear Support Vector Machines. The results show that the creation of an automatic classifier is dependent on further exploration of the available data.pt_PT
dc.identifier.tid203147391pt_PT
dc.identifier.urihttp://hdl.handle.net/10400.22/21470
dc.language.isoengpt_PT
dc.subjectKnowledge discovery in databasespt_PT
dc.subjectMachine learningpt_PT
dc.subjectText miningpt_PT
dc.subjectFood classificationpt_PT
dc.subjectFoodEx2pt_PT
dc.titleAutomatic FoodEx2 classification system for food descriptionpt_PT
dc.typemaster thesis
dspace.entity.typePublication
rcaap.rightsopenAccesspt_PT
rcaap.typemasterThesispt_PT
thesis.degree.nameBioestatística e Bioinformática Aplicadas à Saúdept_PT

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Dissertacao_Joao_Fonseca_vf.pdf
Size:
9.62 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: