Repository logo
 
Publication

Automated integration of transport timetable information

dc.contributor.advisorPereira, António Jorge Santos
dc.contributor.authorWesterberg, João Baptista Monteiro
dc.date.accessioned2021-02-02T14:27:03Z
dc.date.available2021-02-02T14:27:03Z
dc.date.issued2020
dc.description.abstractThe ever-growing Web contains a large amount of data. This large amount of data is useful when combined with applications that can refine it and use it to improve its users’ lives. However, using the data available is not an easy task since most of the information is not represented in machine-friendly formats. Instead, this information is represented in formats ideal for human users, resulting in an additional effort for having machines interpreting, extracting, and integrating it, while at the same time ensuring the consistency of information from different sources. In this project, a solution using an ontology-based integration combined with web robots’ extraction automates the process required for updating information regarding schedules of public transports. An already existing application receives that information and uses it to calculate efficient routes for commuters. The proposed solution can extract information from multiple online sources and transform it into different formats. It can extract and transform the information from PDFs and HTML. The system provides a web service for the exportation of these formats by a route optimization system. This document contains the detailed process of the design and construction of the integration system. It describes the alternatives and selections that lead to the application created. Lastly, it evaluates the solution by performing extraction from several sources relevant to the project’s domain.pt_PT
dc.identifier.tid202550087pt_PT
dc.identifier.urihttp://hdl.handle.net/10400.22/16830
dc.language.isoengpt_PT
dc.subjectInformation Retrievalpt_PT
dc.subjectWeb crawlingpt_PT
dc.subjectInformation Integrationpt_PT
dc.subjectOntologypt_PT
dc.subjectPDF extractionpt_PT
dc.titleAutomated integration of transport timetable informationpt_PT
dc.title.alternativeIntegração automatizada de informação de horários de transportespt_PT
dc.typemaster thesis
dspace.entity.typePublication
rcaap.rightsopenAccesspt_PT
rcaap.typemasterThesispt_PT
thesis.degree.nameMestrado em Engenharia Informática - Engenharia de Softwarept_PT

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
DM_JoaoWesterberg_2020_MEI.pdf
Size:
4.69 MB
Format:
Adobe Portable Document Format