Repository logo
 
Publication

Machine Reading at Scale: A Search Engine for Scientific and Academic Research

dc.contributor.authorSousa, Norberto
dc.contributor.authorOliveira, Nuno
dc.contributor.authorPraça, Isabel
dc.date.accessioned2023-01-25T11:33:03Z
dc.date.available2023-01-25T11:33:03Z
dc.date.issued2022
dc.description.abstractThe Internet, much like our universe, is ever-expanding. Information, in the most varied formats, is continuously added to the point of information overload. Consequently, the ability to navigate this ocean of data is crucial in our day-to-day lives, with familiar tools such as search engines carving a path through this unknown. In the research world, articles on a myriad of topics with distinct complexity levels are published daily, requiring specialized tools to facilitate the access and assessment of the information within. Recent endeavors in artificial intelligence, and in natural language processing in particular, can be seen as potential solutions for breaking information overload and provide enhanced search mechanisms by means of advanced algorithms. As the advent of transformer-based language models contributed to a more comprehensive analysis of both text-encoded intents and true document semantic meaning, there is simultaneously a need for additional computational resources. Information retrieval methods can act as low-complexity, yet reliable, filters to feed heavier algorithms, thus reducing computational requirements substantially. In this work, a new search engine is proposed, addressing machine reading at scale in the context of scientific and academic research. It combines state-of-the-art algorithms for information retrieval and reading comprehension tasks to extract meaningful answers from a corpus of scientific documents. The solution is then tested on two current and relevant topics, cybersecurity and energy, proving that the system is able to perform under distinct knowledge domains while achieving competent performance.pt_PT
dc.description.sponsorshipThis work has received funding from the following projects: UIDB/00760/2020 and UIDP/00760/2020.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.doi10.3390/systems10020043pt_PT
dc.identifier.urihttp://hdl.handle.net/10400.22/21850
dc.language.isoengpt_PT
dc.peerreviewedyespt_PT
dc.publisherMDPIpt_PT
dc.relationResearch Group on Intelligent Engineering and Computing for Advanced Innovation and Development
dc.relationResearch Group on Intelligent Engineering and Computing for Advanced Innovation and Development
dc.relation.publisherversionhttps://www.mdpi.com/2079-8954/10/2/43pt_PT
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/pt_PT
dc.subjectNatural language processingpt_PT
dc.subjectDeep learningpt_PT
dc.subjectQuestion answering systempt_PT
dc.subjectReading comprehensionpt_PT
dc.subjectInformation retrievapt_PT
dc.subjectMachine reading at scalept_PT
dc.titleMachine Reading at Scale: A Search Engine for Scientific and Academic Researchpt_PT
dc.typejournal article
dspace.entity.typePublication
oaire.awardTitleResearch Group on Intelligent Engineering and Computing for Advanced Innovation and Development
oaire.awardTitleResearch Group on Intelligent Engineering and Computing for Advanced Innovation and Development
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F00760%2F2020/PT
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDP%2F00760%2F2020/PT
oaire.citation.issue2pt_PT
oaire.citation.startPage43pt_PT
oaire.citation.titleSystemspt_PT
oaire.citation.volume10pt_PT
oaire.fundingStream6817 - DCRRNI ID
oaire.fundingStream6817 - DCRRNI ID
person.familyNameGomes Lopes de Sousa
person.familyNameOliveira
person.familyNamePraça
person.givenNameNorberto João
person.givenNameNuno
person.givenNameIsabel
person.identifier299522
person.identifier.ciencia-id091D-1166-0B2E
person.identifier.ciencia-id3E1B-B728-9524
person.identifier.ciencia-idC710-4218-1BFF
person.identifier.orcid0000-0003-2919-4817
person.identifier.orcid0000-0002-5030-7751
person.identifier.orcid0000-0002-2519-9859
person.identifier.ridK-8430-2014
person.identifier.scopus-author-id22734900800
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccesspt_PT
rcaap.typearticlept_PT
relation.isAuthorOfPublication0e10d289-1e87-4729-9e31-b2c274b3f6c4
relation.isAuthorOfPublicatione49f38bc-accb-44eb-8f49-7e7e555f34a5
relation.isAuthorOfPublicationee4ecacd-c6c6-41e8-bca1-21a60ff05f50
relation.isAuthorOfPublication.latestForDiscoverye49f38bc-accb-44eb-8f49-7e7e555f34a5
relation.isProjectOfPublicationdb3e2edb-b8af-487a-b76a-f6790ac2d86e
relation.isProjectOfPublication6eb94c83-adf9-4d9d-a75c-be95f44e3ca5
relation.isProjectOfPublication.latestForDiscoverydb3e2edb-b8af-487a-b76a-f6790ac2d86e

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
systems-10-00043-v3 (2).pdf
Size:
777.77 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: