Logo do repositório
 
Publicação

ViruScopeDB: a comprehensive multi-omics software and database for highly infectious viruses

dc.contributor.advisorCarneiro, João
dc.contributor.advisorSousa, Sérgio
dc.contributor.advisorSá, Vítor
dc.contributor.authorLima, Ana Sofia Fafiães Pires de
dc.date.accessioned2026-02-23T11:52:14Z
dc.date.available2026-02-23T11:52:14Z
dc.date.issued2025-11-07
dc.description.abstractHighly infectious viruses, such as HIV, Ebolavirus, and SARS-CoV-2 have presented ongoing challenges to global health, leaving devastating consequences in their wake. These disruptions were not limited to the health field, as their profound economic and societal impact has been, and will continue to be, felt for decades. Effective surveillance and rapid diagnostics remain crucial to controlling their spread, yet current approaches to oligonucleotide data curation are fragmented, virus-specific, and generally highly dependent on manual approaches. To address these limitations, this thesis presents ViruScope, an automated bioinformatics tool for multi-omics data mining, in silico primer generation, validation, and scoring, together with ViruScopeDB, a centralized and continuously updated cross-viral database built on the results of the tool. The methodology integrates large-scale genomic data retrieval and alignment, literature-based extraction of experimentally validated primers (AROLit v.3), in silico primer generation and filtering (iSOP v.3), and a novel scoring framework, encompassing GC content, melting temperature, cross-species conservation, and a way to rank primers by how likely they are to not form undesirable secondary structures (No-Fold%). These modules, previously designed for specific viruses, were re-engineered into a scalable, optimized and generalized pipeline, deployed as both a command-line program and a user-friendly Shiny application, and compiled into ViruScopeDB as an openly accessible repository. Case studies conducted on Ebola datasets demonstrated the tool’s capacity to retrieve, validate, and score primers from both literature and computational approaches, producing results consistent with experimental best practices while enabling more nuanced ranking of primer candidates. Filtering by conservation scores and No-Fold% - the final and ViruScope-exclusive parameters - revealed gene and region-specific differences in conservation and stability, reinforcing the advantage in automated and scalable approaches for diagnostic tool design. By unifying data retrieval, primer design, and validation into a reproducible, automated workflow, ViruScope and ViruScopeDB provide a novel, scalable, and FAIR (Findable, Accessible, Interoperable, Reusable) solution to some of the challenges of molecular diagnostics. This work demonstrates the potential of computational pipelines to accelerate diagnostic assay development, improve cross-viral surveillance, and strengthen global preparedness for both ongoing and potential emerging viral threats.eng
dc.identifier.tid204177812
dc.identifier.urihttp://hdl.handle.net/10400.22/31875
dc.language.isoeng
dc.relationUIDB/04423/2020 and UIDP/04423/2020, 10.54499/LA/P/0008/2020,10.54499/UIDP/50006/2020, UIDB/04050/2020; UIDB/00319/2020; UIDB/00127/2020
dc.rights.uriN/A
dc.subjectBioinformatic
dc.subjectVirology
dc.subjectDatabase
dc.subjectPCR
dc.subjectOligonucleotide
dc.titleViruScopeDB: a comprehensive multi-omics software and database for highly infectious viruseseng
dc.typemaster thesis
dspace.entity.typePublication
thesis.degree.nameMaster’s degree in Biostatistics and Bioinformatics Applied to Health

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
DISSERTAÇÃO_MBBAS_Ana_Lima_VFINAL.pdf
Tamanho:
14.7 MB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
4.03 KB
Formato:
Item-specific license agreed upon to submission
Descrição: