With 5+ years of experience using machine learning models and statistical analysis to develop predictive models, optimise data handling processes and create meaningful visualisations. Proficient in Python and cloud technologies with an interest in data analysis.
Aveiro, Portugal 🇵🇹 | Chelmsford, England 🏴
Senior Science Officer - ELIXIR Hub
January 2025 - Present
Bioinformatician / Data Scientist - Prana-Tech, Ltd
September 2023 - December 2024
Project Manager / Junior Software Developer - PromptEquation, Lda
September 2021 - December 2024
Research Fellow - Catholic University of Portugal
September 2015 - August 2021
Research Assistant Lecturer - Catholic University of Portugal
September 2017 - August 2018
Spetember 2020 - August 2021
Programming | Python, R, Julia, Bash/Shell scripting |
Data Analysis | Genomic and proteomic data processing |
Statistics & Machine Learning | Biostatistics, scikit-learn, PyTorch, Bioconductor |
Visualization | Matplotlib, ggplot2, Seaborn, Cytoscape |
Database Management | SQL, PostgreSQL |
Cloud & Containers | Google Cloud, Docker |
Version Control | Git, GitHub, GitLab |
Operating Systems | Linux, UNIX |
Proteomics | Protein quantitation, identification (MS), purification (HPLC/FPLC) |
Genomics | DNA isolation, PCR, RT-PCR, sequence alignment, NGS analysis |
Interactomics | Host-pathogen interactions, Cytoscape, GO enrichment |
Network & Pathway Analysis | Gene regulatory networks, KEGG, Reactome |
Course | Position | Institution | Start Date | End Date |
---|---|---|---|---|
Biomolecular Laboratories II | Assistant Lecturer | Catholic University of Portugal | September 2020 | August 2021 |
Molecular Biology | Assistant Lecturer | Catholic University of Portugal | September 2020 | August 2021 |
Seminars Projects | Assistant Lecturer | Catholic University of Portugal | September 2020 | August 2021 |
PhD in Biomedicine | University of Beira Interior (July 2023) |
MSc in Cell and Molecular Biology | University of Aveiro (December 2014) |
BSc in Biomedical Sciences | Catholic University of Portugal (July 2012) |
Alongside Prana-Tech we aim to develop and apply Machine-Learning methods to multiple disease cnditions and merge into a user-centric health app.
Technologies | Data analysis, Biostatistics, Machine-Learning. |
Roles | Bioinformatician |
Status | Ongoing since September 2023 |
Project funded by CENTRO2020 and the European Regional Development Fund (ERDF), this platform will allow patients to access their oral health history, including diagnoses and treatments carried out, in a clear and objective way. This record can be shared with the new dentist, contributing to correct clinical decision making.
Technologies | Django (backend) and Vue (frontend). |
Role | Project Manager |
Status | Complete July 2023 |
Project Home page | Project on the media |
The analysis of NGS microbial hypervariable regions of 16S rDNA focused on evaluating bacterial species diversity within samples using alpha diversity metrics like Shannon, Simpson, and Chao1. These metrics assess species richness and evenness (Shannon and Simpson) or species richness alone (Chao1). The study compared diversity in different sample types (saliva and biofilm) and between initial (T0) and final (T2 or T3) treatment time points to observe changes in microbial communities over time.
Technologies | Genomic Data analysis (NGS), Biostatistics |
Roles | Bioinformatician |
Status | Complete March 2024 |
This study combined proteomics and in silico interactomics to generate a salivary protein profile for COVID-19. Techniques such as partial least squares discriminant analysis and enrichment analysis with FunRich were used to narrow down the differential proteome. OralInt was used to investigate protein-protein interactions between the host and SARS-CoV-2. The analysis identified five dysregulated biological processes in COVID-19: apoptosis, energy pathways, immune response, protein metabolism and transport. Ten proteins not previously associated with SARS-CoV-2 were discovered, revealing new aspects of the virus’ effects. The study also showed that SARS-CoV-2 affects the host’s immune response, energy metabolism and apoptosis mechanisms.
Technologies | Proteomic Data analysis, Biostatistics, Machine-Learning, Network & Pathway Analysis, Interactomics |
Roles | Research Scientist, Bioinformatician |
Status | Complete September 2022 |
The CoVTec project focuses on developing protocols for processing saliva, detecting SARS-CoV-2, and assessing the host immune response using saliva samples. It has established an R&DT platform to support healthcare institutions in the Viseu region of Portugal, enhancing diagnostic strategies for current and future pandemics. The project also created a platform for real-time data collection and randomization, enabling streamlined operations and controlled data access for medical staff and laboratory personnel based on pre-defined roles.
Technologies | Python in software development and in data analysis. |
Roles | Research Scientist, Bioinformatician |
Status | Complete July 2020 |
How Odoo helped creating a saliva tests platform in record time
SalivaPRINT Toolkit enables the analysis of protein profile patterns by the identification of molecular weight ranges altered in a particular condition and therefore potentially involved in the underlying dysregulated mechanisms. SalivaPRINT Toolkit is a CLI for electrophoretic protein profile evaluation.
Technologies | Python in software development and in data analysis. |
Roles | Research Scientist, Bioinformatician |
Status | Complete January 2017 |
Member of Direction of Beira-Mar Squash team | Aveiro (September 2024) |
Hackathon finalist | Aveiro (October 2024) |