Portal de Programas de Pós-Graduação (UFRN)

SIGAA - Sistema Integrado de Gestão de Atividades Acadêmicas

PPGBIONF PROGRAMA DE PÓS-GRADUAÇÃO EM BIOINFORMÁTICA DIRETORIA DE ENSINO - IMD Téléphone/Extension: Indisponible E-mail: Indisponible https://posgraduacao.ufrn.br/PPGBIONF

Banca de DEFESA: DANIELA COELHO BATISTA GUEDES PEREIRA

Uma banca de DEFESA de DOUTORADO foi cadastrada pelo programa.
STUDENT : DANIELA COELHO BATISTA GUEDES PEREIRA
DATE: 11/12/2025
TIME: 08:30
LOCAL: ICe/Híbrida
TITLE: Classification of Cancer-Associated Mutations Integrating Machine Learning with Structural and Topological Parameters of Residue Interaction Networks

KEY WORDS:

Missense Mutations, Predictors, Residue interaction Networks, Machine Learning

PAGES: 217
BIG AREA: Ciências Biológicas
AREA: Biologia Geral
SUMMARY:

The large volume of single nucleotide polymorphism data currently available has driven the development of methods capable of distinguishing neutral alterations from those associated with diseases such as cancer. Obtaining experimental evidence on the pathogenicity of variants is a labor-intensive, time-consuming, and costly process. Several in silico tools have been employed for pathogenicity prediction, including PolyPhen-2, PROVEAN, SIFT, FATHMM, MutationTaster, MutationAssessor, and LRT, as well as ensemble-based methods that combine multiple independent predictors, such as ClinPred, MetaLR, and MetaSVM. However, most of these approaches rely primarily on genomic information and allele frequency data. In recent decades, tools that integrate topological features from residue interaction networks (RINs) with outputs from conventional predictors have demonstrated superior performance. The objective of this work was to develop a classification model capable of assessing the impact of structural and topological RIN features on improving the accuracy of mutation classifiers. To this end, curated databases were constructed containing functional predictions, genomic, structural, and functional information associated with 33 cancer types, followed by the application and evaluation of several supervised machine learning algorithms. The results showed that integrating structural and topological parameters derived from RINs enhances the predictive performance of machine learning models in classifying cancer-associated missense mutations. The XGBoost-based model achieved consistent performance, with an accuracy of 74.0%, sensitivity of 73.9%, specificity of 74.1%, and an F1-score of 74.5%. These findings indicate that the proposed model presents a well-balanced trade-off between sensitivity and specificity, avoids bias toward either class, and demonstrates strong generalization capability in a highly heterogeneous scenario comprising multiple genes and distinct tumor contexts.

COMMITTEE MEMBERS:
Interno - 218.705.618-05 - ANDRÉ SALLES CUNHA PERES
Externo à Instituição - GILDERLANIO SANTANA DE ARAÚJO
Presidente - 1513597 - JOAO PAULO MATOS SANTOS LIMA
Externa à Instituição - MARIA FERNANDA RIBEIRO DIAS - UFRJ
Externo à Instituição - SIDNEY EMANUEL BATISTA DOS SANTOS - UFPA
Interna - ***.117.554-** - THAIS GAUDENCIO DO REGO - UFPB

Notícia cadastrada em: 11/12/2025 07:20