DEFENSE Committee: ELIDIEL DANTAS DA COSTA

A MASTER'S DEFENSE committee has been registered by the program.
STUDENT: ELIDIEL DANTAS DA COSTA
DATE: 03/06/2026
TIME: 14:30
LOCATION: DIMAp, in person and remote: https://meet.google.com/nuh-dnxr-xsb?hs=122&authuser=1
TITLE:

Seismic Data Lakehouse: An Architecture for Curation, Governance, and Reproducibility of Seismic Data in Northeast Brazil.


KEYWORDS:

Seismological Data Lakehouse; FAIR principles; Data governance; Data lineage; Traceability; Seismological data management


PAGES: 109
MAJOR AREA: Exact and Earth Sciences
AREA: Computer Science
SUBAREA: Computing Methodology and Techniques
SPECIALTY: Databases
ABSTRACT:

The management of seismological data involves significant challenges related to integration, governance, traceability, and reuse within scientific workflows. In traditional environments, these processes are often fragmented and dependent on manual procedures, which limits reproducibility, scalability, and operational efficiency.

This dissertation proposes a domain-oriented Data Lakehouse architecture designed to support the complete lifecycle of seismological data. The architecture integrates data ingestion, storage, processing, and access within a unified and scalable framework, complemented by a transversal governance layer responsible for metadata management and end-to-end data lineage. This approach enables traceability, interoperability, and alignment with FAIR principles.

The solution was implemented using open-source Big Data technologies, including Apache Hadoop, Apache Spark, Apache Airflow, and Apache Atlas, combined with the domain-specific ObsPy library and the MiniSEED data format for seismic data processing.

The evaluation was conducted through a multi-method approach, combining experimental validation with real-world seismic data, quantitative analysis, expert-based assessment, and Fuzzy Comprehensive Evaluation. Results demonstrate that the proposed architecture improves data organization, reduces manual intervention, and enhances traceability when compared to traditional workflows.

Experts reported high acceptance levels, with average scores above 4.6 (on a 5-point scale), and 90% indicated willingness to adopt the solution. These findings confirm that the proposed Data Lakehouse provides a robust, scalable, and FAIR-aligned framework for seismological data management.

COMMITTEE MEMBERS:
Chair - 1221251 - MARTIN ALEJANDRO MUSICANTE
Internal Member - 2195240 - MARCIA JACYNTHA NUNES RODRIGUES LUCENA
External to the Program - 1451214 - ADERSON FARIAS DO NASCIMENTO - UFRN
External to the Institution - CRISTINA DUTRA DE AGUIAR - USP
Notice registered on: 18/05/2026 15:08