Software Development Digital Business Intelligence and Computer Engineering
Vol. 4 No. 2 (2026): SESSION (MARET)

Analisis Persebaran Flora di Sumatera Melalui Sistem Data Lakehouse Menggunakan Interpolasi Spatial Analysis Berbasis Hadoop dan Apache Spark

Elok, Elok Fiola (Unknown)
Asa Do’a Uyi1 (Unknown)
Dea Mutia Risani (Unknown)
Yohana Manik (Unknown)
Ardika Satria (Unknown)
Luluk Muthoharoh (Unknown)



Article Info

Publish Date
01 Apr 2026

Abstract

Flora biodiversity on Sumatra Island is increasingly under pressure due to environmental changes and the limited ability to manage large-scale biodiversity data. This condition requires an approach that can efficiently integrate and analyze data to support data-driven conservation. This study aims to develop a spatial analysis system based on a data lakehouse using Hadoop and Apache Spark to map flora distribution in Sumatra. Data processing is carried out using the Medallion architecture (Bronze, Silver, Gold) and the Extract–Transform–Load (ETL) process with Apache Spark on data from the Global Biodiversity Information Facility (GBIF) for the period 2019–2023. The results show a significant improvement in processing performance, up to 16 times faster, with storage efficiency increased by 28%. This improvement enables large-scale data integration, allowing flora distribution patterns to be identified more clearly and comprehensively. Analysis of 12,840 species shows a dominance of Near Threatened (58.4%), followed by Least Concern (40.8%) and Endangered (0.7%), with distributions concentrated in the western and central regions of Sumatra. These findings indicate that most flora are in a vulnerable condition and confirm the effectiveness of integrating data lakehouse and spatial analysis in supporting data-driven conservation decision-making.  

Copyrights © 2026






Journal Info

Abbrev

session

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering Engineering Other

Description

SESSION: Software Development, Digital Business Intelligence, and Computer Engineering. Jurnal SESSION adalah salah satu jurnal open-access yang dikelola oleh tim dari Jurusan Teknik Informatika Politeknik Negeri Banyuwangi. Jurnal ini dalam satu tahun terbit 2 kali. Aim and Scope Software ...