Journal : Software Development Digital Business Intelligence and Computer Engineering

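Analisis Persebaran Flora di Sumatera Melalui Sistem Data Lakehouse Menggunakan Interpolasi Spatial Analysis Berbasis Hadoop dan Apache Spark
[English: Analysis of Flora Distribution in Sumatra Through a Data Lakehouse System Using Spatial Analysis Interpolation Based on Hadoop and Apache Spark]
Authors: Elok, Elok Fiola; Asa Do’a Uyi; Dea Mutia Risani; Yohana Manik; Ardika Satria; Luluk Muthoharoh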
Software Development, Digital Business Intelligence, and Computer Engineering Vol. 4 No. 2 (2026): SESSION (MARET)
Publisher : Politeknik Negeri Banyuwangi Jl. Raya Jember km. 13 Labanasem, Kabat, Banyuwangi, Jawa Timur (68461) Telp. (0333) 636780

DOI: 10.57203/session.v4i2.2026.12-20

Abstract

Flora biodiversity on Sumatra Island is under increasing pressure from environmental change, and the capacity to manage large-scale biodiversity data remains limited. Supporting data-driven conservation therefore requires an approach that can integrate and analyze such data efficiently. This study develops a spatial analysis system built on a data lakehouse, using Hadoop and Apache Spark, to map flora distribution in Sumatra. Data from the Global Biodiversity Information Facility (GBIF) for the period 2019–2023 are processed with Apache Spark through an Extract–Transform–Load (ETL) pipeline organized according to the Medallion architecture (Bronze, Silver, Gold). The results show a significant improvement in processing performance, up to 16 times faster, together with a 28% gain in storage efficiency. This improvement enables large-scale data integration, so that flora distribution patterns can be identified more clearly and comprehensively. Analysis of 12,840 species shows that Near Threatened species dominate (58.4%), followed by Least Concern (40.8%) and Endangered (0.7%), with distributions concentrated in the western and central regions of Sumatra. These findings indicate that most of the recorded flora are in a vulnerable condition and confirm the effectiveness of combining a data lakehouse with spatial analysis to support data-driven conservation decision-making.
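
To make the Bronze, Silver, Gold layering described in the abstract concrete, the following is a minimal sketch in plain Python rather than Apache Spark, and it is not the authors' pipeline. The record fields, the placeholder species names, the rough bounding box for Sumatra, and the helper names `to_silver` and `to_gold` are all assumptions introduced for illustration. Only the 2019–2023 period and the idea of summarizing species by conservation status come from the abstract.

```python
from collections import Counter

# Rough bounding box for Sumatra (an assumption for illustration only).
LAT_MIN, LAT_MAX = -6.5, 6.5
LON_MIN, LON_MAX = 94.5, 106.5

# Bronze layer: raw occurrence records kept as ingested (hypothetical data).
BRONZE = [
    {"species": "Species A", "lat": "-3.8", "lon": "102.3", "year": "2021", "iucn": "EN"},
    {"species": "Species B", "lat": "0.5", "lon": "101.4", "year": "2020", "iucn": "NT"},
    {"species": "Species B", "lat": "0.5", "lon": "101.4", "year": "2020", "iucn": "NT"},  # duplicate
    {"species": "", "lat": "1.1", "lon": "99.0", "year": "2022", "iucn": "LC"},  # missing name
    {"species": "Species C", "lat": "abc", "lon": "98.9", "year": "2019", "iucn": "NT"},  # bad coordinate
    {"species": "Species D", "lat": "-0.9", "lon": "100.4", "year": "2018", "iucn": "EN"},  # outside period
]


def to_silver(bronze):
    """Silver layer: typed, validated, de-duplicated records."""
    seen, silver = set(), []
    for rec in bronze:
        try:
            row = (
                rec["species"].strip(),
                float(rec["lat"]),
                float(rec["lon"]),
                int(rec["year"]),
                rec["iucn"],
            )
        except (KeyError, ValueError):
            continue  # drop records that cannot be parsed
        name, lat, lon, year, _status = row
        if not name or not (2019 <= year <= 2023):
            continue  # drop unnamed records and those outside the study period
        if not (LAT_MIN <= lat <= LAT_MAX and LON_MIN <= lon <= LON_MAX):
            continue  # drop records outside the assumed bounding box
        if row in seen:
            continue  # drop exact duplicates
        seen.add(row)
        silver.append(row)
    return silver


def to_gold(silver):
    """Gold layer: share of records per conservation status."""
    counts = Counter(status for *_rest, status in silver)
    total = sum(counts.values())
    return {status: n / total for status, n in counts.items()} if total else {}


silver_rows = to_silver(BRONZE)
gold_summary = to_gold(silver_rows)
print(len(silver_rows), gold_summary)
```

In a Spark-based lakehouse the same three steps would typically be expressed as DataFrame transformations written to separate storage layers; the sketch only shows the order of operations, with cleaning and de-duplication in Silver and aggregation in Gold.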