Garuda - Garba Rujukan Digital

Sistemasi: Jurnal Sistem Informasi

Vol 14, No 3 (2025): Sistemasi: Jurnal Sistem Informasi

Mangalla, Bileam (Unknown)
Suharyadi, Suharyadi (Unknown)

Publish Date
13 May 2025

With its rich diversity of ethnicities, cultures, races, and religions, Indonesia is one of the countries with the highest number of regional languages in the world. This linguistic diversity often leads to communication challenges, particularly when conveying information or engaging in textual conversations. This study aims to identify and classify the Toraja, Batak, and Ambon languages using machine learning-based computational methods. The techniques employed include Decision Tree and Gradient Boost algorithms to evaluate the accuracy of each model. The results demonstrate that both Decision Tree and Gradient Boost are effective in language identification, achieving accuracy rates above 77%. However, based on the confusion matrix analysis, the Gradient Boost method proved to be more effective, with an accuracy rate of 81.06%, compared to 78.39% achieved by the Decision Tree. These findings suggest that Gradient Boost offers better performance for classifying these regional languages.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Sistemasi: Jurnal Sistem Informasi

Website

Abbrev

stmsi

Publisher

Universitas Islam Indragiri

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

Sistemasi adalah nama terbitan jurnal ilmiah dalam bidang ilmu sains komputer program studi Sistem Informasi Universitas Islam Indragiri, Tembilahan Riau. Jurnal Sistemasi Terbit 3x setahun yaitu bulan Januari, Mei dan September,Focus dan Scope Umum dari Sistemasi yaitu Bidang Sistem Informasi, ...

Article Info

Abstract

Classification of Toraja, Batak and Ambon Languages using Decision Tree and Gradient Boost methods

Article Info

Abstract