Infotekmesin
Vol 17 No 1 (2026): Infotekmesin: Januari 2026

Studi Perbandingan Kinerja Support Vector Machine Pada Klasifikasi Diabetes Mellitus Menggunakan Fitur Regular Expression dan Non-Regular Expression

Prasetya, Nur Wachid Adi (Unknown)
Wanti, Linda Perdana (Unknown)
Purwanto, Riyadi (Unknown)



Article Info

Publish Date
30 Jan 2026

Abstract

Diabetes mellitus is a rapidly progressing non-communicable disease that significantly affects quality of life. Clinical information in electronic medical records, such as prescriptions and laboratory results, often appears as unstructured text and therefore requires text-mining techniques for accurate classification. This research compares the performance of the Support Vector Machine (SVM) classifier on diabetes mellitus data processed with and without feature extraction using Regular Expressions (Regex). The workflow includes data preprocessing, feature extraction, TF-IDF weighting, model training, and evaluation using accuracy, precision, recall, and F1-score. Results show that both approaches achieve high accuracy (98.8–98.9%), with the non-Regex model performing slightly better at 98.93% compared to 98.83% for the Regex-based model. These findings indicate that SVM is effective for classifying text-based clinical data, while Regex provides potential benefits but requires further optimization to ensure its suitability for various medical text contexts.

Copyrights © 2026






Journal Info

Abbrev

infotekmesin

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering Mechanical Engineering

Description

INFOTEKMESIN is a peer-reviewed open-access journal with e-ISSN 2685-9858 and p-ISSN: 2087-1627 published by Pusat Penelitian dan Pengabdian Masyarakat (P3M) Politeknik Negeri Cilacap. The journal invites scientists and engineers to exchange and disseminate theoretical and practice-oriented in the ...