Ayun Hapsari
Faculty of Law, Social and Political Science, Universitas Terbuka, Tangerang Selatan, Indonesia

Published : 3 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 3 Documents
Search

Comparative Analysis of Trademark Class Identification Using IndoBERT and Multilingual BERT Ayun Hapsari; Suyahman Suyahman
Journal of Artificial Intelligence and Legal Technology Vol. 1 No. 1 (2025): August 2025
Publisher : Sah Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

The rapid growth of trademark registrations in Indonesia has increased the demand for efficient and accurate classification into the internationally recognized NICE system. Manual assignment of classes remains time-consuming and prone to human error, motivating the need for an automated approach. This study investigates the application of Transformer-based language models for trademark class identification based solely on product and service descriptions. Two models were evaluated: the Multilingual BERT (mBERT) and the monolingual IndoBERT, both fine-tuned for sequence classification across 45 NICE classes using 59,948 trademark entries collected from the Directorate General of Intellectual Property (DGIP) database. The research methodology encompassed data preprocessing, stratified train-test splitting (80:20), and tokenization with a maximum sequence length of 64 tokens. Both models were trained for two epochs using the AdamW optimizer, and evaluated with accuracy, precision, recall, F1-score, and per-class accuracy (one-vs-all). Experimental results reveal that IndoBERT significantly outperforms mBERT, achieving an overall accuracy, precision, recall, and F1-score of 0.90, compared to 0.85 for mBERT. IndoBERT demonstrated particularly robust performance in low-support classes, indicating its superior ability to capture domain-specific linguistic nuances in Indonesian trademark descriptions. The findings underscore the potential of monolingual Transformer models for automating trademark classification in national intellectual property systems. The integration of such models can accelerate trademark registration, reduce examiner workload, and enhance consistency in class assignment. These results contribute to advancing the deployment of AI in legal and administrative contexts, while providing a foundation for future work involving multimodal features and explainable AI for comprehensive trademark management solutions.
Optimized Machine Learning with TPE for Air Quality Classification and Public Health Risk Estimation Ayun Hapsari
Journal of Artificial Intelligence and Legal Technology Vol. 1 No. 1 (2025): August 2025
Publisher : Sah Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

Air pollution in rapidly urbanizing cities such as Delhi poses a critical threat to public health due to the persistent exceedance of safe thresholds for particulate matter and gaseous pollutants. Accurate air quality classification and timely health risk estimation are essential to support early warning systems and guide urban policy interventions. This study develops a multi-class Air Quality Index (AQI) classification framework using Logistic Regression, Random Forest, Decision Tree, Support Vector Classifier (SVC), K-Nearest Neighbors (KNN), and Gradient Boosting, applied to a comprehensive dataset of daily pollutant concentrations (PM2.5, PM10, NO₂, SO₂, CO, and O₃) and meteorological parameters from Delhi. Data preprocessing included outlier removal, feature scaling, and label encoding of AQI categories, followed by an 80:20 train-test split to ensure robust model evaluation. Model performance was assessed using Accuracy, Precision, Recall, and F1-score. The experimental results show that ensemble and kernel-based models achieved the highest predictive accuracy, with Random Forest reaching an accuracy of 0.7611 and an F1-score of 0.7522, followed closely by Decision Tree and Gradient Boosting with F1-scores above 0.74. Logistic Regression and SVC maintained moderate yet consistent performance, while KNN was more sensitive to data distribution, achieving an F1-score of 0.605. Confusion matrix analysis revealed that misclassifications were mostly confined to adjacent AQI categories, reflecting the natural difficulty of distinguishing borderline pollution levels. The novelty of this study lies in integrating multi-class AQI classification with a structured machine learning framework capable of mapping environmental conditions directly to health risk levels. By aligning predictions with WHO and US-EPA thresholds, the framework facilitates actionable insights for public health authorities, enabling the design of early warning systems and targeted interventions for vulnerable populations. These findings advance the technical landscape of urban air quality management and provide a scalable foundation for health-oriented environmental decision-making in highly polluted megacities.
Data Analysis and Explainable Machine Learning for Stunting Prediction Ardy Wicaksono; Deny Prasetyo; Yulaikha Mar'atullatifah; Dwi Utari Iswavigra; Himmatunnisak Mahmudah; Ayun Hapsari
Journal of Artificial Intelligence and Legal Technology Vol. 1 No. 1 (2025): August 2025
Publisher : Sah Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

Childhood stunting remains a critical global health concern, reflecting chronic malnutrition that affects both physical growth and long-term cognitive development. Despite ongoing interventions, early detection in many low- and middle-income countries is still hindered by limited resources and the absence of interpretable decision-support tools. This study aims to develop and evaluate an explainable machine learning framework to predict stunting among toddlers using simple anthropometric and demographic data, thereby supporting evidence-based public health interventions. Data were collected from 40,071 children aged 0–59 months in Jeneponto Regency, South Sulawesi, Indonesia, covering the period 2021–2024. Key features included age in months, gender, weight, and height, while stunting status served as the target variable. Several machine learning algorithms were implemented, including Logistic Regression, Support Vector Machine, Multilayer Perceptron, K-Nearest Neighbors, Decision Tree, Random Forest, XGBoost, and Convolutional Neural Network. Data preprocessing involved imputation of missing values, feature encoding, and an 80/20 train-test split, while model interpretability was achieved using SHAP (SHapley Additive exPlanations) to provide both global and local feature attributions. The experimental results show that XGBoost achieved the highest accuracy of 97.57%, followed closely by Random Forest (97.28%) and Decision Tree (96.62%). SHAP analysis revealed that height was the most influential predictor, followed by age, gender, and weight, providing actionable insights for early identification of at-risk children. Local SHAP force plots further enabled case-level interpretation, enhancing the trustworthiness of the model in clinical or community health applications. The novelty of this research lies in integrating high-performing machine learning models with explainable AI for stunting prediction using minimal, easily collected health features in a resource-limited context. This framework not only improves the accuracy and transparency of early stunting detection but also provides a scalable approach to strengthen nutrition surveillance systems, with potential to inform targeted interventions and reduce the long-term impacts of childhood malnutrition.