The exponential growth of scientific literature on platforms such as arXiv makes it increasingly difficult to identify and compare key contributions to machine learning across diverse academic domains. To address this, we propose GraphiBERT-ML, a knowledge-enhanced extension of BERT that integrates semantic embeddings extracted from DBpedia to improve named entity recognition (NER) in scientific articles. To the best of our knowledge, this is the first knowledge-enhanced NER model that explicitly integrates DBpedia-based embeddings for large-scale, cross-domain scientific analysis. The model was evaluated on a cross-domain dataset spanning eight fields, including computer science, physics, mathematics, biology, finance, and economics. Experimental results show that GraphiBERT-ML achieves its highest performance in computer science, with an accuracy of 0.9372, an F1-score of 0.9368, and a precision of 0.9376. Physics and mathematics also perform strongly (F1-scores of 0.9115 and 0.8970, respectively), while more heterogeneous domains such as biology and finance score lower (F1-scores of 0.7946 and 0.7872, respectively), reflecting the complexity and variability of their terminology. Across all domains, GraphiBERT-ML consistently outperformed the baseline BERT model, confirming the benefit of external knowledge integration for scientific NER. These findings highlight domain-specific challenges in entity extraction and demonstrate the potential of knowledge-augmented models to advance cross-disciplinary analysis of machine learning research.
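To make the knowledge-enhancement idea concrete, the sketch below shows one simple way a BERT-based token classifier can be fused with per-token knowledge-graph entity embeddings (e.g., vectors derived from DBpedia). This is a minimal illustrative sketch, not the paper's published architecture: the class name `KnowledgeEnhancedNER`, the `kg_embeds` input, the concatenation-plus-projection fusion, and all layer sizes are assumptions introduced here for illustration.

```python
import torch
import torch.nn as nn
from transformers import BertModel


class KnowledgeEnhancedNER(nn.Module):
    """Token classification over BERT, fused with external knowledge-graph
    entity embeddings. Hypothetical sketch: the fusion strategy and
    dimensions are assumptions, not GraphiBERT-ML's actual design."""

    def __init__(self, num_labels, kg_dim=128, bert_name="bert-base-uncased"):
        super().__init__()
        self.bert = BertModel.from_pretrained(bert_name)
        hidden = self.bert.config.hidden_size
        # Project concatenated [BERT; KG] features back to hidden size.
        self.fuse = nn.Linear(hidden + kg_dim, hidden)
        self.dropout = nn.Dropout(0.1)
        self.classifier = nn.Linear(hidden, num_labels)

    def forward(self, input_ids, attention_mask, kg_embeds):
        # kg_embeds: (batch, seq_len, kg_dim), the embedding of the DBpedia
        # entity linked to each token (zeros where no entity is linked).
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        fused = torch.tanh(
            self.fuse(torch.cat([out.last_hidden_state, kg_embeds], dim=-1))
        )
        # Per-token label logits: (batch, seq_len, num_labels).
        return self.classifier(self.dropout(fused))
```

Concatenation followed by a linear projection is only one fusion choice; gating or attention over candidate linked entities are common alternatives in the knowledge-enhanced NER literature.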