Jurnal CoreIT
Vol 11, No 2 (2025): December 2025

Combining BERT and Graph-Based Ranking for Extractive Summarization of Indonesian News Articles

Trisna, I Nyoman Prayana
Vihikan, Wayan Oger
Azizah, Anis Zahra Nur



Article Info

Publish Date
30 Dec 2025

Abstract

Automatic text summarization is an effective way to manage the vast amount of information available in the digital age. This study develops an extractive text summarization system for Indonesian news articles that uses sentence embeddings generated by IndoBERT and mBERT, combined with the TextRank and LexRank algorithms for sentence ranking. The dataset is Indonesian Text Summarization (IndoSum), which contains thousands of manually summarized articles. The research stages are data collection, cleaning, preprocessing, embedding extraction, sentence similarity calculation, and ranking with graph-based methods. Model performance was evaluated using ROUGE and BERTScore. The combination of IndoBERT and LexRank achieved the highest performance, with a ROUGE-1 score of 0.7018 and a BERTScore of 0.8696. The model was then implemented in a web prototype built with Streamlit so that users can summarize texts interactively. This study contributes to the advancement of automatic summarization technology for the Indonesian language.
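The similarity and ranking stages named in the abstract can be illustrated with a minimal sketch. It assumes sentence embeddings are already available as plain lists of floats (in the study they would come from IndoBERT or mBERT; here they are toy vectors), builds a cosine-similarity graph, and scores sentences with a PageRank-style power iteration, which is the general idea behind both TextRank and LexRank. The function names, the damping factor of 0.85, and the stopping rule are illustrative assumptions, not details taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_sentences(embeddings, damping=0.85, iterations=100, tol=1e-8):
    """Score sentences by power iteration over a cosine-similarity graph.

    `damping`, `iterations`, and `tol` are assumed defaults for this sketch.
    """
    n = len(embeddings)
    if n == 0:
        return []
    # Edge weights: non-negative cosine similarity, no self-loops.
    weights = [
        [max(cosine(embeddings[i], embeddings[j]), 0.0) if i != j else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    row_sums = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new_scores = []
        for j in range(n):
            incoming = 0.0
            for i in range(n):
                if row_sums[i] > 0.0:
                    incoming += scores[i] * weights[i][j] / row_sums[i]
                else:
                    # Sentence with no similar neighbours: spread its score uniformly.
                    incoming += scores[i] / n
            new_scores.append((1.0 - damping) / n + damping * incoming)
        delta = sum(abs(a - b) for a, b in zip(new_scores, scores))
        scores = new_scores
        if delta < tol:
            break
    return scores


def summarize(sentences, embeddings, k=3):
    """Return the k highest-scoring sentences in their original order."""
    scores = rank_sentences(embeddings)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


# Toy two-dimensional embeddings standing in for real BERT sentence vectors.
toy_sentences = ["Sentence A.", "Sentence B.", "Sentence C."]
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
toy_summary = summarize(toy_sentences, toy_embeddings, k=2)
```

In this toy case the third vector is nearly orthogonal to the other two, so it receives the lowest score and is left out of the two-sentence summary. Restoring the selected sentences to document order is a common choice for extractive summaries, though the paper's abstract does not say how its output is ordered.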

Copyrights © 2025






Journal Info

Abbrev

coreit

Publisher

Subject

Computer Science & IT

Description

Jurnal CoreIT: Jurnal Hasil Penelitian Ilmu Komputer dan Teknologi Informasi is published by the Informatics Engineering Department, Universitas Islam Negeri Sultan Syarif Kasim Riau, with registration numbers Print ISSN 2460-738X | Online ISSN 2599-3321. This journal is published 2 (two) times a year ...