Richi Nayak
Queensland University of Technology, Brisbane

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Enhancing multi-document summarization through topic–pattern-based sentence selection Shaufiah Shaufiah; Yuefeng Li; Richi Nayak; Yutong Wu
International Journal of Advances in Intelligent Informatics Vol 12, No 2 (2026): May 2026
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

The growing digital text content requires automated summarization systems as fundamental tools to enable users to access information at high speed. The automatic Multi-Document Summarization (MDS) requires systems to produce a summary which combines essential information from multiple documents. The extractive methods which use lexical signals and sentence-based rules only produce repetitive results because they cannot identify complex thematic relationships. This study developed an improved extractive MDS model which combines topic modeling with pattern-based semantic indicators and a method to choose diverse sentences. The model employs LDA to identify concealed thematic structures and retrieves typical word patterns which improve topic models and chooses topics through a greedy algorithm that reduces redundancy to achieve suitable salience and coverage. The proposed system achieves better results than classical baselines in experiments performed on DUC 2006 and DUC 2007 datasets by outperforming Lead and CLASSY04 and KL-SUM and LexRank and TextRank and PETMSUM. The system demonstrates superior performance to all baseline methods by achieving better results in ROUGE-1 and ROUGE-2 and ROUGE-SU4 evaluation metrics. The results show that extractive summarization tasks reach their best results when topic–pattern representations work together with diversity-aware scoring methods.