Building of Informatics, Technology and Science
Vol 8 No 1 (2026): June 2026

Segmentasi Pelanggan E-Commerce Berbasis Integrasi Text Mining dan RFM untuk Deteksi Dini Churn

Violin Juneyla Nandita (Universitas Sriwijaya, Palembang)
Juseia Wulandari (Universitas Sriwijaya, Palembang)
Apriyadi Apriyadi (Universitas Sriwijaya, Palembang)
Ali Ibrahim (Universitas Sriwijaya, Palembang)
Fathoni Fathoni (Universitas Sriwijaya, Palembang)



Article Info

Publish Date
05 Jun 2026

Abstract

The growth of transactions on e-commerce platforms generates a massive volume of unstructured customer review data. However, traditional Customer Relationship Management (CRM) models such as RFM often only focus on quantitative transaction data and ignore the emotional dimension contained in customer reviews. This study aims to analyze the relationship between purchase frequency and customer comment polarity through the integration of Text Mining and CRM Analytics approaches. The novelty offered is the development of a hybrid method that combines Lexicon Refinement-based sentiment extraction with the Random Forest algorithm to overcome rating bias in global e-commerce platform data (Kaggle). The proposed method includes the use of Natural Language Processing (NLP) techniques, topic modeling based on Latent Dirichlet Allocation (LDA), and sentiment analysis to extract polarity scores. The test results show that the initial lexicon model has limitations with an accuracy of 52.14% due to noise in neutral reviews (3-star rating). However, after optimization using the Random Forest algorithm and neutral data filtering, the classification accuracy increased significantly to 74.62%. These results prove that sentiment integration is able to provide more accurate loyalty mapping and help e-commerce management detect potential churn in the At-Risk customer segment.

Copyrights © 2026






Journal Info

Abbrev

bits

Publisher

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open access media in publishing scientific articles that contain the results of research in information technology and computers. Paper that enters this journal will be checked for plagiarism and peer-rewiew first to maintain its quality. ...