Claim Missing Document
Check
Articles

Found 17 Documents
Search

Improving Cervical Cancer Classification Using ADASYN and Random Forest with GridSearchCV Optimization Saputra, Resha Mahardhika; Alzami, Farrikh; Pramudi, Yuventius Tyas Catur; Erawan, Lalang; Megantara, Rama Aria; Pramunendar, Ricardus Anggi; Yusuf, Moh.
Infotekmesin Vol 16 No 1 (2025): Infotekmesin: Januari 2025
Publisher : P3M Politeknik Negeri Cilacap

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35970/infotekmesin.v16i1.2552

Abstract

Cervical cancer is a leading cause of death among women, with over 300,000 deaths recorded in 2020. This study aims to improve the accuracy of cervical cancer diagnosis classification through a combination of Adaptive Synthetic Sampling (ADASYN) and Random Forest algorithm. The research data was obtained from the Cervical Cancer dataset in the UCI Machine Learning Repository with an imbalanced data distribution of 95% negative class and 5% positive class. ADASYN method was chosen for its ability to handle imbalanced data by focusing on minority data points that are difficult to classify. The Random Forest algorithm was optimized using GridSearchCV to achieve maximum performance. Results show that this combination improved accuracy from 96.5% to 96.8% and recall from 93.7% to 94.3%. Feature importance analysis identified key risk factors such as number of pregnancies, age at first sexual intercourse, and hormonal contraceptive use that significantly influence diagnosis. This research demonstrates the effectiveness of combining ADASYN and Random Forest in enhancing classification performance for early cervical cancer detection.
Clustering IT Incidents Using K-Means: Improving Incident Response Time in Service Management Anggraeni, Rini; Alzami, Farrikh; Nurhindarto, Aris; Budi, Setyo; Megantara, Rama Aria; Rizqa, Ifan; Muslih, Muslih
Sinkron : jurnal dan penelitian teknik informatika Vol. 9 No. 2 (2025): Research Articles April 2025
Publisher : Politeknik Ganesha Medan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33395/sinkron.v9i2.14822

Abstract

Incident management is one of the critical processes in Information Technology service management that aims to manage disruptions and minimize the impact of unexpected incidents on business services. This study applies the K-Means algorithm to cluster IT service incidents, aiming to enhance company operational efficiency. Utilizing a dataset from the UCI Machine Learning Repository comprising 141,712 events related to 24,918 incidents, this research analyzes incident patterns and characteristics for optimized handling. The data was analyzed through a series of preprocessing stages, and the elbow and silhouette methods were used to determine the optimal number of clusters. From the results, it was successfully grouped into 4 (four) clusters with a distortion score value of 964264294.569 and 0.52 silhouette score based on incident characteristics, such as urgency, priority, and number of reassignments. From this, the clustering results show that the K-Means algorithm effectively identifies incidents that require further handling, such as those with high urgency and priority, as well as helping the company focus resources to resolve incidents that have the most impact on the business sector. This research provides a data-driven solution to improve incident management and Service Level Agreement (SLA) fulfillment, while offering a framework for more effective and efficient IT incident analysis and resource allocation.
Implementation Of Extreme Gradient Boosting Algorithm For Predicting The Red Onion Prices Saputri, Pungky Nabella; Alzami, Farrikh; Saputra, Filmada Ocky; Andono, Pulung Nurtantio; Megantara, Rama Aria; Handoko, L Budi; Umam, Chaerul; Wahyudi, Firman
Moneter: Jurnal Keuangan dan Perbankan Vol. 11 No. 1 (2023): APRIL
Publisher : Universitas Ibn Khladun Bogor

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (656.456 KB) | DOI: 10.32832/moneter.v11i1.55

Abstract

Red Onion or the Latin name Allium Cepa is included in the group of vegetable plants that are needed by the public for food needs. Red Onions are one of the seasonal crops so their availability can change in the market which causes price instability due to a lack of supply of production by several factors: 1) not yet it's harvest time, 2) crop attacked disease pests and fungi, and 3) weather factor. Therefore, a study is needed to predict red onion prices, so that it can be used as information for the government to stabilize red onion prices. The method used in this study is CRISP-DM and the Extreme Gradient Boosting algorithm to predict the price of red onions by taking data samples from Tegal and Pati Cities. The results of this study are that the Extreme Gradient Boosting algorithm is able to produce Tegal District Root Mean Square Error (RMSE) values of 5107.97% and Mean Absolute Percentage Error (MAPE) values of 0.17%. For prediction results with Pati Regency data samples, it produces a Root Mean Square Error (RMSE) value of 6049.74% and a Mean Absolute Percentage Error (MAPE) of 0.17%.
LDA Topic Analysis for Product Reviews in Social Media Platform Alzami, Farrikh; Megantara, Rama Aria; Prabowo, Dwi Puji; Sulistiyawati, Puri; Pramunendar, Ricardus Anggi; Dewi, Ika Novita; Ritzkal, Ritzkal
Moneter: Jurnal Keuangan dan Perbankan Vol. 11 No. 2 (2023): OKTOBER
Publisher : Universitas Ibn Khladun Bogor

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32832/moneter.v11i2.402

Abstract

Social media in recent years is used as platform for product reviews and customer feedback. Thus, to understand the topic which have been discussed, we utilized Latent Dirichlet Allocation for topic modeling. The topic modeling is important due to it can gain insights into the specific features that customers like or dislike about a particular product. Thus, by not using stop words due it have possibilities remove the time domain, the information can be valuable for businesses as it helps them understand customer preferences and inform product development and marketing strategies with the coherence score 0.621520, the topic modeling obtained 3 optimal topics, where the topic 0 discussed about price and time it will be available to purchase. In topic 1 it discussed about the product is hard to obtain due to it not available in market. In topic 2, it discussed about ownership (what they like after usage).
RFM Analysis for Customer Lifetime Value with PARETO/NBD Model in Online Retail Dataset Megantara, Rama Aria; Alzami, Farrikh; Akrom, Ahmad; Pramunendar, Ricardus Anggi; Prabowo, Dwi Puji; Wibowo, Sasono; Ritzkal, Ritzkal
Moneter: Jurnal Keuangan dan Perbankan Vol. 11 No. 2 (2023): OKTOBER
Publisher : Universitas Ibn Khladun Bogor

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32832/moneter.v11i2.409

Abstract

In recent years, there has been a growing interest in analyzing Customer Lifetime Value (CLV) due to its ability to provide valuable insights into customer profitability and worth. CLV analysis predicts the net profit attributed to the entire future relationship with a customer. This analysis involves calculating the present value of a customer's expected future spending with the company, facilitating an understanding of the economic value of long-term customer relationships. CLV analysis empowers businesses to identify their most profitable customers and develop strategies for retaining them, ultimately maximizing long-term profitability. CLV analysis relies on various models and techniques, including the RFM analysis categorizes customers based on recency, frequency, and monetary value, helping to segment customers and predict future behavior. Then, The Pareto/NBD model combines probability distributions to estimate CLV and is commonly used for customer base analysis. This research article explores the application of RFM analysis for estimating customer lifetime value using the Pareto/NBD model in an online retail dataset. This metric is crucial for businesses as it assists in identifying valuable customers and formulating retention strategies to maximize long-term profitability.
Fairer Public Complaint Classification on LaporGub: Integrating XLM-RoBERTa with Focal Loss for Imbalance Data Zahro, Azzula Cerliana; Alzami, Farrikh; Sani, Ramadhan Rakhmat; Fahmi, Amiq; Megantara, Rama Aria; Naufal, Muhammad; Azies, Harun Al; Iswahyudi, Iswahyudi
Sinkron : jurnal dan penelitian teknik informatika Vol. 9 No. 4 (2025): Articles Research October 2025
Publisher : Politeknik Ganesha Medan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33395/sinkron.v9i4.15260

Abstract

The advancement of digital technology has provided opportunities for governments to improve the quality of public services through citizen complaint channels. One example of this implementation in Indonesia is Lapor Gub, managed by the Dinas Komunikasi dan Informasi Provinsi Jawa Tengah (Communication and Information Agency of Central Java Province). This platform receives thousands of complaints daily, ranging from infrastructure, social issues, to illegal levies. However, the large volume of data and the imbalanced distribution of categories pose significant challenges for both manual and automated processing. This study aims to classify citizen complaint texts using XLM-RoBERTa combined with Focal Loss as an approach to handle data imbalance. The dataset consists of 53,774 complaints after data cleaning and text preprocessing. The training process applied a stratified split (78% training, 18% validation, 10% testing) and fine-tuning for 10 epochs. Model performance was evaluated using accuracy, precision, recall, and macro F1-score. The results show that the model without Focal Loss achieved 78.1% accuracy with a macro F1-score of 0.606, while the model with Focal Loss improved the macro F1-score to 0.625 with 78.5% accuracy. These findings demonstrate that the application of Focal Loss enhances the model’s ability to recognize minority categories without reducing performance on majority classes. Therefore, the combination of RoBERTa and Focal Loss offers an effective solution to support faster, fairer, and more transparent public complaint management.
Enhancing Entity Extraction in E-Government Complaint Data using LDA-Assisted NER Umam, Ahmad Khotibul; Alzami, Farrikh; Sani, Ramadhan Rakhmat; Rohmani, Asih; Prabowo, Dwi Puji; Pergiwati, Dewi; Megantara, Rama Aria; Iswahyudi, Iswahyudi
Sinkron : jurnal dan penelitian teknik informatika Vol. 9 No. 4 (2025): Articles Research October 2025
Publisher : Politeknik Ganesha Medan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33395/sinkron.v9i4.15292

Abstract

With the rapid development of information technology, governments are increasingly challenged to provide digital channels that enhance public participation in governance. LaporGub, an official platform managed by the Central Java Provincial Government, accommodates citizens' aspirations and complaints, but faces challenges in processing large amounts of unstructured text. Manual analysis is time-consuming and error-prone, resulting in delayed responses and decreased service quality. Conventional Named Entity Recognition (NER) models struggle to handle informal Indonesian-language text, while transformer-based approaches require substantial computing resources that are not widely available in local government environments. Therefore, this study aims to develop a lightweight NER approach by integrating Latent Dirichlet Allocation (LDA) as a semantic pre-annotation tool to improve the accuracy of entity extraction in Indonesian e-government complaint data. To achieve this goal, a dataset of 53,858 complaint reports from the LaporGub platform (2022–2025) was processed using LDA topic modeling (k=10) to provide semantic context during annotation. Next, the enriched dataset was used to train a spaCy-based NER model targeting three entity types: LOCATION, ORGANIZATION, and PERSON, with a training-validation-test split ratio of 70:15:15 using stratified sampling. The evaluation showed that the proposed NER+LDA model achieved a precision of 90.03%, a recall of 81.86%, and an F1-score of 85.75%, representing improvements of +5.78, +2.55, and +4.04, respectively, compared to the baseline NER model (F1-score: 81.71%). Furthermore, the most significant improvements occurred in the detection of ORGANIZATION and PERSON entities. These findings confirm that the integration of LDA as a pre-annotation strategy effectively improves NER performance on informal complaint texts in Indonesia, thus offering a practical and resource-efficient alternative to transformer-based methods for e-government applications.