Diana Purwitasari
Teknik Informatika, Institut Teknologi Sepuluh Nopember

Published : 2 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 2 Documents
Search

EKSTRAKSI TRENDING ISSUE DENGAN PENDEKATAN DISTRIBUSI KATA PADA PEMBOBOTAN TERM UNTUK PERINGKASAN MULTI-DOKUMEN BERITA Christian Sri Kusuma Aditya; Chastine Fatichah; Diana Purwitasari
JUTI: Jurnal Ilmiah Teknologi Informasi Vol 14, No. 2, Juli 2016
Publisher : Department of Informatics, Institut Teknologi Sepuluh Nopember

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.12962/j24068535.v14i2.a570

Abstract

Penggunaan trending issue dari media sosial Twitter sebagai kalimat penting efektif dalam proses peringkasan dokumen dikarenakan trending issue memiliki kedekatan kata kunci terhadap sebuah kejadian berita yang sedang berlangsung. Pembobotan term dengan TFIDF yang hanya berbasis pada dokumen itu tidak cukup untuk menentukan in-deks dari suatu dokumen. Penentuan indeks yang akurat juga bergantung pada nilai informatif suatu term terhadap kelas atau cluster. Term yang sering muncul di banyak kelas atau cluster seharusnya tidak menjadi term yang penting meskipun nilai TFIDF-nya tinggi. Penelitian ini bertujuan untuk melakukan peringkasan multi dokumen berita menggunakan ekstraksi trending issue dengan pendekatan term distribution on centroid based (TDCB) pada pembobotan fitur dan mengintegrasikannya dengan query expansion sebagai kata kunci dalam peringkasan dokumen. Metode TDCB dilakukan dengan mempertimbangkan adanya kemunculan sub topic dari cluster hasil pengelompokan tweets yang dapat dijadikan nilai informatif tambahan dalam penentuan pembobotan kalimat penting penyusunan ringkasan. Tahapan yang dilakukan untuk menghasilkan ringkasan multi dokumen berita antara lain ekstraksi trending issue, query expansion, auto labelling, seleksi berita, ekstraksi fitur berita, pembobotan kalimat penting dan penyusunan ringkasan. Hasil percobaan menunjukan metode peringkasan dokumen dengan menambahkan nilai informatif sub topic trending issue NeFTIS-TDCB menunjukan nilai rata-rata max-ROUGE-1 terbesar 0.8615 untuk n=30 dari seluruh varian topik berita.
K-MEANS AND XGBOOST FOR CUSTOMER ELECTRICITY ACCOUNT PAYMENT BEHAVIOR ANALYSIS (CASE STUDY: PLN ULP PANAKKUKANG) Raditya Hari Nugraha; Diana Purwitasari; Agus Budi Raharjo
JUTI: Jurnal Ilmiah Teknologi Informasi Vol. 20, No. 2, July 2022
Publisher : Department of Informatics, Institut Teknologi Sepuluh Nopember

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.12962/j24068535.v20i2.a1132

Abstract

Revenue Acceleration from electricity account receivables is one of the energy companies' efforts to maintain cash flow so that they can carry out operational activities and carry out investment activities to develop company assets. Factors that influence electricity bill payment behavior include the location of consumers, the amount of the bill, payment point facilities located around consumers' homes, the use of digital technology as a media of payment, as well as consumer awareness and understanding regarding the time limit for paying electricity bills. Therefore, it is necessary to conduct an analysis so that the company can determine a special strategy for customers who have the potential to be in arrears in electricity bills. To get the characteristic of electricity bill payments, several previous studies have used various classification methods of machine learning such as random forest, nave bayes, SVM, CART, etc. to get the best accuracy. In this research, to increase the accuracy of the model, author using the cluster method with the k-means technique and combining it with the eXtreme Gradient Boosting (XGBOOST) classification method based on data on the characteristics of consumer electricity bill payments. In this study also used hyperparameter adjustment with hillclimbing, random search, and bayesian techniques to increase the accuracy of the model. The model simulation carried out in this thesis gives the result that the combination of the k-means cluster with the XGBoost classification and by adjusting the bayesian technique hyperparameters has a much better model accuracy rate with a value of 89.27% and an Area Under Curve (AUC) value of 0.92 when compared to gradient boosting method with an accuracy rate of only 74.76% and an AUC value of 0.75. Based on the simulation results on ULP Panakkukang customer data, it was found that the subsidy category customer group and customers who often experience power outages have a tendency to be in arrears on electricity bills.