Claim Missing Document
Check
Articles

Found 20 Documents
Search

Improving C4.5 Algorithm Accuracy With Adaptive Boosting Method For Predicting Students in Obtaining Education Funding Mohammad Ahmad Maidanul Abrori; Abdul Syukur; Affandy Affandy; Moch Arief Soeleman
Journal of Development Research Vol. 6 No. 2 (2022): Volume 6, Number 2, November 2022
Publisher : Lembaga Penelitian dan Pengabdian Masyarakat Universitas Nahdlatul Ulama Blitar

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.28926/jdr.v6i2.205

Abstract

The level of accuracy in determining the prediction of the provision of educational funding assistance is very important for the education agency. The large number of data on prospective beneficiaries can be processed into information that can be used as decision support in determining eligibility for education funding assistance. The data processing is included in the field of data mining. One method that can be applied in predicting the feasibility of receiving aid funds is classification. There are several classification algorithms, one of which is a decision tree. The famous decision tree algorithm is C4.5. The C4.5 algorithm can be applied in classifying prospective recipients of educational aid funds. This study uses datasets from student data of SMK Al Fattah Kertosono. The purpose of this study is to increase the accuracy of the C4.5 algorithm by applying adaboost in classifying students who deserve education funding and not, by comparing the results before and after applying adaboost. Validation in this study uses cross validation. While the measurement of accuracy is measured by the confusion matrix. The experimental results show that there is an increase in accuracy of 7.2%. The accuracy of the application of the C4.5 algorithm reaches 91.32%. While the accuracy of the application of the C4.5 algorithm with adaboost reached 98.55%.
Improving the Accuracy of C4.5 Algorithm with Chi-Square Method on Pure Tea Classification Using Electronic Nose Mula Agung Barata; Edi Noersasongko; Purwanto; Moch Arief Soeleman
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 7 No 2 (2023): April 2023
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29207/resti.v7i2.4687

Abstract

Tea is one of the plantation products within the Ministry of Agriculture of the Republic of Indonesia, which plays an essential role as a mainstay commodity that boosts the Indonesian economy. Each type of tea has different properties, and the aroma of each type of tea can measure the quality of the tea. The human sense of smell is still very limited in classifying pure types of tea. Therefore, a device is needed to help measure the aroma of tea from an electronic nose. The devices attached to several gas sensors help humans take data from the smell of pure tea and calculate the value of each type of tea to test datasets with data mining algorithms. This study uses the C4.5 algorithm as a classification method with advantages over noise data, missing values, and handling variables with discrete and continuous types. Meanwhile, Chi-square is used to perform attribute severing in the data preprocessing process to increase the accuracy of dataset testing. Testing a pure tea dataset with four whole attributes, namely CO2, CO, H2, and CH4, using the C4.5 algorithm resulted in an accuracy of 93.65% and an increase in the accuracy performance of the C4.5 algorithm by 94.27% with dataset testing using Chi-Square feature selection with the two highest value attributes.
BPNN Optimization With Genetic Algorithm For Classification of Tobacco Leaves With GLCM Extraction Features Kristhina Evandari; M. Arief Soeleman; Ricardus Anggi Pramunendar
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 7 No 2 (2023): April 2023
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29207/resti.v7i2.4743

Abstract

Tobacco leaves are one of the agricultural commodities cultivated by Indonesian farmers. In their application in the field, there are many obstacles in tobacco leaf cultivation, one of which is declining tobacco quality caused by weather factors. In this study, a technology-based analysis step was carried out to determine the classification in determining the quality of tobacco leaves. The research was carried out by applying the classification optimization of the Backpropagation Artificial Neural Network Method and genetic algorithms to determine the weights obtained from extracting GLCM features. You can get the weight value from the genetic algorithm on the homogeneity variable from this analysis step. The variable gets a weight value of 1. The results of this study obtained a classification value with the Backpropagation Artificial Neural Network Method model getting an accuracy value of 53.50% at a hidden layer value of 2,4,5,7. For classification with the Artificial Neural Network Method, Backpropagation, which is optimized with genetic algorithms, you get an accuracy value of 64.50% at the 4th hidden layer value. From this study, the value of optimization accuracy increased by 11% after being optimized with genetic algorithms.
The evaluation of convolutional neural network and genetic algorithm performance based on the number of hyperparameters for English handwritten recognition Muhammad Munsarif; Edi Noersasongko; Pulung Nurtantio Andono; Moch Arief Soeleman
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 12, No 3: September 2023
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v12.i3.pp1250-1259

Abstract

Convolutional neural network (CNN) has been widely applied to image recognition, especially handwritten English recognition. CNN's performance is good if the hyperparameter values are correct. However, the determination of precise hyperparameters is not a trivial task. This task is made more difficult when combined with a larger number of hyperparameters resulting in a high dimensionality of the search space. Usually, hyperparameter optimization uses a finite number. Previous studies have shown that a large number of hyperparameters can result in optimal CNN performance. However, the studies only apply to text mining datasets. This study offers two novelties. First, it applied 20 hyperparameters and their ranges to handwritten English. Second, this paper conducted seven experiments based on different hyperparameters and the number of hyperparameters. This paper also compares the existing methods, namely random and grid search. The experiment resulted in the proposed model being superior to the existing methods. EX3 is better than other experiments and a larger number of hyperparameters and layer-specific hyperparameter values are unimportant.
Comparison of Information Gain and Chi-Square Selection Features For Performance Improvement of Naive Bayes Algorithm On Determining Students With No PIP Recipients at SMKN 1 Brebes Magus Sarasnomo; Muljono Muljono; M. Arief Soeleman
Syntax Literate Jurnal Ilmiah Indonesia
Publisher : Syntax Corporation

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (2224.578 KB) | DOI: 10.36418/syntax-literate.v7i4.6661

Abstract

All policies of the Smart Indonesia Program (PIP) through the form of the Smart Indonesia Card (KIP) are issued by the government under the auspices of the Ministry of Education and Culture (Kemendikbud) through the National Team for the Acceleration of Poverty Reduction (TNP2K). Helping to alleviate the poor category of students in order to obtain a proper education, prevent children dropping out of school, and fulfill their school needs are the goals of the program. This assistance can be used by students to meet all school needs such as transportation costs to go to school, the cost of buying school supplies, and school pocket money. This study aims to compare the Information Gain and Chi-Square selection features to improve the performance of the Naive Bayes algorithm in determining poor students who are recipients of the Smart Indonesia Program (PIP) at SMKN 1 Brebes, to determine the accuracy of the Naive Bayes, Information Gain and Chi-Square algorithms. and compare the level of accuracy and determine the attributes that affect the accuracy. At this stage, collecting relevant and useful research data, which is collected in the form of literature and data, and processed as research material. Sources of data used in this study in the form of primary data collection and secondary data. The primary data collection technique used in this study was a questionnaire or questionnaire, while the secondary data obtained in this study was through document files. At this stage, preliminary data processing is carried out, the data used is student data of SMKN 1 Brebes in 2021. The initial data collection obtained was 703 data, but not all records were used because they had to go through several stages of initial data processing (data preparation). The results of the Naive Bayes algorithm accuracy of 90.31% with an AUC of 0.967, after the addition of the Information Gain selection feature the accuracy becomes 90.88% with an AUC value of 0.970. The addition of the Information Gain selection feature can help improve the classification performance of the Naive Bayes algorithm even though the accuracy is not maximized. The accuracy of the Naive Bayes algorithm is 90.31% with an AUC of 0.967, after the addition of the Chi-Square selection feature the accuracy becomes 90.88% with an AUC value of 0.970. The accuracy results are not maximized but the addition of the Chi-Square selection feature can also improve the classification performance of the Naive Bayes algorithm. The accuracy of the Naive Bayes algorithm is 90.31% with an AUC of 0.967, after the addition of the Information Gain selection feature and the Chi-Square selection feature the accuracy becomes 90.88% with an AUC value of 0.970. The results of the same accuracy in the use of the Information Gain and Chi-Square selection features to increase the performance of the Naive Bayes algorithm by 0.57% although the accuracy results are still less than optimal.
Antlion Optimizer Algorithm Modification for Initial Centroid Determination in K-means Algorithm Nanang Lestio Wibowo; Moch Arief Soeleman; Ahmad Zainul Fanani
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 7 No 4 (2023): August 2023
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29207/resti.v7i4.4997

Abstract

Clustering is a grouping of data used in data mining processing. K-means is one of the popular clustering algorithms, is easy to use, and is fast in clustering data. The K-means method groups the data based on k distances and randomly determines the initial centroid as a reference for processing. Careless selection of centroids can result in poor clustering processes and local optima. One of the improvements in determining the initial centroid on the k-means method is to use the optimization method to determine the initial centroid. The modified Antlion Optimizer (ALO) method is used to improve poor clustering in the initial centroid determination and as an alternative to determining the initial centroid in the k-means method for better clustering results. The results of the research on the use of the proposed method for determining the initial centroid provide an increase in clustering compared to the usual k-means and k-means++ methods. This is evidenced by the evaluation of the sum of intragroup distance (SICD) with UCI datasets, namely iris, wine, glass, ecoli, and cancer, in each method, the best SICD value was obtained in the proposed method. Then measuring the best SICD value for each method and dataset is measured by providing a ranking proving that the proposed method on the iris, wine, and cancer datasets gets the first rank, and on the ecoli and glass datasets the proposed method and the k-means++ method both get the first rank. From the average ranking value, the proposed method is ranked first, which provides evidence that the proposed method can improve the clustering results and can be an alternative method for determining the initial center of a cluster using the k-means method.
Customer Segmentation with RFM Model using Fuzzy C-Means and Genetic Programming Anas Syaifudin; Purwanto Purwanto; Heribertus Himawan; M. Arief Soeleman
MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer Vol. 22 No. 2 (2023)
Publisher : Universitas Bumigora

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30812/matrik.v22i2.2408

Abstract

One of the strategies a company uses to retain its customers is Customer Relationship Management (CRM). CRM manages interactions and supports business strategies to build mutually beneficial relationships between companies and customers. The utilization of information technology, such as data mining used to manage the data, is critical in order to be able to find out patterns made by customers when processing transactions. Clustering techniques are possible in data mining to find out the patterns generated from customer transaction data. Fuzzy C-Means (FCM) is one of the best-known and most widely used fuzzy grouping methods. The iteration process is carried out to determine which data is in the right cluster based on the objective function. The local minimum is the condition where the resulting value is not the lowest value from the solution set. This research aims to solve the minimum local problem in the FCM algorithm using Genetic Programming (GP), which is one of the evolution-based algorithms to produce better data clusters. The result of the research is to compare the application of fuzzy c-means (FCM) and genetic programming fuzzy c-means (GP-FCM) for customer segmentation applied to the Cahaya Estetika clinic dataset. The test results of the GP-FCM yielded an objective function of 20.3091, while for the FCM algorithm, it was 32.44741. Furthermore, evaluating cluster validity using Partition Coefficient (PC), Classification Entropy (CE), and Silhouette Index proves that the results of cluster quality from gp-fcm are more optimal than fcm. The results of this study indicate that the application of genetic programming in the fuzzy c-means algorithm produces more optimal cluster quality than the fuzzy c-means algorithm.
Deteksi Dini Covid-19 Melalui Citra CT-Scan Paru-Paru Menggunakan K-Nearest Neighbor dengan Komparasi Jarak Lu'luul Maknun; Abdul Syukur; Affandy Affandy; Moch Arief Soeleman
Jurnal Indonesia Sosial Teknologi Vol. 3 No. 03 (2022): Jurnal Indonesia Sosial Teknologi
Publisher : Publikasi Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (1049.395 KB) | DOI: 10.59141/jist.v3i03.397

Abstract

Covid -19 yang telah mewabah dan menjadi pandemik secara global yang merupakan masalah utama yang perlu di perhatikan dan di tangani, beberapa cara yang harus di lakukan adalah dengan memutus mata rantai penyebaran virus salah satunya dengan melakukan deteksi dini dan melakukan karantina, dengan CT scan paru-paru. CT scan paru-paru dapat dijadikan jalan alternatif. Berdasarkan permasalahan di atas maka peneliti mengetahui kondisi paru-paru secara detail dan dalam mendiagnosis virus secara dini. Pada penelitian ini pendekatan yang di ajukan menggunakan metode K-NN dengan perhitungan jarak euclidean distance, manhattan distance, miskowski distance untuk deteksi dini Covid -19 melalui citra CT scan paru-paru yang di duga terinfeksi Covid -19 . dalam mendeteksi secara dini evaluasi yang di gunakan untuk mengetahui pervorma yang di usulkan menggunakan coufusion matrix dengan hasil eksperimen menunjukkan hasil dari tiga perhitungan jarak menunjukkan hasil akurasi yang baik dan menggunakan dataset secara publik yaitu euclidean distance berjumlah 83%, Manhattan distance berjumlah 87%, Minkowski berjumlah 76%, di harapkan metode ini dapat di gunakan dan di kembangkan untuk melengkapi dioglosa medis.
Optimizing Stroke Prediction Using Backward Elimination and SMOTE with C4.5 and K-Nearest Neighbors Imam Bagus Pratama; Ahmad Zainul Fanani; M. Arief Soeleman; Via Indriani Kumalasari
Journal of Information System and Informatics Vol 8 No 2 (2026): April
Publisher : Asosiasi Doktor Sistem Informasi Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.63158/journalisi.v8i2.1521

Abstract

Early prediction of stroke risk is crucial for reducing mortality and the burden on the healthcare system, but class imbalance and irrelevant features often compromise model reliability. This study analyzes the impact of Backward Elimination and SMOTE on the performance of the C4.5 and K-NN algorithms in stroke prediction. The study used a fixed working subset of 1,239 data points and evaluated four modeling scenarios using Stratified 10-Fold Cross Validation. Model performance was measured using accuracy, precision, recall, F1-score, and AUC. The results showed that Backward Elimination improved model performance on the analyzed subsets. For C4.5, accuracy increased from 70.94% to 73.05%, stroke recall from 83.94% to 85.14%, and AUC from 0.776 to 0.806. For K-NN, accuracy increased from 72.31% to 74.82% and precision from 39.91% to 42.73%, while stroke recall remained relatively stable at 74.30%. These findings indicate that although the improvements are small numerically, the results remain practically relevant as they enhance the balance between sensitivity and class discrimination capability. In the context of stroke screening, reducing false negatives is more important because it helps minimize undetected high-risk cases, although false positives still need to be considered as a consequence of further testing. Overall, C4.5 with Backward Elimination demonstrates more balanced performance, although the results are still limited to the analyzed subset.
Pengembangan Kreativitas Visual Siswa melalui Pelatihan Media Personal Branding di SMK Al Kautsariyyah Demak Ali Muqoddas; Muslih Muslih; Abi Senoprabowo; Elkaf Rahmawan P.; Puri Sulistiyawati; M. Arief Soeleman; Supriyono Asfawi
ABDIMASKU : JURNAL PENGABDIAN MASYARAKAT Vol 9, No 1 (2026): JANUARI 2026
Publisher : LPPM UNIVERSITAS DIAN NUSWANTORO

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62411/ja.v9i1.3221

Abstract

Transformasi digital dalam industri kreatif menuntut peserta didik pendidikan vokasi, khususnya pada kompetensi Desain Komunikasi Visual (DKV), untuk memiliki kesiapan profesional yang melampaui penguasaan keterampilan teknis semata. Realitas di lapangan menunjukkan bahwa sebagian siswa sekolah menengah kejuruan masih belum memiliki pemahaman yang memadai terkait pembentukan identitas profesional, pengelolaan proses kreatif, serta penyajian karya dalam bentuk portofolio digital yang relevan dengan standar industri. Kegiatan Pengabdian kepada Masyarakat ini dirancang sebagai intervensi edukatif melalui pelaksanaan pelatihan dan pendampingan sistematis yang berfokus pada integrasi personal branding dengan penyusunan portofolio digital siswa DKV di SMK Al Kautsariyyah Demak. Pendekatan kegiatan meliputi penguatan pemahaman konseptual, eksplorasi karakter visual individu, kurasi dan pengemasan karya, serta pendampingan pemanfaatan platform digital sebagai media presentasi profesional. Implementasi program menunjukkan adanya peningkatan kesadaran peserta didik terhadap pentingnya konsistensi identitas visual dan kemampuan menyajikan karya secara terstruktur dan komunikatif. Selain berdampak pada peningkatan kompetensi siswa, kegiatan ini juga menjadi sarana pembelajaran kontekstual bagi mahasiswa dan dosen dalam pelaksanaan tridarma perguruan tinggi serta mendukung capaian Indikator Kinerja Utama (IKU). Dengan demikian, program ini berperan dalam memperkuat keterhubungan antara pendidikan vokasi dan kebutuhan aktual industri kreatif. Untuk luaran kegiatan pengabdian kepada masyarakat ini, juga akan dipublikasikan pada jurnal nasional pengabdian masyarakat ABDIMASKU, hasil dokumentasi pelaksanaan diupload pada youtube, dan HKI. Kegiatan PKM ini laksanakan oleh dosen dan mahasiswa dari program studi Desain Komunikasi Visual dan Teknik Informatika yang terakreditasi unggul sehingga kompeten dibidangnya masing-masing.