Claim Missing Document
Check
Articles

Found 5 Documents
Search

Application of Data Mining Using the K-Means Clustering Algorithm for Opening Industrial Classes in Vocational High Schools Aan Rosydiana; Dian Sediana; Christina Juliane
Indonesian Journal of Artificial Intelligence and Data Mining Vol 5, No 2 (2022): September 2022
Publisher : Universitas Islam Negeri Sultan Syarif Kasim Riau

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24014/ijaidm.v5i2.19172

Abstract

Vocational High School has a goal to enter the world of work, meaning that it must have a skill program to be relevant to the industrial world. However, adapting to the industrial world is difficult, one of the things that is happening between industries is increasing. Various efforts continue to be made, among others, by establishing an industrial class, the formation of an industrial class is expected to produce students who have competencies in accordance with the standards required by the collaborating industries. The formation of an industrial class can be done by applying data mining methods, in order to form the right industrial class and in accordance with predetermined criteria. This study aims to classify new student registration data at State Vocational Schools at the Regional Education Office XIII Branch of West Java Province in 2022 and the results of the grouping are used to form industrial classes. The clustering process is carried out using the K-Means algorithm and cluster analysis is carried out with the help of RapidMiner software. The results showed that the data clustering was formed into 4 clusters. The cluster that has the highest number is cluster 1 and the cluster that has the lowest number is cluster 0. There are variables used for data grouping, including school variables and expertise programs, from these variables it is obtained that the schools selected by students are based on the highest order and have expertise programs contained in their clusters, which need to be considered when opening industrial classes.
Pemanfaatan Manajemen Pengetahuan untuk Membantu Persiapan Data pada Proses Data Mining Yusuf Bayu Wicaksono; Christina Juliane
JATISI (Jurnal Teknik Informatika dan Sistem Informasi) Vol 10 No 1 (2023): JATISI (Jurnal Teknik Informatika dan Sistem Informasi)
Publisher : Lembaga Penelitian dan Pengabdian pada Masyarakat (LPPM) STMIK Global Informatika MDP

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35957/jatisi.v10i1.2424

Abstract

The data mining process always involves a data preparation stage. Based on the experience of IBM data mining practitioners, 40-70% of data mining project time is spent on data preparation. This is because not everyone knows what the content of the available data is, so it will take time just to understand the data itself. The research method used adopts an information systems research framework, by comparing the knowledge base (data mining) with environmental facts (the duration of data preparation). Design/research is made using a knowledge management approach designed for data. Two qualitative and quantitative tables containing data related knowledge are used as an explicit form of data. With this knowledge the data preparation process can be shortened because miners are not mining data from zero knowledge.
Comparative Analysis of Various Ensemble Algorithms for Computer Malware Prediction Yusuf Bayu Wicaksono; Christina Juliane
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 7 No 3 (2023): Juni 2023
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29207/resti.v7i3.4492

Abstract

By 2022 it is estimated that 29 billion devices have been connected to the internet so that cybercrime will become a major threat. One of the most common forms of cybercrime is infection with malicious software (malware) designed to harm end users. Microsoft has the highest number of vulnerabilities among software companies, with the Microsoft operating system (Windows) contributing to the largest vulnerabilities at 68.85%. Malware infection research is mostly done when malware has infected a user's device. This study uses the opposite approach, which is to predict the potential for malware infection on the user's device before the infection occurs. Similar studies still use single algorithms, while this study uses ensemble algorithms that are more resistant to bias-variance trade-off. This study builds models from data on computer features that affect the possibility of malware infection on computer devices with Microsoft Windows operating system using ensemble algoritms, such as Bagging Classifier, Random Forest, Light Gradient Boosting Machine, Extreme Gradient Boosting Machine, Category Boosting, and Stacking Classifier. The best model is Stacking Classifier, which is a combination of Light Gradient Boosting Machine and Category Boosting Classifier, with training and test results of 0.70665 and 0.64694. Important features have also been identified as a reference for taking policies to protect user devices from malware infections.
THE EFFECT OF AMOUNT OF DATA ON RESULTS OF ACCURACY VALUE OF C4.5 ALGORITHM ON STUDENT ACHIEVEMENT INDEX DATA Anton Sunardi; Sienny Rusli; Christina Juliane
Jurnal Riset Informatika Vol. 4 No. 2 (2022): March 2022
Publisher : Kresnamedia Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34288/jri.v4i2.157

Abstract

Of the many academic data, data in the form of an achievement index needs to be used in-depth so that it does not become a display of numbers and information only. This achievement index evaluation data reflects the educational process students and teaching staff carries out in an educational process. This study aims to measure the accuracy of data mining processing based on differences in test data by analyzing the C4.5 algorithm using RapidMiner as a data processing tool and determining the decisions students can make and academic institutions in developing study strategies and educational curricula to be maximized. The data processing is carried out by classifying the student achievement index data at a private university using data analysis test equipment. The data source comes from kaggle.com, which consists of 1687 data that have been processed and processed. The conclusion from the results of this study is that the amount of data turns out to have a significant influence on the accuracy value of the C4.5 algorithm, where an accuracy rate of 91.69% is obtained from the test results of 1687 data with four main attributes, namely IPK1, IPK2, IPK3, IPK4 and correctly or not as a label.
Penerapan Forecasting Menggunakan Metode Time Series Untuk Menentukan Proyeksi Sales di Perusahaan Manufacturing Furniture Lukito Angga Prasakti; Christina Juliane
Smart Comp :Jurnalnya Orang Pintar Komputer Vol 12, No 2 (2023): Smart Comp: Jurnalnya Orang Pintar Komputer
Publisher : Politeknik Harapan Bersama

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30591/smartcomp.v12i2.5165

Abstract

Jumlah penduduk yang tidak sedikit tentu mendorong para perusahaan termasuk perusahaan nanufaktur untuk terus mengembangkan produksinya baik secara kualitas dan kuantitas, apalagi jumlah perusahaan dengan fokus yang sama cukup banyak. Hal ini dikarenakan, setiap perusahaan  terntu ingin memperoleh keuntungan yang banyak dan minim adanya keluhan konsumen atau pelanggan. Salah satu cara yang dianggap dapat mengatasi hal tersebut ialah dengan melakukan kebijakan perusahaan mengacu pada peramalan penjualan produk di masa mendatang. Oleh karena itu, peneliti ingin mengetahui lebih lanjut mengenai penerapan forecasting untuk menentukan  proyeksi sales perbulan pada tahun berikutnya di Perusahaan Manufacturing Furniture. Tujuannya untuk mengetahui peran forecasting dalam membuat kebijakan atas produksi perusahaan pada waktu berikutnya dengan mempertinbangkan proyeksi sales yang didasari hasil forecasting perusahaan. Metode yang digunakan ialah Time Series dengan pengumpulan data melalui dokumentasi pada reguler local market tahun 2022 tepatnya 12 bulan. Setelah data terkumpul maka akan dianalisis secara mendalam sehingga diketahui hasil penelitian bahwa penetapan forecasting secara teliti maka akan menghasilkan peramalan yang tidak jauh dari kenyataan dan dapat membantu dalam menghitung proyeksi sales perusahaan manufacture bidang furnitur pada waktu berikutnya, dengan nilai MAPE 0,06