Articles

Educational Data Mining: Multiple Choice Question Classification in Vocational School Sucipto Sucipto; Didik Dwi Prasetya; Triyanna Widiyaningtyas
MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer Vol. 23 No. 2 (2024)
Publisher : LPPM Universitas Bumigora

DOI: 10.30812/matrik.v23i2.3499

Abstract

Data mining on student learning outcomes in the education sector can overcome this problem. This research aimed to provide a solution for selecting quality multiple choice questions (MCQ) from the results of students' mid-semester exams in vocational high schools using a data mining approach. The research method used was the Cross-Industry Standard Process for Machine Learning (CRISP-ML) model. Its steps assessed the accuracy of analyzing the difficulty level of questions based on student profile data and midterm test results. The data used in this research were the findings of basic computer tests on mid-term exams in mathematics disciplines at vocational high schools. This research used several classification algorithms, including SVM, Naive Bayes, Random Forest, Decision Tree, Linear Regression, and KNN. The results of evaluating the classification
Anatomy of Sentiment Analysis in Ontological, Epistemological, and Axiological Perspectives Fadli Hidayat, M. Noer; Dwi Prasetya, Didik; Widiyaningtyas, Triyanna; Patmanthara, Syaad
JOIN (Jurnal Online Informatika) Vol 10 No 1 (2025)
Publisher : Department of Informatics, UIN Sunan Gunung Djati Bandung

DOI: 10.15575/join.v10i1.1228

Abstract

The aim of this article was to examine sentiment analysis methods from the perspective of the philosophy of science with three approaches, ontological, epistemological and axiological. This research used a qualitative research method (descriptive-analysis) with an ontological, epistemological and axiological approach that uses library research and document studies of previous research results. Data collection was carried out through books and reputable scientific journals on Scopus, ScienceDirect, IEEEXplore and Springer Link. The results of this research showed that sentiment analysis from an ontological perspective describes the definition, development and relationship of sentiment with social reality. Meanwhile, from an epistemological perspective, sentiment analysis is viewed from how the source of knowledge is obtained, explaining the production of sentiment analysis knowledge, and several ways of working that can be applied in studies. Axiologically, sentiment analysis can see the function and value resulting from sentiment analysis, as well as discussing the results of interpretation from sentiment analysis studies. These findings showed the development of sentiment analysis in answering various problems to improve the quality of sustainable services in various fields.
Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction Fadhlullah, Aufar Faiq; Widiyaningtyas, Triyanna
JTAM (Jurnal Teori dan Aplikasi Matematika) Vol 8, No 4 (2024): October
Publisher : Universitas Muhammadiyah Mataram

DOI: 10.31764/jtam.v8i4.24388

Abstract

Diabetes Mellitus is a long-term medical disorder marked by high blood glucose levels that raise the risk of early mortality and organ failure. It has become an increasing global health problem, so an accurate and timely diagnosis is urgently needed. This study aims to diagnose people with diabetes mellitus by applying data mining prediction techniques in an experimental study. The prediction process for diagnosing diabetes consists of four stages: dataset collection, data pre-processing, data processing, and evaluation. Data were obtained from Electronic Health Records (EHRs), namely the public "Diabetes Prediction Dataset". The pre-processing stage involves data filtering, attribute conversion, and class selection. The data processing stage uses Random Forest and Decision Tree models for diabetes prediction. The models were evaluated using accuracy, precision, and recall metrics. The results showed that the Random Forest algorithm produced an accuracy of 93.97%, precision of 99.88%, and recall of 66.56%, with a computation time of 16s. Meanwhile, the Decision Tree algorithm produced an accuracy of 93.89%, precision of 98.73%, and recall of 66.88%, with a computation time of less than 1s. Based on these results, it can be concluded that the Decision Tree algorithm is more effective: the accuracy, precision, and recall values of the two algorithms do not differ significantly, while the Decision Tree algorithm uses computation time more efficiently, which matters in detecting diabetes because it concerns a person's life.
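The three evaluation metrics named in the abstract can be sketched from confusion-matrix counts. This is a minimal illustration only: the function name and the counts in the usage note are hypothetical and are not taken from the study's data.

```python
def classification_metrics(tp: int, fp: int, fn: int, tn: int):
    """Return (accuracy, precision, recall) from confusion-matrix counts.

    tp/fp/fn/tn are true positives, false positives, false negatives,
    and true negatives. The counts are illustrative inputs, not the
    figures from the study.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    # Precision: share of predicted positives that are truly positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of actual positives that the model found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall
```

For example, `classification_metrics(2, 0, 1, 7)` gives high accuracy and perfect precision while recall stays at two thirds, the same pattern of high precision with lower recall that the abstract reports for both models.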
Comparison of Time Series Algorithms Using SARIMA and Prophet in Predicting Short-Term Bitcoin Prices Brilliant, Muhammad Zidan; Widiyaningtyas, Triyanna; Caesarendra, Wahyu
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 4 (2025): JUTIF Volume 6, Number 4, Agustus 2025
Publisher : Informatika, Universitas Jenderal Soedirman

DOI: 10.52436/1.jutif.2025.6.4.4773

Abstract

Digital finance, particularly Bitcoin, has become a global phenomenon with high volatility, posing great challenges for traders in predicting short-term prices. This study compares the performance of the SARIMA and Prophet algorithms in predicting short-term Bitcoin prices using daily closing price data from October 1, 2014, to October 1, 2024. The study uses two data timeframes for comparative analysis: a 10-year dataset (2014-2024) and the last 5 years (2019-2024). The SEMMA methodology, which consists of the stages Sample, Explore, Modify, Model, and Assess, is used to analyze and compare the two algorithms. The experimental results show that SARIMA provides more stable and consistent results, with a MAPE of 1.24% and RMSE of 896.15 in Scenario 1 and a MAPE of 1.27% and RMSE of 920.24 in Scenario 2. In contrast, Prophet performs differently in each scenario. In Scenario 1, Prophet performs worse than SARIMA, with an average MAPE of 1.74% and an RMSE of 1214.86. In Scenario 2, Prophet performs well, with a lower average MAPE of 0.71% and a smaller RMSE of 489.94, indicating Prophet's ability to handle newer and more dynamic datasets. Both models show their respective advantages: SARIMA is better for long and stable historical data, while Prophet is more effective for shorter and dynamic data. This research provides practical insights for traders and investors in choosing the right prediction model, and its results can support further study in predicting crypto asset prices.
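The two error measures reported in the abstract, MAPE and RMSE, follow their standard definitions. The sketch below is illustrative: the function names and the sample prices in the usage note are assumptions and are not drawn from the study's dataset.

```python
import math


def mape(actual, predicted):
    """Mean absolute percentage error, in percent.

    Assumes no actual value is zero (true for prices).
    """
    errors = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(errors) / len(errors)


def rmse(actual, predicted):
    """Root mean squared error, in the same unit as the data."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))
```

With hypothetical closing prices `[100, 200]` and forecasts `[110, 190]`, `mape` returns about 7.5 and `rmse` returns 10. MAPE is scale-free, whereas RMSE is expressed in price units, which is why the abstract reports both.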
Evaluation of String Matching Algorithms for Plagiarism Detection in Short Academic Texts: A Comparative Study of Levenshtein, SequenceMatcher, and Rabin-Karp Rizal, Muhammad Fatkhur; Widiyaningtyas, Triyanna
Jurnal Informatika Teknologi dan Sains (Jinteks) Vol 7 No 3 (2025): EDISI 25
Publisher : Program Studi Informatika Universitas Teknologi Sumbawa

DOI: 10.51401/jinteks.v7i3.6180

Abstract

Plagiarism in academic assignments is a serious problem that harms the integrity of higher education. This study aims to evaluate the performance of three string matching algorithms, namely Levenshtein, SequenceMatcher, and Rabin-Karp, in detecting plagiarism in short academic texts. The dataset used is the Short Answer Plagiarism Corpus with 100 document pairs. Testing was carried out using Python 3.13.5 with a threshold of 0.8 for Levenshtein and SequenceMatcher and 0.7 for Rabin-Karp. The results show that Levenshtein and SequenceMatcher achieve perfect precision (1.00) but low recall (0.23 and 0.05), whereas Rabin-Karp has the highest recall (1.00) but low precision (0.6). These findings indicate that string matching methods are effective for detecting literal plagiarism (text copied directly from the source) but less effective against paraphrase variation (rewriting or rewording). This study recommends integrating string matching methods with semantic analysis or machine learning for more comprehensive plagiarism detection.
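A threshold check of the kind described in the abstract can be sketched with the standard-library `difflib.SequenceMatcher` and a small Levenshtein implementation. This is not the authors' code: the normalization of the Levenshtein distance to a 0-1 similarity and the `>=` comparison against the threshold are assumptions, and all function names are hypothetical.

```python
from difflib import SequenceMatcher


def levenshtein_distance(a: str, b: str) -> int:
    """Edit distance via the classic two-row dynamic programme."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def levenshtein_similarity(a: str, b: str) -> float:
    """Assumed normalization: 1 - distance / length of the longer text."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein_distance(a, b) / longest


def sequence_similarity(a: str, b: str) -> float:
    """Similarity ratio from difflib.SequenceMatcher (0.0 to 1.0)."""
    return SequenceMatcher(None, a, b).ratio()


def is_flagged(score: float, threshold: float = 0.8) -> bool:
    """Flag a pair as suspected plagiarism at or above the threshold."""
    return score >= threshold
```

Character-level scores such as these drop quickly once a passage is reworded, which is consistent with the low recall the abstract reports for paraphrased text.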
Domination Numbers in Graphs Resulting from Shackle Operations with Linkage of any Graph Saifudin, Ilham; Widiyaningtyas, Triyanna; Rhomdani, Rohmad Wahid; Dasuki, Moh.
JTAM (Jurnal Teori dan Aplikasi Matematika) Vol 8, No 2 (2024): April
Publisher : Universitas Muhammadiyah Mataram

DOI: 10.31764/jtam.v8i2.19675

Abstract

The domination number is the minimum number of dominating nodes needed to dominate the connected nodes around them in a graph. This domination number is denoted by $\gamma(G)$. In this research, we examine the two-distance domination number of graphs resulting from the shackle operation with any graph as linkage. This differs from previous research, namely domination numbers at distance one and two. This study emphasizes how the results of the shackle operation are connected when any graph serves as the linkage between the copies. Any graph here means all connected graphs that are generally accepted. The methods used in this research are pattern recognition and the axiomatic deductive method. The pattern recognition method examines patterns in which a graph's dominating points can dominate the connected points around them with a minimum number of dominating nodes. Meanwhile, the axiomatic deductive method uses the principles of deductive proof that apply in mathematical logic, using existing axioms or theorems to solve a problem. The results are as follows. For graph $S_n$ with $t$ copies and $S_m$ as linkage, the two-distance domination number of the graph resulting from the shackle operation is $\gamma_2(\mathrm{Shack}(S_n, S_m, t)) = t - 1$. For graph $S_n$ with $t$ copies and $C_m$ as linkage, it is $\gamma_2(\mathrm{Shack}(S_n, C_m, t)) = \begin{cases} t, & 3 \le m \le 6 \\ \lceil n/5 \rceil (t-1), & m \ge 7 \end{cases}$. For graph $C_n$ with $t$ copies and $S_m$ as linkage, it is $\gamma_2(\mathrm{Shack}(C_n, S_m, t)) = \begin{cases} t - 1, & n = 3 \\ t, & 4 \le n \le 5 \\ \lceil n/5 \rceil t, & n \ge 6 \end{cases}$. This research adds to results in graph theory on two-distance domination numbers of graphs resulting from the shackle operation with any graph as linkage.
Application of an Inverter-Based Spinner Machine to Increase the Productivity and Quality of Cashew Nuts at the Dininas MSME Widiyaningtyas, Triyanna; Mustika, Soraya Norma; Mahandi, Yogi Dwi; Akbar, Muhammad Iqbal; Sujito, Sujito; Falah, Moh Zainul
Jurnal Pengabdian Pendidikan Masyarakat (JPPM) Vol 4 No 2 (2023): Jurnal Pengabdian Pendidikan Masyarakat Vol 4 No 2 (2023)
Publisher : LPPM UNIVERSITAS MUHAMMADIYAH MUARA BUNGO

DOI: 10.52060/jppm.v4i2.1370

Abstract

Cashew nuts are the fruit of the cashew plant, and the nut itself is its most important product. Cashews are one of the most traded nut products. Dininas is an MSME in Singosari, Malang Regency, that makes processed food products from cashew nuts. The business was started by Mrs. Dini Mei Nastiti and has operated since 2018. It produces around 10 to 50 kg of cashew nuts per day. The problem faced by the partner is the oil-draining process, which still uses the conventional method: the nuts are drained naturally by placing them in a container lined with oil-absorbing paper and then aerating them. The solution offered in this community service program is therefore an inverter-based spinner machine, which is more optimal, hygienic, and efficient and can increase the amount of production. It is hoped that this community service with the inverter-based spinner machine can help the Dininas MSME produce cashew nuts and meet consumer demand. In addition, the machine is expected to increase the productivity, quantity, hygiene, and quality of the cashew nuts, as well as profits for the business managers.
Application-Level Caching Approach Based on Enhanced Aging Factor and Pearson Correlation Coefficient Zulfa, Mulki Indana; Maryani, Sri; Ardiansyah, -; Widiyaningtyas, Triyanna; Ali, Waleed
JOIV : International Journal on Informatics Visualization Vol 8, No 1 (2024)
Publisher : Society of Visual Informatics

DOI: 10.62527/joiv.8.1.2143

Abstract

Relational database management systems (RDBMS) have long served as the fundamental infrastructure for web applications. An RDBMS has relatively slow access speeds because its data is stored on disk. This weakness can be overcome using an in-memory database (IMDB). Each query result can be stored in the IMDB to accelerate future access. However, due to the limited capacity of the server cache in the IMDB, an appropriate data priority assessment mechanism needs to be developed. This paper presents similarCache, a cache framework that considers four data vectors, namely the data size, timestamp, aging factor, and controller access statistics for each web page, which serve as the foundation for determining the replacement policy whenever the content of the server cache changes. The proposed similarCache employs the Pearson correlation coefficient to quantify the similarity levels among the cached data in the server cache. Cached data with the lowest Pearson correlation coefficients are the first to be evicted from memory. The proposed similarCache was empirically evaluated in simulations conducted on four IRcache datasets. The simulation outcomes revealed that the data access patterns and the configuration of the allocated memory cache significantly influenced the hit ratio performance. In particular, the simulations on the SV dataset with the smallest memory space configuration exhibited a 2.33% and 1% superiority over the SIZE and FIFO algorithms, respectively. Future tasks include building a cache that can adapt to data access patterns by determining the standard deviation. The proposed similarCache should raise the Pearson coefficient for often available data to the same level as most accessed data in exceptional cases.
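The eviction idea described in the abstract, removing the cached entry with the lowest Pearson correlation first, can be sketched as follows. The abstract does not say what each entry is correlated against, so this sketch assumes that every entry is scored by its mean Pearson correlation with all other cached entries. That scoring rule, the feature layout, and the names `pearson` and `pick_eviction_candidate` are all assumptions, not the published similarCache algorithm.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # undefined for constant vectors; treated as uncorrelated
    return cov / math.sqrt(var_x * var_y)


def pick_eviction_candidate(cache):
    """Return the key whose mean correlation with the others is lowest.

    `cache` maps a key to a feature vector, assumed here to hold
    [size, timestamp, aging factor, access count] for that entry.
    """
    if not cache:
        raise ValueError("cache is empty")
    scores = {}
    for key, vector in cache.items():
        others = [pearson(vector, v) for k, v in cache.items() if k != key]
        scores[key] = sum(others) / len(others) if others else 0.0
    return min(scores, key=scores.get)
```

A production cache would also track capacity and update the feature vectors on every access; this sketch covers only the choice of which entry to evict.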
Addressing Class Imbalance of Health Data: A Systematic Literature Review on Modified Synthetic Minority Oversampling Technique (SMOTE) Strategies Hairani, Hairani; Widiyaningtyas, Triyanna; Dwi Prasetya, Didik
JOIV : International Journal on Informatics Visualization Vol 8, No 3 (2024)
Publisher : Society of Visual Informatics

DOI: 10.62527/joiv.8.3.2283

Abstract

The Synthetic Minority Oversampling Technique (SMOTE) is the baseline method for solving unbalanced data problems. SMOTE works by generating new synthetic data patterns through linear interpolation between minority class samples based on k-nearest neighbors. However, SMOTE has weaknesses, namely overgeneralization due to excessive sampling of noisy samples and increased overlap between classes in the decision boundary area, which can produce noise data. Based on these weaknesses, the purpose of this research is to conduct a systematic literature review of approaches that modify SMOTE to solve unbalanced data. The systematic literature review method comprises keyword identification, the article search process, determination of selection criteria, and selection results based on the criteria. The results of this study showed that SMOTE modification approaches were based on filtering, clustering, and distance modification to reduce the resulting noise data. The filtering approach removed the noise data before SMOTE, which had a positive impact on resolving unbalanced data. Meanwhile, using a clustering approach in SMOTE can minimize overlapping artificial minority data that has noise potential. The most used datasets are Pima (60%) and Haberman (50%). The most used performance measures on unbalanced data are f1-measure (57%), accuracy (55%), recall (43%), and AUC (27%). The implication of the results of this literature review is to provide opportunities for further research in modifying SMOTE to address health data imbalances, especially in handling noise and overlapping data. The thoroughness of our literature review should instill confidence in the research community.
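The interpolation step that the abstract attributes to baseline SMOTE can be sketched in a few lines. This is a simplified illustration of plain SMOTE, not of any modified variant covered by the review; the function name, the squared Euclidean distance, and the toy data in the usage note are assumptions.

```python
import random


def smote_sample(minority, index, k, rng):
    """Create one synthetic point from minority[index].

    Picks one of the k nearest minority neighbours at random and
    interpolates linearly between the sample and that neighbour.
    """
    x = minority[index]
    others = [p for i, p in enumerate(minority) if i != index]
    # Rank the remaining minority samples by squared Euclidean distance.
    neighbours = sorted(
        others, key=lambda p: sum((a - b) ** 2 for a, b in zip(x, p))
    )[:k]
    neighbour = rng.choice(neighbours)
    gap = rng.random()  # interpolation factor in [0, 1)
    return [a + gap * (b - a) for a, b in zip(x, neighbour)]
```

For instance, `smote_sample([[0.0, 0.0], [1.0, 1.0], [10.0, 10.0]], 0, 1, random.Random(0))` returns a point on the segment between `[0.0, 0.0]` and `[1.0, 1.0]`. If the chosen neighbour is itself noise, the synthetic point inherits that noise, which is the weakness the filtering and clustering variants try to reduce.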
Co-Authors - Ardiansyah, - Abdul Hadi, Afif Adam Ramadhani P Adiba Qonita Ahmad Farobi Ahmad Fuadi Aji P Wibawa Aji Prasetya Wibawa Ali, Waleed Annas Gading Pertiwi Arif Mudi Priyatno Aya Shofia Mufti Bambang Nurdewanto Bintang Romadhon Binti Afifah Brilliant, Muhammad Zidan Budi Wibowotomo Darwis, Herdianti Dasuki, Moh. Didik Dwi Prasetya Ega Gefrie Febriawan Elta Sonalitha Fadhlullah, Aufar Faiq Fadli Hidayat, M. Noer Falah, Moh Zainul Fitriyah Fitriyah Fitriyah Fitriyah Gading Pertiwi, Annas Gamma Fitrian Permadi Hairani Hairani Haviluddin Haviluddin Hazizah, Chalista Yulia Heru Wahyu Herwanto I Made Wirawan Imansyah, Pranadya Bagus Indriana, Poppy Kornelius Kamargo/Irawan Dwi Wahyono Kornelius Kamargo Kurniawan, Rizky Rizaldi M. Ardhika Mulya Pratama M. Zainal Arifin Martin Indra Wisnu Prabowo Maryani, Sri Moh Zainul Falah Moh. Robieth Alfan Alhamid Mohamad Yusuf Kurniawan Muhammad Afnan Habibi Muhammad Firman Aji Saputra Muhammad Iqbal Akbar Muhammad Jauharul Fuady Muhammad Rizki Irwanto Mulki Indana Zulfa, Mulki Indana Mulya Pratama, M. Ardhika Nafalski, Andrew Nazhiroh Tahta Arsyillah Nurhidayati Pindo Tutuko Poppy Indriana Purnawansyah Purnawansyah Qonita, Adiba Raja, Roesman Ridwan Rendy Yani Susanto Rhomdani, Rohmad Wahid Rizal, Muhammad Fatkhur Rosydah, Lucyta Qutsyaning Saifudin, Ilham Satria Putra Pratama Setiadi Cahyono Putro Shandy Krisnawan Sihombing, Wesly M Soenar Soekopitojo Soraya Norma Mustika Suastika Yulia Riska Sucipto Sucipto Sucipto Sucipto Sujito Sujito Syaad Patmanthara Syah, Abdullah Iskandar Syamsul Arifin Utomo Pujianto Wahyu Caesarendra Wahyu Sakti Gunawan Wahyu Sakti Gunawan Irianto Wibawa, Aji P Wisnu Prabowo, Martin Indra Yogi Dwi Mahandi Yuniardini, Fatma