Contact Name
Firdaus Annas
Contact Email
firdaus@uinbukittinggi.ac.id
Phone
+6285278566869
Journal Mail Official
knowbase.uinbukittinggi@gmail.com
Editorial Address
Data Center Building - Kampus II Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi. Jln Gurun Aua Kubang Putih Kecamatan Banuhampu Kabupaten Agam Sumatera Barat Telp. 0752 33136 Fax 0752 22871
Location
Kab. Agam,
Sumatera Barat
INDONESIA
Knowbase : International Journal of Knowledge in Database
ISSN : 27980758     EISSN : 27977501     DOI : https://www.doi.org/10.30983/knowbase
Core Subject : Science
Knowbase : International Journal of Knowledge in Database is a peer-reviewed journal that publishes articles contributing new results in all areas of database management systems and their applications. The goal of the journal is to bring together researchers and practitioners from academia to focus on understanding modern developments in this field and on establishing new collaborations in these areas. Authors are invited to contribute by submitting articles that present research results describing significant advances in the areas of database management systems.
Articles 150 Documents
Optimization Of Agricultural Production In South Sumatera Using Multiple Linear Regression Algorithm
Setiadi, Dedi; Sasmita, Sasmita; Mukti, Yogi Isro
Knowbase : International Journal of Knowledge in Database Vol. 4 No. 2 (2024): December 2024
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v4i2.8754

Abstract

Rice is one of the agricultural commodities in South Sumatra whose productivity still fluctuates. Rice production reached 1,863,643.00 kg in 2000, rose to 3,272,451.00 kg in 2010, and fell to 2,696,877.46 kg in 2020. This instability is influenced by factors such as land area, rainfall, pest attacks, and fertilizer use. This study aims to optimize rice production by applying machine learning with a multiple linear regression algorithm and the CRISP-DM method, whose stages are business understanding, data understanding, data preparation, modeling, evaluation, and implementation. A total of 1,000 records obtained from farmers were analyzed in Google Colaboratory, yielding an intercept of -3836.2639 and coefficients of 5.7336 for land area, 1.2710 for rainfall, 6.1153 for pests, 1.6226 for urea, and 1.2581 for phonska. To evaluate the accuracy of rice production predictions from these independent variables, the RMSE and the coefficient of determination were calculated. The RMSE was 17065084.9641 and the coefficient of determination (R²) was 0.6487, indicating that about 64.87% of the variability in rice production can be explained by land area, rainfall, pest attacks, and the use of urea and phonska fertilizers, while the remaining 35.13% is influenced by other factors.
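As an illustration of how a fitted multiple linear regression of this kind produces a prediction, here is a minimal sketch. It assumes the reported coefficients apply to raw, unscaled inputs (the abstract does not state units or scaling); the function names and dictionary keys are hypothetical and not taken from the paper.

```python
# Values as reported in the abstract, written with decimal points.
INTERCEPT = -3836.2639
COEFFICIENTS = {
    "land_area": 5.7336,
    "rainfall": 1.2710,
    "pests": 6.1153,
    "urea": 1.6226,
    "phonska": 1.2581,
}


def predict_production(features):
    """Return intercept + sum(coefficient * value) over the five predictors."""
    return INTERCEPT + sum(
        COEFFICIENTS[name] * features[name] for name in COEFFICIENTS
    )


def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Hypothetical input record, for illustration only (units are an assumption).
example = {"land_area": 1000, "rainfall": 200, "pests": 5, "urea": 150, "phonska": 100}
print(predict_production(example))
```

An R² of 0.6487 computed this way means the residual sum of squares is about 35% of the total sum of squares, which matches the 64.87% / 35.13% split quoted in the abstract.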
Application of Data Mining for Ceramic Sales Data Association Using Apriori Algorithm
Habibi, M. Ilham; Nazir, Alwis; Haerani, Elin; Budianita, Elvia
Knowbase : International Journal of Knowledge in Database Vol. 4 No. 2 (2024): December 2024
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v5i2.8757

Abstract

This research aims to provide an understanding of consumer purchasing patterns at CV. Sukses Bersama by applying data mining with the association rules method and the Apriori algorithm to identify how the purchase of one item influences the purchase of others in the company's ceramic sales dataset. This information is expected to serve as a foundation for improving sales strategies, optimizing customer satisfaction, and expanding the company's market share. Apriori is a popular algorithm for identifying association rules in data mining; it was chosen for its ability to identify association rules efficiently and its good scalability on large datasets. The research begins with the collection of ceramic sales data, followed by preprocessing to clean and prepare the data. The Apriori algorithm is then applied to discover association rules, which are scored on two metrics, support and confidence, and the results are subsequently evaluated. The work was carried out in Google Colaboratory, a cloud-based platform provided by Google for running Python code. The results show that the Apriori algorithm can reveal significant association structures between ceramic brands in the sales data of CV. Sukses Bersama. The rule with the highest support and confidence, 67% and 84% respectively, is "if you buy the DIAMD brand, you will buy the TOTAL brand".
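The support and confidence of a rule such as "DIAMD implies TOTAL" can be computed directly from transaction data. The sketch below is only an illustration: the transactions are made-up toy data (not the CV. Sukses Bersama dataset), the helper names are hypothetical, and it shows the scoring of a single rule rather than the full Apriori candidate search.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """support(antecedent and consequent) / support(antecedent)."""
    both = support(transactions, set(antecedent) | set(consequent))
    return both / support(transactions, antecedent)


# Toy transactions, invented for illustration.
toy_transactions = [
    {"DIAMD", "TOTAL"},
    {"DIAMD", "TOTAL"},
    {"DIAMD"},
    {"TOTAL"},
    {"OTHER"},
]

print(support(toy_transactions, {"DIAMD", "TOTAL"}))       # share of baskets with both brands
print(confidence(toy_transactions, {"DIAMD"}, {"TOTAL"}))  # share of DIAMD baskets that also hold TOTAL
```

Apriori itself adds a pruning step on top of these measures: only itemsets whose support meets a minimum threshold are extended to larger candidates.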
Performance Comparison of Naïve Bayes and SVM Algorithms in Sentiment Analysis on JKN Application Data
Eka Apriyani, Meyti; Fikri Nur, Amiruddin; Setyo Astuti, Ely
Knowbase : International Journal of Knowledge in Database Vol. 4 No. 2 (2024): December 2024
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v4i2.8758

Abstract

In 2022, 67.88% of Indonesia's population owned mobile devices. BPJS Kesehatan responded to this trend by launching the Mobile JKN application to provide modern, accessible healthcare services. To drive continuous innovation, BPJS Kesehatan needs insights into user feedback regarding the Mobile JKN application. Given the large volume of reviews, sentiment analysis is employed to classify reviews into positive or negative categories. This study compares the performance of Naïve Bayes and SVM (Support Vector Machine) algorithms in sentiment classification using a dataset from the Mobile JKN application. The dataset consists of 200 reviews labeled by two different raters, yielding 110 positive and 90 negative reviews for the first set and 114 positive and 86 negative reviews for the second set. Testing was conducted using three data split scenarios for training and testing: 70:30, 80:20, and 90:10. Model performance was evaluated using a confusion matrix, with metrics including accuracy, precision, recall, and F1-score. The results show that the Naïve Bayes algorithm achieved its best performance with a 90:10 data split, yielding an accuracy of 85%, precision of 77%, recall of 100%, and F1-score of 87%. Conversely, the SVM algorithm performed best with an 80:20 data split, achieving 93% accuracy, 100% precision, 84% recall, and an F1-score of 91% for the first rater's dataset. For the second rater's dataset, SVM reached optimal performance with a 90:10 data split, yielding 90% accuracy, 100% precision, 80% recall, and an F1-score of 89%. Overall, the comparison highlights that SVM outperforms Naïve Bayes in terms of accuracy and precision, making it more effective for predicting positive sentiment in Mobile JKN application reviews.
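The four metrics named in the abstract all derive from the counts in a binary confusion matrix. The following sketch shows the standard definitions; the function names are hypothetical and the example counts are invented, not taken from the study.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


def classification_metrics(tp, fp, fn, tn):
    """Accuracy, precision, recall and F1 from binary confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return {
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "precision": precision,
        "recall": recall,
        "f1": f1_score(precision, recall),
    }


# Invented counts for a 20-review test set, for illustration only.
print(classification_metrics(tp=8, fp=2, fn=0, tn=10))

# F1 recomputed from the precision/recall pairs quoted in the abstract.
for precision, recall in [(0.77, 1.00), (1.00, 0.84), (1.00, 0.80)]:
    print(round(f1_score(precision, recall), 2))
```

Recomputing F1 from the quoted precision and recall values gives 0.87, 0.91 and 0.89, which agrees with the F1-scores reported for Naïve Bayes and for SVM on the two raters' datasets.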
Modelling Time Series Data for Stock Prices Prediction Using Bidirectional Long Short-Term Memory
Syukriyah, Yenie; Purnama, Adi
Knowbase : International Journal of Knowledge in Database Vol. 4 No. 2 (2024): December 2024
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v4i2.8759

Abstract

The dynamic nature of stock markets, characterized by intricate patterns and sudden fluctuations, poses significant challenges to accurate price prediction. Traditional analytical methods are often unable to capture this complexity, which calls for advanced techniques capable of modelling non-linear dependencies. This study aims to build a recurrent neural network model to predict Indonesian stock prices. PT Gudang Garam Tbk.'s (GGRM.JK) stock was selected due to its significant role in the Indonesian stock market and its contribution to national revenue through excise tax. The method involves training a BiLSTM (Bidirectional Long Short-Term Memory) model on historical stock price data with training and test data ratios of 90:10, 80:20 and 70:30 to determine the optimal configuration. The evaluation showed that the 90:10 ratio gave the best performance, with a MAPE of 1.51%, an MAE of 343.55 IDR and an RMSE of 522.30 IDR. These results indicate that the BiLSTM model has high accuracy and minimal prediction errors. Further analysis showed that the model performed optimally with a batch size of 32 and higher epoch counts, such as 200 and 250, which provided greater stability and prediction accuracy. These results demonstrate the potential of the BiLSTM model as an effective predictive tool to support strategic investment decisions, particularly for high-volatility stocks. Future research is recommended to test this model on other stock data and to consider external factors to improve its generalizability.
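The three error measures quoted above have standard definitions, sketched below in plain Python. The function names and sample prices are hypothetical; the sketch does not reproduce the BiLSTM model itself, only how its predictions would be scored.

```python
import math


def mae(actual, predicted):
    """Mean absolute error, in the same unit as the prices (e.g. IDR)."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error; penalises large misses more than MAE."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


def mape(actual, predicted):
    """Mean absolute percentage error; actual values must be non-zero."""
    return 100 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


# Invented prices, for illustration only.
actual_prices = [100.0, 200.0]
predicted_prices = [110.0, 190.0]
print(mae(actual_prices, predicted_prices))
print(rmse(actual_prices, predicted_prices))
print(mape(actual_prices, predicted_prices))
```

Because RMSE squares each error before averaging, it is never smaller than MAE on the same data, which is consistent with the reported RMSE of 522.30 IDR exceeding the MAE of 343.55 IDR.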
Business Intelligence Dashboard Human Resource Capacity to Increase the Capacity City of Bekasi
Prio Pamungkas, R Wisnu; Rakhmi Khalida
Knowbase : International Journal of Knowledge in Database Vol. 4 No. 2 (2024): December 2024
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v4i2.8764

Abstract

A Bekasi City with qualified and evenly distributed human resources will be better able to meet dynamic and complex development needs. Effective data visualization can simplify complex information on HR capacity, such as education levels, skills distribution, and the number of workers in various sectors. This makes it easier for policy makers to design strategies, including identifying how positions are filled by gender, identifying areas that need educational facilities, children's health services, and other infrastructure that supports the growth and development of the younger generation, and developing more effective policies to improve the overall capacity of the city. This research aims to develop a human resource capacity data visualization model as a tool for improving city capacity. It uses Google Looker Studio as the data visualization platform. Data integration follows the Extract, Transform, Load (ETL) method: the data start in Excel, are cleaned and reformatted, and are then loaded into Google Sheets. The data include key variables that describe the characteristics of human resources in the Bekasi City area, such as education, age group, gender, and demographic distribution. The results show that, based on the dashboard visualization, the Bekasi City government can increase the representation of women in supervisory and administrator positions by 10% within 2 years, and that the share of personnel with S2 or S3 (master's or doctoral) education, currently only 5%, needs to rise to support the optimization of HR for strategic positions.
Design and Development of an Online Analytical Processing (OLAP) Application for Customer Profiling Analysis of Insurance "X"
Kesuma Wardanie, Debleng Puja; Kacung, Slamet; Fauzi, Chamdan; Pamudi, Pamudi
Knowbase : International Journal of Knowledge in Database Vol. 4 No. 2 (2024): December 2024
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v4i2.8799

Abstract

Slow and inflexible response times are characteristic of analytical processes that run on transactional (OLTP) databases, as experienced by PT Asuransi "X." The limitation arises because transactional databases are not designed for OLAP, which provides functions for synthesis and analysis that improve response time. This study aims to design and develop an Online Analytical Processing (OLAP) application for customer profiling analysis at insurance company "X." In the insurance industry, effective and efficient data analysis is essential to understand customer behavior, perform segmentation, and make more informed decisions in marketing insurance products. The OLAP application developed in this study integrates several customer data dimensions, such as demographics, claim history, and owned products, enabling multidimensional analysis for its users. The application design process includes system design, data collection, OLAP technology implementation, and application testing. The application shows that the majority of customers are male (56%), aged between 30 and 45 years (45%), and employed in the private sector. In addition, customers in the city of Surabaya show a higher tendency to purchase the Mitra Sakinah life insurance policy. This information enables the company to better understand customer demographic characteristics and tailor its marketing strategies accordingly.
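The multidimensional analysis described above amounts to aggregating customer records along chosen dimensions. The sketch below illustrates a simple roll-up (count by one or more dimensions) in plain Python. The records, field names, and the second product name are invented for illustration; the paper's actual schema and OLAP technology are not described in the abstract.

```python
from collections import Counter


def roll_up(records, *dimensions):
    """Count records grouped by the given dimensions (a simple OLAP roll-up)."""
    return Counter(tuple(record[d] for d in dimensions) for record in records)


# Invented customer records, for illustration only.
customers = [
    {"gender": "male", "age_band": "30-45", "city": "Surabaya", "product": "Mitra Sakinah"},
    {"gender": "male", "age_band": "46-60", "city": "Surabaya", "product": "Mitra Sakinah"},
    {"gender": "female", "age_band": "30-45", "city": "Malang", "product": "Other Plan"},
]

print(roll_up(customers, "gender"))            # one dimension
print(roll_up(customers, "city", "product"))   # two dimensions (a finer drill-down)
```

Dropping a dimension from the call corresponds to rolling up to a coarser view, while adding one corresponds to drilling down.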
Analysis of A Priori Algorithm in Medical Data for Heart Disease Identification with Association Rule Mining
Sutejo, Davip; Yudha Adhi Jaya, Villa Indra; Kacung, Slamet
Knowbase : International Journal of Knowledge in Database Vol. 4 No. 2 (2024): December 2024
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v4i2.8909

Abstract

Heart disease is one of the leading causes of death worldwide, so identifying the risk factors that contribute to its development is important for early prevention. This study aims to identify patterns of association between risk variables and the incidence of heart disease using Association Rule Mining (ARM) with the Apriori algorithm. The data, obtained from the UCI Machine Learning repository, include lifestyle information, medical history, and other health parameters. The analysis showed that, with support values between 30% and 70%, the strongest association rule was found between sex (sex = 1) and angina (exang = 1), with a lift value of 1.67, indicating a strong positive relationship towards a positive diagnosis (target = 1). In addition, other moderate association rules were found, such as the combination of cp_1 = 1 and ca_0 = 1, with a lift value of about 0.73, indicating a weaker association. These findings suggest that some attribute combinations have higher predictive power, which can be used to improve prediction models in the medical diagnosis of heart disease. This research also highlights the main challenges faced by the Apriori algorithm, such as computational complexity and selecting the right threshold to obtain significant rules.
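Lift compares how often the antecedent and consequent occur together with how often they would co-occur if they were independent. The sketch below shows the standard formula on invented rows; the attribute strings mirror the naming in the abstract, but the data and the function name are hypothetical.

```python
def lift(rows, antecedent, consequent):
    """lift = support(A and C) / (support(A) * support(C))."""
    antecedent, consequent = set(antecedent), set(consequent)
    total = len(rows)
    supp_a = sum(1 for r in rows if antecedent <= set(r)) / total
    supp_c = sum(1 for r in rows if consequent <= set(r)) / total
    supp_ac = sum(1 for r in rows if (antecedent | consequent) <= set(r)) / total
    return supp_ac / (supp_a * supp_c)


# Invented one-hot style rows, for illustration only.
toy_rows = [
    {"sex=1", "exang=1", "target=1"},
    {"sex=1", "exang=1", "target=1"},
    {"sex=0", "exang=0", "target=0"},
    {"sex=1", "exang=0", "target=0"},
]

print(lift(toy_rows, {"sex=1", "exang=1"}, {"target=1"}))
```

By the usual convention, a lift above 1 means the items co-occur more often than independence would predict, a lift of 1 means no association, and a lift below 1 (such as the 0.73 quoted above) means they co-occur less often than expected.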
The Effectiveness of ASIAP Digital Innovation in the Management of SPPT PBB in Pekanbaru City: DAIGUSI Analysis of Innovation and User Satisfaction
Rizqi, Ikra Novar; Dytihana, Zahra Aqilah
Knowbase : International Journal of Knowledge in Database Vol. 5 No. 1 (2025): June 2025
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v5i1.9453

Abstract

Digital innovation in public services is increasingly becoming a primary need for efficient and responsive governance. One such innovation is ASIAP (Aplikasi Antar SPPT PBB), a digital system developed by the Pekanbaru City Regional Revenue Agency (Bapenda) to distribute Tax Payable Notification Letters (SPPT) online. However, as a silent innovation, ASIAP is not widely known, despite its significant contribution to regional tax management. This study aims to assess user satisfaction with the ASIAP application and analyze the accompanying innovation diffusion process. The study used a mixed-method approach with quantitative and qualitative methods. Data collection was conducted by distributing questionnaires to ten ASIAP user employees, based on five dimensions in the End-User Computing Satisfaction (EUCS) model: content, accuracy, format, ease of use, and timeliness. In-depth interviews were also conducted with several Bapenda employees to explore the innovation adoption process based on Rogers' innovation diffusion theory. The results showed that users found the ASIAP application quite satisfactory, particularly in terms of information accuracy and ease of use. However, aspects of information up-to-dateness and visual presentation still require improvement. The ASIAP diffusion process is considered to have progressed gradually through five stages of innovation adoption, supported by the role of internal change agents and informal communication between employees. In conclusion, ASIAP is a potential digital innovation for strengthening information technology-based public services at the regional level, but it still requires further development to achieve broader and more equitable benefits.
Implementation of a K-Means-Based Intelligent Patient Complaint Clustering System to Identify Handling Priorities
Ideal, M. Agung vafky; Nurfiah; Idir Fitriyanto
Knowbase : International Journal of Knowledge in Database Vol. 5 No. 1 (2025): June 2025
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v5i1.9529

Abstract

Patient complaints are the body’s response to health disturbances, triggered by internal factors such as genetics or external ones like the living environment. Understanding these causes allows community health centers (puskesmas) to take more effective preventive measures and design more targeted services. This study utilizes patient complaint data sourced from medical records, which include biodata and medical history, as well as complaint details that form the research subject. The main goal of this study is to develop an intelligent system that can generate clusters of patient complaints using the K-Means Clustering algorithm. The system is developed using the Research and Development (RnD) method. The clustering process applies a data mining approach, producing clusters based on patient complaints. A total of 600 complaint records, categorized into 72 distinct types, were used. The output consists of three clusters: C1 (high intensity) with 24 categories, C2 (moderate intensity) with 14 categories, and C3 (low intensity) with 34 categories. A practicality test yielded a score of 0.81, indicating the system is highly practical, while an effectiveness test by medical staff scored 0.88, showing the system is highly effective. This system enables health centers to identify trending complaints in the community and develop more focused prevention and treatment strategies. The clustering results also serve as a valuable foundation for strategic decision-making in disease control.
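To illustrate how K-Means separates items into three intensity groups, here is a minimal one-dimensional sketch. It assumes each complaint category is represented by a single number such as its frequency; the abstract does not specify the features used, so this representation, the sample values, and the function name are all hypothetical.

```python
def kmeans_1d(values, k=3, iterations=100):
    """Plain K-Means on scalar values; requires k >= 2 and iterations >= 1."""
    ordered = sorted(values)
    # Deterministic initialisation: spread starting centroids across the sorted data.
    centroids = [ordered[i * (len(ordered) - 1) // (k - 1)] for i in range(k)]
    for _ in range(iterations):
        # Assignment step: attach each value to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Update step: move each centroid to the mean of its cluster.
        updated = [
            sum(c) / len(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
        if updated == centroids:  # converged
            break
        centroids = updated
    return centroids, clusters


# Invented complaint frequencies per category, for illustration only.
frequencies = [2, 3, 4, 20, 22, 25, 60, 65]
centroids, clusters = kmeans_1d(frequencies, k=3)
print(centroids)
print(clusters)
```

With this initialisation the clusters come out ordered from lowest to highest centroid, so they can be labelled low, moderate, and high intensity by position.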
Artificial Neural Network Prediction Model for Agricultural Commodity Production Using Backpropagation Algorithm
Wahyuni, Rina; Sakti Wira Adi Utomo; TB. Muhammad Endra Zhafir Al Ghifari
Knowbase : International Journal of Knowledge in Database Vol. 5 No. 1 (2025): June 2025
Publisher : Universitas Islam Negeri Sjech M. Djamil Djambek Bukittinggi

DOI: 10.30983/knowbase.v5i1.9530

Abstract

Artificial Intelligence (AI) technology is now widely used by government and society to support daily activities, including decision-making. In Indonesia's agricultural sector, innovations are being implemented using machine learning methods, especially Artificial Neural Networks, to estimate the yield of agricultural commodities. This technology is highly relevant to the agricultural sector, especially since the majority of Indonesians are farmers. With predictions of production and prices, the government can estimate production volumes and promptly set a strategy to keep prices stable. Predictive data on agricultural production are very important for maintaining food availability and preventing price fluctuations that affect society. This study uses data on chili commodities, employing a qualitative method with the Backpropagation algorithm for Artificial Neural Networks. The objective is to generate projections from an Artificial Neural Network (ANN) model in Altair AI Studio with minimal error, so that better prediction values and performance are obtained. Based on the results, the best network architecture is the 12-25-1 model for large chili production and the 12-15-1 model for bird’s eye chili pepper. The model is shown to help related agencies with production planning, supply distribution arrangements, and maintaining price and supply stability.
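An architecture label such as 12-25-1 is conventionally read as 12 inputs, 25 hidden units, and 1 output. The sketch below shows one backpropagation update for a network of that shape in plain Python. The study itself used Altair AI Studio, so everything here is an assumption made for illustration: the sigmoid hidden layer, linear output, squared-error loss, learning rate, weight initialisation, function names, and sample data.

```python
import math
import random


def init_network(n_in=12, n_hidden=25, seed=0):
    """Small random weights for an n_in - n_hidden - 1 network."""
    rng = random.Random(seed)
    return {
        "w1": [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)],
        "b1": [0.0] * n_hidden,
        "w2": [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)],
        "b2": 0.0,
    }


def forward(net, x):
    """Sigmoid hidden layer followed by a linear output unit."""
    hidden = [
        1.0 / (1.0 + math.exp(-(sum(w * xi for w, xi in zip(row, x)) + b)))
        for row, b in zip(net["w1"], net["b1"])
    ]
    output = sum(w * h for w, h in zip(net["w2"], hidden)) + net["b2"]
    return hidden, output


def train_step(net, x, target, lr=0.05):
    """One gradient-descent update for loss = 0.5 * (output - target) ** 2."""
    hidden, output = forward(net, x)
    error = output - target  # derivative of the loss with respect to the output
    for j, h in enumerate(hidden):
        # Backpropagate through the output weight and the sigmoid derivative,
        # using the output weight's value from before this update.
        grad_hidden = error * net["w2"][j] * h * (1.0 - h)
        net["w2"][j] -= lr * error * h
        net["b1"][j] -= lr * grad_hidden
        for i, xi in enumerate(x):
            net["w1"][j][i] -= lr * grad_hidden * xi
    net["b2"] -= lr * error
    return 0.5 * error ** 2


# Invented, already-normalised input vector and target, for illustration only.
network = init_network()
sample, sample_target = [0.1] * 12, 5.0
losses = [train_step(network, sample, sample_target) for _ in range(50)]
print(losses[0], losses[-1])
```

Changing `n_hidden` to 15 gives the 12-15-1 shape reported for bird's eye chili; in practice the hidden-layer size is chosen by comparing validation error across candidate architectures.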