Contact Name
Teuku Rizky Noviandy
Contact Email
trizkynoviandy@gmail.com
Phone
+6282275731976
Journal Mail Official
editorial-office@heca-analitika.com
Editorial Address
Jl. Makam T. Nyak Arief Kompleks BUPERTA Blok L7B, Lamgapang, Aceh Besar, Provinsi Aceh
Location
Kab. Aceh Besar,
Aceh
INDONESIA
Infolitika Journal of Data Science
ISSN: -     EISSN: 3025-8618     DOI: https://doi.org/10.60084/ijds
Infolitika Journal of Data Science is an international scientific journal that publishes high-caliber original research articles and comprehensive review papers in the field of data science. The journal's core mission is to stimulate interdisciplinary research collaboration, facilitate the exchange of knowledge, and drive the advancement and application of innovative strategies within the data science domain. Topics of this journal include, but are not limited to: Data Mining and Analysis, Machine Learning and Artificial Intelligence, Big Data and Data Engineering, Predictive Modeling and Forecasting, Natural Language Processing, Computer Vision, Data Visualization and Interpretation, Ethics and Privacy in Data Science, Applications of Data Science, and Interdisciplinary Approaches.
Enhancing the Red Wine Quality Classification Using Ensemble Voting Classifiers Supriatna, Deny Joefakri Iwa; Saputra, Huzair; Hasan, Khaidir
Infolitika Journal of Data Science Vol. 1 No. 2 (2023): December 2023
Publisher : Heca Sentra Analitika

DOI: 10.60084/ijds.v1i2.95

Abstract

This study introduces an ensemble voting classifier for red wine quality classification using machine learning algorithms. Wine quality assessment, traditionally reliant on subjective expert evaluations, is addressed through data-driven methodologies. The dataset comprises physicochemical attributes and quality ratings of red wines. Results reveal individual models with accuracy ranging from 0.816 to 0.873, while the ensemble approach significantly enhances accuracy. The combination of Random Forest and XGBoost achieves an accuracy of 0.885, demonstrating its potential in red wine quality assessment. In conclusion, this study showcases the potential of machine learning in enhancing the classification of red wine quality, offering a more objective and precise alternative to traditional sensory evaluation. The ensemble voting classifier, especially when combining Random Forest and XGBoost, provides a robust solution for this task, improving the accuracy of wine quality assessments.
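The voting idea in the abstract above can be illustrated with a minimal sketch. It assumes soft voting (averaging class probabilities), which the abstract does not confirm; the study may have used hard (majority) voting instead. The function name `soft_vote` and the probability values are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def soft_vote(probabilities: Sequence[Sequence[float]]) -> int:
    """Return the class index with the highest mean probability across models."""
    n_models = len(probabilities)
    n_classes = len(probabilities[0])
    # Average the predicted probability of each class over all models.
    mean_probs = [
        sum(model[c] for model in probabilities) / n_models
        for c in range(n_classes)
    ]
    # Pick the class whose averaged probability is largest.
    return max(range(n_classes), key=lambda c: mean_probs[c])


# Hypothetical class probabilities for one wine sample from two models
# (class 0 = lower quality, class 1 = higher quality). Illustrative only.
rf_probs = [0.40, 0.60]   # stand-in for a Random Forest output
xgb_probs = [0.55, 0.45]  # stand-in for an XGBoost output
prediction = soft_vote([rf_probs, xgb_probs])
```

Here the two models disagree, and the averaged probabilities decide the final class. In practice, a library implementation would normally be used rather than a hand-written vote.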
Maternal Health Risk Detection Using Light Gradient Boosting Machine Approach Noviandy, Teuku Rizky; Nainggolan, Sarah Ika; Raihan, Raihan; Firmansyah, Isra; Idroes, Rinaldi
Infolitika Journal of Data Science Vol. 1 No. 2 (2023): December 2023
Publisher : Heca Sentra Analitika

DOI: 10.60084/ijds.v1i2.123

Abstract

Maternal health risk detection is crucial for reducing morbidity and mortality among pregnant women. In this study, we employed the Light Gradient Boosting Machine (LightGBM) model to identify risk levels using data from rural healthcare facilities. The dataset included key health indicators aligned with the United Nations Sustainable Development Goals. The LightGBM model underwent rigorous optimization through hyperparameter tuning and 10-fold cross-validation. Its predictive performance was benchmarked against other algorithms using accuracy, precision, recall, and F1-score, with feature importance assessed to identify critical risk predictors. The LightGBM model demonstrated the highest performance across all metrics. The results underscore the value of advanced machine learning techniques in public health. Future research directions include expanding the demographic scope, incorporating temporal data, and enhancing model transparency. This study highlights the transformative potential of machine learning in maternal healthcare, providing a foundation for improved risk detection and proactive healthcare interventions.
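The evaluation protocol named in the abstract above (10-fold cross-validation with precision, recall, and F1-score) can be sketched as follows. This is an illustration under assumptions, not the authors' pipeline: the folds are contiguous and unshuffled, the metrics are computed for a single "positive" class (for several risk levels they would be computed per class and averaged), and the helper names are hypothetical.

```python
def k_fold_indices(n_samples, k=10):
    """Yield (train, test) index lists for k contiguous, unshuffled folds."""
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        stop = start + size
        test = list(range(start, stop))
        train = [i for i in range(n_samples) if i < start or i >= stop]
        yield train, test
        start = stop


def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall, and F1 for one class treated as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical usage: 25 samples split into 10 folds.
folds = list(k_fold_indices(25, k=10))
```

Each sample appears in exactly one test fold, so every observation is used once for validation and the per-fold metrics can be averaged.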
A Statistical Clustering Approach: Mapping Population Indicators Through Probabilistic Analysis in Aceh Province, Indonesia Sasmita, Novi Reandy; Khairul, Moh; Sofyan, Hizir; Kruba, Rumaisa; Mardalena, Selvi; Dahlawy, Arriz; Apriliansyah, Feby; Muliadi, Muliadi; Saputra, Dimas Chaerul Ekty; Noviandy, Teuku Rizky; Watsiq Maula, Ahmad
Infolitika Journal of Data Science Vol. 1 No. 2 (2023): December 2023
Publisher : Heca Sentra Analitika

DOI: 10.60084/ijds.v1i2.130

Abstract

Clustering, a statistical analysis technique, can be used to understand population patterns and as a basis for more targeted policy-making. In this ecological study, we explored the population dynamics across 23 districts/cities in Aceh Province. The study used the Aceh Population Development Profile Year 2022 data, focusing on the total population, in-migrants, out-migrants, fertility, and maternal mortality as variables. The study employed descriptive statistics to ascertain the data distribution, followed by the Shapiro-Wilk test to evaluate normality, which is crucial for selecting the appropriate statistical methods. The Spearman test was used to determine correlations between the total population and the other variables as indicators. The Probabilistic Fuzzy C-Means (PFCM) method was used for clustering. To optimize clustering, the silhouette coefficient was calculated using the Euclidean distance and the elbow method, with the results analyzed using R-4.3.2 software. This study's design and methods aim to provide a nuanced understanding of demographic patterns for targeted policy-making and regional development in Aceh, Indonesia. Based on the normality test results, only fertility (p-value = 0.45) is normally distributed, while the other variables are not. The Spearman test showed that only in-migrants (p-value = 1.78 × 10^-6) and out-migrants (p-value = 2.30 × 10^-6) correlated with the Aceh Province population. Using the population variable and the two variables associated with it, the optimal number of clusters was found to be four, where clusters 1, 2, 3, and 4 consist of three, nine, four, and seven districts/cities, respectively.
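The silhouette coefficient with Euclidean distance, mentioned in the abstract above as the criterion for choosing the number of clusters, can be sketched as below. The study itself used R; this Python version is only an illustration. It assumes hard cluster labels (for fuzzy clustering, each district would first be assigned to its highest-membership cluster) and at least two clusters. The function names and any sample points are hypothetical, not the study's data.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two points given as equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points; needs at least 2 clusters."""
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)

    scores = []
    for i, label in enumerate(labels):
        own = [j for j in clusters[label] if j != i]
        if not own:
            # A singleton cluster gets a silhouette of 0 by convention.
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(euclidean(points[i], points[j]) for j in own) / len(own)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(euclidean(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

Scores close to 1 indicate compact, well-separated clusters. Comparing the mean score across candidate cluster counts is one common way to pick the number of clusters.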
Cardiovascular Disease Prediction Using Gradient Boosting Classifier Suhendra, Rivansyah; Husdayanti, Noviana; Suryadi, Suryadi; Juliwardi, Ilham; Sanusi, Sanusi; Ridho, Abdurrahman; Ardiansyah, Muhammad; Murhaban, Murhaban; Ikhsan, Ikhsan
Infolitika Journal of Data Science Vol. 1 No. 2 (2023): December 2023
Publisher : Heca Sentra Analitika

DOI: 10.60084/ijds.v1i2.131

Abstract

Cardiovascular disease (CVD), a group of heart and blood vessel disorders, is a prevalent global health concern that motivates this study's focus on accurate prediction. This study explores the predictive capabilities of the Gradient Boosting Classifier (GBC) in cardiovascular disease across two datasets. Through data collection, preprocessing, and GBC classification, the study achieves an accuracy of 97.63%, underscoring the GBC's effectiveness in CVD detection. The robust performance of the GBC, evidenced by high accuracy, highlights its adaptability to diverse datasets and signifies its potential as a valuable tool for early identification of cardiovascular diseases. These findings provide valuable insights into the application of machine learning methodologies, particularly the GBC, in advancing the accuracy of CVD prediction, with implications for proactive healthcare interventions and improved patient outcomes.
Unraveling Geospatial Determinants: Robust Geographically Weighted Regression Analysis of Maternal Mortality in Indonesia Rahayu, Latifah; Ulfa, Elvitra Mutia; Sasmita, Novi Reandy; Sofyan, Hizir; Kruba, Rumaisa; Mardalena, Selvi; Saputra, Arif
Infolitika Journal of Data Science Vol. 1 No. 2 (2023): December 2023
Publisher : Heca Sentra Analitika

DOI: 10.60084/ijds.v1i2.133

Abstract

Maternal Mortality Rate (MMR) in Indonesia has experienced a concerning annual increase, reaching 4,627 deaths in 2020 compared to 4,221 in 2019. This upward trajectory underscores the urgency of investigating the factors contributing to MMR. Recognizing the spatial heterogeneity and outliers in the data, our study employs the Robust Geographically Weighted Regression (RGWR) method with the Least Absolute Deviation approach. Using secondary data from the 2020 Indonesian Health Profile publication, the research seeks to establish province-specific models for MMR in 2020 and identify the key influencing factors in each region. Standard regression analyses fall short in addressing the complexities present in the data, making the RGWR approach crucial for understanding the nuanced relationships. The chosen RGWR model utilizes the Least Absolute Deviation method and a fixed kernel exponential weighting function. Notably, this model maintains a consistent bandwidth value across all locations, showcasing its robustness. In evaluating the model variations, the exponential fixed kernel weighting function emerges as optimal, with the smallest Akaike Information Criterion (AIC) value of 23.990 and the highest coefficient of determination (R²) value of 93.66%. The outcomes of this research yield 24 distinct models, each tailored to the unique characteristics of every province in Indonesia. This nuanced, location-specific approach is vital for developing effective interventions and policies to address the persistently high MMR. By providing insights into the complex interplay of factors influencing maternal mortality in different regions, the study contributes to the groundwork for targeted and impactful public health initiatives across Indonesia.
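The two ingredients named in the abstract above, a fixed exponential kernel and a Least Absolute Deviation criterion, can be sketched as below. This assumes the common exponential kernel form w = exp(-d / h) with a single bandwidth h shared by all locations, and a weighted sum of absolute residuals as the local objective. The abstract does not give the exact formulas, so both forms and both function names are assumptions.

```python
import math


def exponential_kernel_weights(distances, bandwidth):
    """Fixed exponential kernel: weight decays as exp(-distance / bandwidth)."""
    if bandwidth <= 0:
        raise ValueError("bandwidth must be positive")
    return [math.exp(-d / bandwidth) for d in distances]


def weighted_lad_loss(residuals, weights):
    """Weighted least-absolute-deviation objective: sum of w_i * |r_i|."""
    return sum(w * abs(r) for w, r in zip(weights, residuals))


# Hypothetical distances from one regression point to three observations,
# in arbitrary units; the bandwidth value is illustrative only.
weights = exponential_kernel_weights([0.0, 1.0, 2.0], bandwidth=1.0)
```

Nearby observations get weights close to 1 and distant ones decay toward 0. Absolute residuals, unlike squared ones, limit the influence of outliers on each local fit.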
