Claim Missing Document
Check
Articles

Found 18 Documents
Search
Journal : PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND OFFICIAL STATISTICS

Extracting Consumer Opinion on Indonesian E-Commerce: A Rating Evaluation and Lexicon-Based Sentiment Analysis Arbi Setiyawan; Arie Wahyu Wijayanto; He Youshi
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.22

Abstract

E-commerce as a business platform offers abundant advantages in modern life all over the world. Sellers and buyers at online marketplaces may get benefits and advantages from e-commerce. One of the advantages is that e-commerce can be accessed anywhere and anytime. Despite providing advantages, e-commerce also has disadvantages including product quality fraud and data theft. Online marketplaces provide facilities for consumer evaluation, through star rating and consumer reviews. In this paper, we focus on the Business-to-Consumer (B2C) e-commerce type and extract consumer opinion data from a leading online marketplace in Indonesia and use text mining approaches to compare the rating evaluation and sentiment analysis on consumer reviews. With 2,937 records, we investigate the relationship between star rating and lexicon-based sentiment analysis. From the results, we found that most consumers do not hesitantly provide a good evaluation indicated by a 5-star rating and positive sentiment of reviews. A quite polarized rating distribution is found and indicates a straightforward consumer opinion. However, a further examination of the relation between rating and review, we discover inconsistencies in consumer opinion where the good rating may also contain negative reviews. Our result findings provide an insight to build a more integrated consumer opinion indicator in e-commerce and that online marketplace sellers need to look deeper at the detailed reviews rating.
Knowledge Management System in Official Statistics: An Empirical Investigation on Indonesia Population Census Achmad Muchlis Abdi Putra; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.25

Abstract

National statistical offices around the world show a strong interest in producing reliable, objective, and accurate information in compliance with a high level of professional and scientific standards. Such a set of information provided by government agencies is known as the official statistics. To support the potential of knowledge-based business processes and deliver high-quality public services, knowledge management systems (KMS) are undoubtedly required. In this work, we study the impact of embracing KMS in one of the most massive scale statistical census in South East Asia, the 2020 Indonesia Population Census (IPC2020). The regression analysis is utilized in this study where the perceived usefulness is the dependent variable and the perceived ease of use become the independent variable. Our findings reveal that KMS utilization gains a positive influence on the perceived ease of use and usefulness among the stakeholders and organizing personnel. This provides an incentive to enlarge the range of implementation and improve the system and infrastructure capability to better support the knowledge-driven collaboration among stakeholders of the statistical office.
Optimization of Waste Transportation Routes using Multi-objective Non-dominated Sorting Genetic Algorithm II (MNSGA-II) in the Eastern and Southern Regions of Bandung City, Indonesia Natasya Afira; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.27

Abstract

Ensuring high-quality and effective urban waste management has been an important priority to achieve sustainable and environmental-friendly cities and communities mandated by Sustainable Development Goals (SDGs). The massively growing population in urban regions of developing countries, such as Bandung City, Indonesia, leads to the increasing volume of daily goods consumption and households waste production. The waste transportation route is one of the main determining factors for the cost of waste management. In this paper, we introduce the Multi-objective Non-dominated Sorting Genetic Algorithm II (MNSGA-II) to solve the waste transportation route optimization problem in the Eastern and Southern Regions of Bandung City, Indonesia. Compared to the existing traditional evolutionary algorithms, MNSGA-II offers three major important benefits: efficient computational complexity, no requirement of sharing parameters, and a non-elitism mechanism. Algorithm parameters include the number of generations, mutation rate, and crossover rate. Our extensive experiments suggest the best solution resulted in 14 routes with a total distance of 152,63 km. Further, our proposed route optimization is potentially beneficial to support the improvement of the sustainable waste management service system at Bandung City.
Preserving Women Public Restroom Privacy using Convolutional Neural Networks-Based Automatic Gender Detection Desi Kristiyani; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.29

Abstract

Personal safety and privacy have been the significant concerns among women to use and access public restrooms/toilets, especially in developing countries such as Indonesia. Privacy-enhancing designs are unquestionably expected to ensure no men entering the rooms neither intentionally nor accidentally without prior notice. In this paper, we propose a facial recognition approach to ensure women's safety and privacy in public restroom areas using Convolutional Neural Networks (CNN) model as a gender classifier. Our main contributions are as follows: (1) a webcam feed automatic gender detection model using CNN which may further be connected to a security alarm (2) a publicly available gender-annotated image dataset that embraces Indonesian facial recognition samples. Supplementary Indonesian facial examples are taken from a government-affiliated college, Politeknik Statistika STIS students' photo datasets. The experimental results show a promising accuracy of our proposed model up to 95.84%. This study could be beneficial and useful for wider implementation in supporting the safety system of public universities, offices, and government buildings.
Bayesian Network Model to Distinguish COVID-19 for Illness with Similar Symptoms Emir Luthfi; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.36

Abstract

Numerous diseases and illnesses exhibit similar physical and medical symptoms, such as COVID-19 and its similar disguised illness (common cold, flu, and seasonal allergies). In this study, we construct a Bayesian Network model to distinguish such symptom variables in a classification task. The Bayesian Network model has been widely used as a classifier comparable to machine learning models. We develop the model with a scoring-based method and implement it using a hill-climbing algorithm with the Bayesian information criterion (BIC) score approach. Experimental evaluations using publicly available Mayo Clinic based data using this Bayesian Network model that present Directed Acyclic Graph (DAG) which can show the relationship between the similar symptoms and the type of disease with Conditional Probability Table (CPT). This model shows a promising accuracy performance up to 93.14% which is better than the performance of other machine learning classifiers, including the Support Vector Machine (SVM) and the ensemble approaches such as Random Forest (RF), while slightly smaller than that of the neural networks (NN).
Learning Bayesian Network for Rainfall Prediction Modeling in Urban Area using Remote Sensing Satellite Data (Case Study: Jakarta, Indonesia) Salwa Rizqina Putri; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.37

Abstract

Rainfall modeling is one of the most critical factors in agricultural monitoring and statistics, transportation schedules, and urban flood prevention. Weather anomaly during the dry season in urban coastal areas of tropical countries such as Jakarta, Indonesia has become a challenging issue that causes unexpected changes in rain patterns. In this paper, we propose the Bayesian Network (BN) approach to model the probabilistic nature of rain patterns in urban areas and causal relationships among its predictor variables. Rain occurrences are predicted using temperature, relative humidity, mean-sea level (MSL) pressure, cloud cover, and precipitation variables. Data are obtained from the remote sensing sources of the National Oceanic and Atmospheric Administration (NOAA) satellite in Jakarta 2020-2021. We compare both of the score-based, i.e., Hill Climbing (HC), and hybrid structure learning algorithms of Bayesian Network including the techniques of Max-Min Hill Climbing (MMHC), General 2-Phase Restricted Maximization (RSMAX2), and Hybrid-Hybrid Parents & Children (H2PC). Further, we also compare the performance of score-based model (Hill Climbing) under five different popular scorings: Bayesian Information Criterion (BIC), K2, Log-Likelihood, Bayesian Dirichlet Equivalent (BDE), and Akaike Information Criterion (AIC) methods. The main contributions of this study are as follows: (1) insights that the hybrid structure learning algorithms of Bayesian Network models are either superior in performance or at least comparable to its score-based counterparts (2) our proposed best performed Bayesian Network model that is able to predict the rain occurrences in Jakarta with a promising overall accuracy of more than 81 percent.
Revisiting Local Walking Based on Social Network Trust (LWSNT): Friends Recommendation Algorithm in Facebook Social Networks Wahidya Nurkarim; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.124

Abstract

In the last decades, the internet penetration rate and online social network users have grown very fast. Online social network, such as Facebook, is a platform where one can find friends without having to meet face to face. A social network is represented by a large graph because it involves many participants. Hence, it is hard to find potential friends who have the same thoughts and interests. The Local Walking Based on Social Network Trust (LWSNT) algorithm is one of the popular algorithms for social friend recommendation. This study re-examines whether the correlation between attributes gives un-match ranks in different cases (cases with and without correlation). We assess the performance of LWSNT in Facebook networks under the supervised manner by comparing its F-score against similar methods. By using Kendall’s tau correlation, the results show that the correlation of attributes has no significant effect on the order of friend recommendations. In addition, the LWSNT performance is quite inferior against the Common Neighbors algorithm and Jaccard index.
A Land cover change analysis of buffer areas in New Capital City of Nusantara, Indonesia: A cellular automata approach on satellite imageries data Maria Shawna Cinnamon Claire; Salwa Rizqina Putri; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2023 No. 1 (2023): Proceedings of 2023 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2023i1.338

Abstract

The proposed plan to move Indonesia's capital city to the New Capital City of Nusantara in East Kalimantan Province undoubtedly requires careful efforts to ensure food supply for the population. Population migration to the new capital may pose a food security challenge. To address this fundamental issue, one of the most crucial approaches is to establish buffer areas that can support the food needs of the new capital. The currently existing official Area Sampling Frame survey conducted by the government to assess food vulnerability faced several limitations, including weather conditions, field terrain variations, and high cost. In this study, we propose the utilization of remote sensing satellite imagery data in buffer areas to analyze changes and predict future land cover, which can provide valuable data for assessing food availability. We investigate the integration of a Cellular Automata method with the two most popular analytical methods of classical Logistic Regression and data-driven Artificial Neural Networks, known as CA-LR and CA-ANN, to identify and map land cover changes in the new capital buffer zones. Our findings reveal that both combined methods, CA-LR and CA-ANN, yield fairly promising results, with correctness and kappa statistic values exceeding 80%. Prediction results indicate that buffer areas are predominantly covered by trees, while built-up areas are still limited. The flooded vegetation cover, including rice fields, is predicted to decrease by 2024. This should be a matter of concern for stakeholders, considering the construction of the new capital city is still ongoing and the number of migrants is expected to keep rising.
Automatic Detection and Counting of Urban Housing and Settlement in Depok City, Indonesia: An Object-Based Deep Learning Model on Optical Satellite Imageries and Points of Interests Atut Pindarwati; Arie Wahyu Wijayanto
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2023 No. 1 (2023): Proceedings of 2023 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2023i1.349

Abstract

Detecting urban housing and settlements has a substantial position in decision-making problems such as monitoring housing and development, not to mention the widelyrequired urban mapping application. One of the most important goals in the United NationsSustainable Development Goals (SDGs) is to improve urban living conditions globally by2030. We propose an automatic detection of urban housing and settlements on remote sensingsatellite imagery data using object detection-based deep learning using semantic segmentationand the potential availability of remote sensing datasets at high spatial resolutions, Open StreetMap (OSM) geolocation point of interest dataset, and Sentinel-2 optical satellite imagery data.The detection model using Mask Region-based Convolutional Neural Networks (Mask R-CNN) is implemented in Depok City, Indonesia. These regions were chosen because it is thesecond most populous suburb in Indonesia and the tenth most populous globally and, making itchallenging to extract building features from satellite imagery. This model categorizes dense,moderate, and sparse conditions and has a promising result of an average precision of 100%and an F1-score of 67% with evaluation performance metrics only considering pointsassociated with buildings, not building boundaries or the intersection over union (IoU). Themodel performance has been compared to ground check results of field surveys, and itperforms best in sparse conditions. Our findings offer the potential implementation of themodel for fast and accurate monitoring of housing, settlement, and regional planning in urbanareas.
Geospatial Big Data Approaches to Estimate Granular Level Poverty Distribution in East Java, Indonesia using Machine Learning and Deep Learning Regressions Rifqi Ramadhan; Arie Wahyu Wijayanto; Setia Pramana
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2023 No. 1 (2023): Proceedings of 2023 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2023i1.359

Abstract

One of the economic development the focus of the Indonesian government's efforts is for reducing poverty. In Indonesia, collecting poverty data uses the conventional method, the name is National Socio-Economic Survey (SUSENAS) which takes a large cost, time, and effort. To overcome these limitations, there is a need for additional data to provide more detailed poverty data. Recent studies show that the use of geospatial big data could identify poverty at a granular level, with a lower cost and faster update because of their unique and unbiased capacity to identify physical and socioeconomic phenomena. The integrated multi-source satellite imagery data such as the normalized difference vegetation index (NDVI) for detecting rural areas based on vegetation, built-up index (BUI) for identifying urban areas through building distribution, normalized difference water index (NDWI) for land cover detection, day time land surface temperature (LST) for identifying urban regions based on surface temperature, and pollutants such as carbon monoxide (CO), nitrogen dioxide (NO2), and sulfur dioxide (SO2) to evaluate economic activities based on pollution. Additionally, point of interest (POI) density and minimum POI distance are used to measure area accessibility. Therefore, the contribution of this research is to implement the utilization of geospatial big data to estimate the numbers of poverties at a granular level to the 666 sub-districts in East Java Province using machine learning and deep learning regression models. The evaluation results to estimate sub-district level poverty shows that the best model development using Support Vector Regression (SVR) in machine learning was the best root mean squared error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) values of 0.365, 0.293, and 0.032 with R-squared of 0.59 and MLP in deep learning algorithm with 0.444, 0.345, and 0.039 values of RMSE, MAE, and MAPE with R2 0.52. In addition, the results of visual identification revealed that high estimates of lower poverty are typically found in urban areas with high accessibility, and these areas are not spatially deprived areas with limited accessibility.
Co-Authors A.A. Ngurah Gede, Wasudewa Achmad Muchlis Abdi Putra Akhmad Fatikhurrizqi Alfina Nurpiana Alvia Rossa Damayanti Alya Azzahra Andriansyah Muqiit Wardoyo Saputra Annisa Firnanda Arbi Setiyawan Arif Handoyo Marsuhandi Arina Mana Sikana Arini, Rechtiana Putri Arista Ariyani, Marwah Erni Atmaja, Anugerah Surya Atut Pindarwati Ayu Aina Nurkhaliza Az-Zahra, Afifah Bagus Almahenzar Bahar, Vicka Kharisma Bony Parulian Josaphat Chisan, Innas Khoirun Daulay, Nur Ainun Desi Kristiyani Dewi, Ni Kadek Ayu Purnami Sari Dwi Karunia Syaputri Dwi Wahyu Triscowati Emir Luthfi Fauzan Faldy Anggita Fauzan, Fardhi Dzakwan Febrian, M. Yandre Feriyanto, Muhamad Ghina Rofifa Suraya He Youshi Hidayat, Anang Kurnia Hutahaean, Yohana Madame Ika Yuni Wulansari Ikhsanudin, Muhammad Rafi Iman, Qonita Intan Kemala Iskanda, Doddy Aditya Iskanda, Watekhi Izzuddin, Kautsar Hilmi Karmawan, I Putu Agus Kurniawan, Bayu Dwi Kusuma, Arya Candra Luthfi, Emir Maghfiroh, Meilinda F N Maghfiroh, Meilinda F. N. Margareth Dwiyanti Simatupang Maria Angelika H Siallagan Maria Shawna Cinnamon Claire Marsisno, Waris Marsisno, Waris Maulana, Farhan Maulidya, Luthfi Muchlisoh, Siti Muhammad Rezza Ferdiansyah Munifah Zuhra Almasah Nabila Bianca Putri Nasiya Alifah Utami Natasya Afira Natasya Afira Ningrum, Icha Wahyu Kusuma Ningsih, I Kadek Mira Merta Nissa Shahadah Qur'ani Nora Dzulvawan Nurafiza Thamrin Nursiyono, Joko Ade Parwanto, Novia Budi Pasaribu, Ernawati Perani Rosyani Permatasari, Noverlina Putri Pikata Aselnino Pindarwati, Atut Pramana, Setia Prasetyo, Rindang Bangun Pratama, Ahmad R. Prayoga, Suhendra Widi Putri, Salwa Rizqina Putri, Salwa Rizqina Rahmawati, Delvina Nur Raisa Rizky Amelia Rahman Raisa Rizky Amelia Rahman Regita Iswari Puri, Ida Ayu Wayan Renata De La Rosa Manik Ressa Isnaini Arumnisaa Restu Ilahi, Muhammad Ridho, Farid Rifqi Ramadhan Rifqi Ramadhan Robert Kurniawan, Robert Rudianto, Regita Dewanti Sakka, Asriadi Salwa Rizqina Putri Siregar, Tifani Husna Sofa, Wahyuni Andriana Suadaa, Lya Hulliyyatus Sugiarto, Sugiarto Swardanasuta, I Bagus Putu Wahidya Nurkarim Wahyuni, Krismanti Tri Watekhi watin, Rahma Wilantika, Nori Windy Rahmatul Azizah Wulansari, Ika Yuni Yeza, Ardhan Yulia Aryani Yuniarto, Budi Zalukhu, Bill Van Ricardo Zanial Fahmi Firdaus