Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Journal of Dinda : Data Science, Information Technology, and Data Analytics

Cluster Analysis of Covid-19 in Indonesia Using K-means Method Claudia Larasvaty; Siti Khomsah; Rona Nisa SA
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 3 No 1 (2023): February
Publisher : Research Group of Data Engineering, Faculty of Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20895/dinda.v3i1.822

Abstract

These days technology are rapidly increasing and developing in various fields, especially data storage. The information that has been stored in a database usually called a dataset. Covid-19 is a new type of respiratory disease that attacks the respiratory system with rapid transmission, followed by the increasing number of Covid-19 cases that continues to increase every day in all provinces in Indonesia. This study aims to cluster the spread of Covid-19 in every province in Indonesia by using the data that obtained from the website named kaggle with many data variables. The method used in this research is K-Means. From many variables in the data, for this study only 3 variables were taken, which are: Number of Recovery, Number of Deaths, and Number of total Cases in Covid-19 in Indonesia. These 3 variables then will be applied using the K-Means method and formed 3 provincial groups. By using the clustering method and the K-means algorithm, this research can be carried out to find the characteristics of the distribution in each province in Indonesia by looking at the best clusters.
Comprehensive Lakehouse Data Architecture Model for College Accreditation Nenen Isnaeni; Bambang Purnomosidi Dwi Putranto; Widyastuti Andriyani; Siti Khomsah
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 5 No 1 (2025): February
Publisher : Research Group of Data Engineering, Faculty of Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20895/dinda.v5i1.1759

Abstract

Accreditation is an assessment activity that determines the feasibility of study programs at a university. College accreditation data comes from various sources and includes multiple data types: semi-structured, unstructured, or structured. Over time, the volume of data will continue to grow and develop, so there is a possibility of data redundancy and a long time to collect the data needed for accreditation activities. The solution is integrating data. This research aims to design a data architecture to facilitate the management of university accreditation data using the Lakehouse data architecture model. All data types can be stored on one platform in the Lakehouse data architecture. In this research, the identification, integration, and data transformation process for university accreditation data is carried out. The data used in this research is academic data in which there are with. The study's results provide an overview of the data flow process in the Lakehouse data architecture model to help better manage university accreditation data. This architecture also supports real-time data analysis so that the accreditation process can be carried out more effectively and efficiently. Keywords: accreditation, data analysis, data architecture, data lakehouse, data warehouse