Claim Missing Document
Check
Articles

Found 13 Documents
Search

Implementation of Web Scraping on Google Search Engine for Text Collection Into Structured 2D List Fahrudin, Tresna Maulana; Riyantoko, Prismahardi Aji; Hindrayani, Kartika Maulida
Telematika Vol 20 No 2 (2023): Edisi Juni 2023
Publisher : Jurusan Informatika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31315/telematika.v20i2.9575

Abstract

Purpose: This research proposes the implementation of web scraping on Google Search Engine to collect text into a structured 2D list.Design/methodology/approach: Implementing two important stages in the process of collecting data through web scraping, namely the HTML parsing process to extract links (URL) on Google Search Engine pages, and HTML parsing process to extract the body text from website pages on each link that has been collected.Findings/result: The inputted query is adjusted to the latest issues and news in Indonesia, for example the President's important figures, the month of Ramadan and Idul Fitri, riots tragedy (stadium) and natural disasters, rising prices of basic commodities, oil and gold, as well as other news. The least number of links obtained was 56 links and the most was 151 links, while the processing time to obtain links for each of the fastest queries was 1 minute 6.3 seconds and the longest was 2 minutes 49.1 seconds. The results of scraping links from these queries were obtained from Wikipedia, Detik, Kompas, the Election Supervisory Body (Bawaslu), CNN Indonesia, the General Election Commission (KPU), Pikiran Rakyat, and others.Originality/value/state of the art: Based on previous research, this study provides an alternative to produce optimal collection of links and text from web scraping results in the form of a 2D list structure. Lists in the Python programming language can store character sequences in the form of strings and can be accessed using index keys, and manipulate text efficiently.
Application of K-Means Clustering for Regency/City Clustering in East Java Based on 2024 Human Development Index Indicators Emilia, Kholidatus; Rahayu, Ayu Sri; Yuliani, Devina Putri; Prasetya, Dwi Arman; Riyantoko, Prismahardi Aji
Jurnal Aplikasi Sains Data Vol. 1 No. 2 (2025): Journal of Data Science Applications.
Publisher : Program Studi Sains Data UPN "Veteran" Jawa Timur

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33005/jasid.v1i2.21

Abstract

This study applies the K-Means clustering algorithm to group 38 regencies and cities in East Java Province based on five Human Development Index (HDI) indicators for the year 2024. These indicators include Life Expectancy (UHH), Expected Years of Schooling (HLS), Mean Years of Schooling (RLS), and Real Expenditure Per Capita (PPK). The aim of this research is to uncover hidden patterns and disparities in regional development, which can be used as a basis for more targeted and data-driven policy interventions.The optimal number of clusters was determined using three evaluation metrics: the Elbow Method, Silhouette Score, and Davies-Bouldin Index. These evaluations collectively identified three distinct clusters. Cluster 0 represents regions with high levels of development across all indicators. Cluster 1 consists of regions with moderate development levels and potential for improvement, while Cluster 2 contains regions with significantly lower values, particularly in education and income metrics.In addition to clustering, a correlation analysis was conducted to examine the relationship between HDI and its supporting indicators. The results show that Mean Years of Schooling (RLS) and Real Expenditure Per Capita (PPK) have the strongest positive correlation with HDI across all clusters. This highlights the key role of education and economic well-being in improving human development. The findings emphasize the importance of clustering analysis in shaping equitable and region-specific development strategies.
Data Augmentation of Sperm Images Using Generative Adversarial Networks (WGAN-GP) Diyasa, I Gede Susrama Mas; Kuswardhani , Hajjar Ayu Cahyani; Idhom, Mohammad; Riyantoko, Prismahardi Aji; Dewi , Deshinta Arrova
Register: Jurnal Ilmiah Teknologi Sistem Informasi Vol 12 No 1 (2026): January (In Progress)
Publisher : Information Systems - Universitas Pesantren Tinggi Darul Ulum

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26594/register.v12i1.5954

Abstract

This study analyzes the use of WGAN-GP for data augmentation in the analysis of sperm morphology. WGAN-GP has been the focus in this study for generating sperm microscopy images, which in turn aims to mitigate the problem of data scarcity in medical imaging. A heterogeneous dataset with mixed object categories was initially employed, leading to an FID score of 134, which in turn reflected a high incidence of mode collapse. For this reason, the dataset was divided into subcategories of Normal, Abnormal, and Non-Sperm identifications, with the scores of the subcategories being 59.19, 74.92, and 83.56, respectively, and showing better balanced model stability. This study's primary contribution is the use of WGAN-GP for the first time for sperm image data augmentation and the generation of more realistic synthetic images. Furthermore, this study illustrates the first understanding of the intricacies of data distribution's complexity and its effect on the model's performance, indicating the possibility of improvement using class-based techniques and sophisticated architectures for the generator. The innovation of this study is the application of WGAN-GP to sperm morphology datasets, improving image quality and the stability of the results, coupled with extensive model performance analysis and providing a further understanding of the field of medical image data augmentation.