Jurnal Komputer Teknologi Informasi Sistem Komputer (JUKTISI)
Vol. 5 No. 1 (2026): June 2026

A 5V Big Data Analysis of the Internet Archive for Mapping Web Topic Evolution (1996-2026)

Khairida Octavia Ramadhani (Unknown)
Micael Zecsen Saragih (Unknown)
Syuhada Simbolon (Unknown)
Dwi Nina Putri Anakampun (Unknown)

Article Info

Publish Date
01 Apr 2026

Abstract

The massive collection of digital artifacts in the Internet Archive and Wayback Machine represents a historical encyclopedia of modern civilization. However, the sheer volume of unstructured data makes it difficult to extract meaningful information and calls for advanced computational analytics. This study presents an architectural evaluation of digital heritage stacks using a comprehensive Big Data 5V framework (Volume, Velocity, Variety, Veracity, Value), designed to map the trends of web topic evolution over three decades (1996–2026). The methodology relies on a corpus of 3,000 metadata records, grouped by K-Means clustering (K=10) on a Term Frequency-Inverse Document Frequency (TF-IDF) weighted matrix, followed by Apriori association rules
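The text-grouping step named in the abstract (TF-IDF weighting followed by K-Means) can be illustrated with a minimal sketch. It is not the authors' implementation: the four sample titles, the whitespace tokenizer, the smoothed IDF formula, the choice of the first k rows as starting centroids, and k=2 (the study reports K=10 on 3,000 records) are all assumptions made for illustration.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return (vocab, rows): one L2-normalised TF-IDF vector per document."""
    tokenised = [d.lower().split() for d in docs]  # assumed tokenizer
    vocab = sorted({t for toks in tokenised for t in toks})
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(t for toks in tokenised for t in set(toks))
    # Smoothed IDF (an assumed variant; the source does not specify one).
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1 for t in vocab}
    rows = []
    for toks in tokenised:
        tf = Counter(toks)
        vec = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        rows.append([x / norm for x in vec])
    return vocab, rows

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(rows, k, iters=20):
    """Lloyd's algorithm; the first k rows seed the centroids (assumption)."""
    centroids = [list(r) for r in rows[:k]]
    labels = [0] * len(rows)
    for _ in range(iters):
        # Assignment step: attach each row to its nearest centroid.
        labels = [min(range(k), key=lambda c: sq_dist(r, centroids[c]))
                  for r in rows]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members)
                                for col in zip(*members)]
    return labels

# Hypothetical metadata titles, not taken from the study's corpus.
docs = [
    "music live concert recording",
    "software source code archive",
    "live concert audio music",
    "open source software code",
]
vocab, rows = tfidf(docs)
labels = kmeans(rows, k=2)
print(labels)
```

In this toy run the two music titles receive one cluster label and the two software titles the other. On a real corpus the result depends heavily on tokenization, stop-word handling, and centroid initialisation, none of which the abstract specifies.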

Copyrights © 2026

Journal Info

Abbrev

juktisi

Publisher

Subject

Computer Science & IT; Control & Systems Engineering; Decision Sciences, Operations Research & Management; Education; Engineering

Description

Focus and scope: JUKTISI (Jurnal Komputer Teknologi Informasi Sistem Komputer) was first published in 2022 and is intended as a medium for scholarly study, presenting ideas set down in journal form. JUKTISI, issued by Lembaga Kursus dan Pelatihan Karya Prima, is published 3 (three) times ...