Claim Missing Document
Check
Articles

Found 3 Documents
Search

Tiny datablock in saving Hadoop distributed file system wasted memory Al-Masadeh, Mohammad Bahjat; Azmi, Mohad Sanusi; Syed Ahmad, Sharifah Sakinah
International Journal of Electrical and Computer Engineering (IJECE) Vol 13, No 2: April 2023
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijece.v13i2.pp1757-1772

Abstract

Hadoop distributed file system (HDFS) is the file system whereby Hadoop is use it to store all the upcoming data inside it. Since it been declared, HDFS is consuming a huge memory amount in order to serve a normal dataset. Nonetheless, the current file saving mechanism in HDFS save only one file in one datablock. Thus, a file with just 5 Mb in size will take up the whole datablock capacity causing the rest of the memory unavailable for other upcoming files, and this is considered a huge waste of memory in serving a normal size dataset. This paper proposed a method called tiny datablock-HDFS (TD-HDFS) to increase the usability of HDFS memory and increase the file hosting capabilities by reducing the datablock size to the minimum capacity, and then merging all the related datablocks into one master datablock. This master datablock consists of tiny virtual datablocks that contain the related small files together; will exploit the full memory of the master datablock. The result of this study is a running HDFS with a minimum amount of wasted memory with the same read/write data performance. The results were examined through a comparison between the standard HDFS file hosting and the proposed solution of this study.
How University Students Regulate AI Prompting Behavior Across Varying Levels of Learning Difficulty Purbohadi, Dwijoko; Syed Ahmad, Sharifah Sakinah
Proceedings International Conference on Sustainable Innovation (ICoSI) Vol. 6 No. 1 (2026)
Publisher : Universitas Muhammadiyah Yogyakarta

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.18196/icosi.v6i1.154

Abstract

The integration of generative artificial intelligence into higher education has significantly impacted how students engage in learning activities, especially in programming courses. This study investigates how students regulate their AI interaction behavior across different levels of learning difficulty. We collected data from 183 undergraduate students enrolled in the Java Programming course, distributed across six classes. This study analyzes the frequency of prompting, behavioral change during the difficult and easy learning phases, and the primary purpose of chatbot use. The findings show that most students actively adapt their interactions with generative AI based on their perception of the level of learning difficulty. 78 percent of students increase the frequency of prompting when facing challenging tasks, while 69 percent reduce interaction after reaching conceptual understanding. Code generation is identified as the primary purpose of using chatbots, followed by debugging and conceptual clarification. This pattern shows that generative AI serves as a flexible cognitive support tool that students strategically use to manage learning challenges. This research indicates that adaptive prompting behavior represents a new form of self regulated learning in AI supported education environments. Instead of relying on chatbots continuously, students use them selectively to support problem solving and conceptual understanding. This study contributes to the growing body of research on artificial intelligence in education by highlighting behavioral adaptations in students’ interactions with generative AI systems and their implications for programming education in higher education
Exploring word embeddings and clustering algorithms for user reviews Sidek, Zuleaizal; Syed Ahmad, Sharifah Sakinah
Indonesian Journal of Electrical Engineering and Computer Science Vol 41, No 3: March 2026
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v41.i3.pp1017-1024

Abstract

The rapid advancement of information technology has led to a significant surge in the volume of unstructured textual data. This has posed a major problem in terms of analyzing, organizing, and automatically clustering text for research purposes, which is crucial for extracting valuable insights. The process of manually clustering the unstructured data, such as customer reviews on the Internet, which capture the opinions of customers regarding products, services, and social events, requires significant financial resources, manpower, and time. Most of the studies are directed towards the analysis of sentiment in user reviews. In order to address the issues effectively, automated text clustering could assist in categorizing reviews into various themes, thereby simplifying the analysis process. Therefore, in this paper, we present and compare the result of experiment the combination of five text clustering techniques, namely K-means, fuzzy C-mean (FCM), non-negative matrix factorization (NMF), latent dirichlet allocation (LDA), and latent semantic analysis (LSA) with different embedding techniques, namely term frequency–inverse document frequency (TF-IDF), Word2Vec, and global vectors (GloVe). The experiments revealed that LDA is a reliable algorithm as it consistently produces good results across three-word embeddings. The highest Silhouette score recorded in the experiments was 0.66 using LDA and Word2Vec as word embedding. Simultaneously, the application of LSA in conjunction with Word2Vec yields superior outcomes, as evidenced by a Silhouette score of 0.65.