Claim Missing Document
Check
Articles

Found 14 Documents
Search

Attention-Enhanced Convolutional Neural Network for Context Extraction in Andersen's Fairy Tales Daniati, Erna; Wibawa, Aji Prasetya; Irianto, Wahyu Sakti Gunawan; Nafalski, Andrew
JOIV : International Journal on Informatics Visualization Vol 9, No 6 (2025)
Publisher : Society of Visual Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62527/joiv.9.6.4056

Abstract

Event extraction in classic literature and fairy tales remains highly challenging due to their non-linear plot structures, archaic linguistic expressions, and intricate character interactions, while advances in modern NLP still show limitations in capturing subtle narrative cues in historical texts. This study aims to address these gaps by developing an event extraction model tailored to the narrative characteristics of Hans Christian Andersen’s fairy tales. We propose a BERT-enhanced Context-aware Convolutional Neural Network (CNN) that integrates an attention mechanism to overcome the limited contextual range of traditional CNNs. The model leverages BERT’s contextual embeddings enriched with an attention layer to detect event triggers, character relations, and narrative transitions across nonlinear storylines. A hybrid dataset was constructed through system-generated annotations refined via manual verification and combined with AN/an cartoon-based representations for model training and final testing. Experimental results show that the proposed model surpasses both the CNN-only baseline and a rule-based approach, achieving precision of 0.92, recall of 0.89, F1-score of 0.90, and accuracy of 0.91, outperforming the CNN baseline (0.85/0.82/0.83/0.84) and rule-based system (0.78/0.75/0.76/0.77). These findings highlight the effectiveness of context-aware representations for processing literary narratives and demonstrate interdisciplinary relevance to digital humanities and AI-based storytelling, with future extensions envisioned for multilingual settings and genre-specific adaptations.
Optimizing Transformer Model FlanT5 for Multi-Question Answering with Context-Aware Learning Rate Suryanto, Tri Lathif Mardi; Wibawa, Aji Prasetya; Hariyono, Hariyono; Nafalski, Andrew
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 9 No 6 (2025): December 2025
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29207/resti.v9i6.6985

Abstract

This study investigates the performance of FlanT5-based transformer models in handling Multiple-Question Answering (M-QA) tasks, in which multiple semantically related questions must be addressed with a single cohesive answer. Unlike traditional QA systems that focus on one-to-one question-answer pairs, the M-QA approach challenges the model to understand contextual relationships across several questions tied to the same topic. A custom dataset was developed with shared context, grouped questions, and a unified answer to train and evaluate the model. The FlanT5 architecture was fine-tuned using different learning rates (0.0001, 0.0002, 0.0003) to explore the effect of training configurations on model performance. The evaluation was conducted using the ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-Lsum metrics. The results indicate that a learning rate of 0.0003 provides the optimal performance, achieving a ROUGE-Lsum score of 0.7390. This study confirms the capability of instruction-tuned transformers to manage complex summarization scenarios that require contextual coherence. The findings are relevant for real-world applications such as intelligent digital assistants, clinical decision support, and educational chatbots. Furthermore, this study emphasizes the importance of hyperparameter tuning in improving the effectiveness of question-driven summarization systems for scalable and efficient deployment.
Survey and Challenges: Event Extraction of Story Narrative in NLP Approach Daniati, Erna; Wibawa, Aji Prasetya; Irianto, Wahyu Sakti Gunawan; Nafalski, Andrew
Buletin Ilmiah Sarjana Teknik Elektro Vol. 8 No. 1 (2026): February
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.12928/biste.v8i1.15534

Abstract

Event extraction from story narratives remains a challenging yet underexplored area in natural language processing due to narrative complexity including implicit causality long-range dependencies and temporal ambiguity. This study addresses the research question: How have NLP and deep learning approaches been applied to extract events from story narratives and what gaps persist. Following the PRISMA 2020 guidelines we systematically reviewed 12 peer-reviewed studies published between 2017 and 2024. Our analysis reveals growing adoption of transformer-based models such as BERT alongside emerging architectures like DEEIA and PAIE which leverage prompt-based learning and event-specific contextual aggregation. Commonly used datasets include ROCStories and custom narrative corpora though few are standardized. Key challenges involve handling implicit events limited annotated data cross-domain generalization and integration of commonsense reasoning. The main contribution of this review is the first structured synthesis of event extraction techniques specifically for story narratives using a rigorous systematic methodology. We highlight the need for document-level modeling narrative-aware evaluation metrics and low-resource adaptation strategies. This work provides a foundation for future research aiming to bridge narrative understanding with robust event-centric NLP systems.
LSTM-based Multivariate Time-Series Analysis: A Case of Journal Visitors Forecasting Saputra, Anggie Wahyu; Wibawa, Aji Prasetya; Pujianto, Utomo; Putra Utama, Agung Bella; Nafalski, Andrew
ILKOM Jurnal Ilmiah Vol 14, No 1 (2022)
Publisher : Prodi Teknik Informatika FIK Universitas Muslim Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33096/ilkom.v14i1.1106.57-62

Abstract

Forecasting is the process of predicting something in the future based on previous patterns. Forecasting will never be 100% accurate because the future has a problem of uncertainty. However, using the right method can make forecasting have a low error rate value to provide a good forecast for the future. This study aims to determine the effect of increasing the number of hidden layers and neurons on the performance of the long short-term memory (LSTM) forecasting method. LSTM performance measurement is done by root mean square error (RMSE) in various architectural scenarios. The LSTM algorithm is considered capable of handling long-term dependencies on its input and can predict data for a relatively long time. Based on research conducted from all models, the best results were obtained with an RMSE value of 0.699 obtained in model 1 with the number of hidden layers 2 and 64 neurons. Adding the number of hidden layers can significantly affect the RMSE results using neurons 16 and 32 in Model 1.