Claim Missing Document
Check
Articles

Found 4 Documents
Search

From Static to Contextual: A Survey of Embedding Advances in NLP Alkaabi, Hussein; Jasim, Ali Kadhim; Darroudi, Ali
PERFECT: Journal of Smart Algorithms Vol. 2 No. 2 (2025): PERFECT: Journal of Smart Algorithms, Article Research July 2025
Publisher : LEMBAGA KAJIAN PEMBANGUNAN PERTANIAN DAN LINGKUNGAN (LKPPL)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62671/perfect.v2i2.77

Abstract

Embedding techniques have been a cornerstone of Natural Language Processing (NLP), enabling machines to represent textual data in a form that captures semantic and syntactic relationships. Over the years, the field has witnessed a significant evolution—from static word embeddings, such as Word2Vec and GloVe, which represent words as fixed vectors, to dynamic, contextualized embeddings like BERT and GPT, which generate word representations based on their surrounding context. This survey provides a comprehensive overview of embedding techniques, tracing their development from early methods to state-of-the-art approaches. We discuss the strengths and limitations of each paradigm, their applications across various NLP tasks, and the challenges they address, such as polysemy and out-of-vocabulary words. Furthermore, we highlight emerging trends, including multimodal embeddings, domain-specific representations, and efforts to mitigate embedding bias. By synthesizing the advancements in this rapidly evolving field, this paper aims to serve as a valuable resource for researchers and practitioners while identifying open challenges and future directions for embedding research in NLP.
Explainable AI for Medical Imaging: A Taxonomy Based on Clinical Task Requirements Kamber, Ali Nadhim; Alkaabi, Hussein; Al-Rekabi, Mohammed; Jasim, Ali Kadhim
PERFECT: Journal of Smart Algorithms Vol. 2 No. 2 (2025): PERFECT: Journal of Smart Algorithms, Article Research July 2025
Publisher : LEMBAGA KAJIAN PEMBANGUNAN PERTANIAN DAN LINGKUNGAN (LKPPL)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62671/perfect.v2i2.115

Abstract

Explainable Artificial Intelligence (XAI) has emerged as a critical enabler for deploying AI-driven medical imaging systems where transparency, trust, and accountability are paramount. However, most current taxonomies of XAI methods categorize techniques based on algorithmic families (e.g., saliency maps, attribution methods), which often fail to reflect the practical requirements of clinical tasks. This paper proposes a novel task-centric taxonomy of XAI in medical imaging that aligns explanation techniques with four key clinical tasks: classification, detection, segmentation, and prognostic assessment. For each task, we analyze how different XAI methods enhance model interpretability, their suitability for clinical decision-making, and their limitations in real-world applications. Our taxonomy aims to provide a practical framework for researchers and practitioners to select appropriate XAI strategies tailored to the specific demands of medical imaging workflows. Furthermore, we highlight the current gaps in task-specific explainability and propose future research directions towards clinically meaningful, task-driven XAI solutions. This work serves as a step towards bridging the gap between technical XAI developments and the functional needs of clinical practice.
Arabic NLP: A Survey of Pre-Processing and Representation Techniques Alkaabi, Hussein Ala'a; Jasim, Ali kadhim; Darroudi, Ali
Journal of Computer Science, Information Technology and Telecommunication Engineering Vol 6, No 2 (2025)
Publisher : Universitas Muhammadiyah Sumatera Utara, Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30596/jcositte.v6i2.25562

Abstract

The rapid growth of Arabic Natural Language Processing (NLP) has underscored the vital role of upstream tasks that prepare raw text for modeling. This review systematically examines the key steps in Arabic text pre-processing and representation learning, highlighting their impact on downstream NLP performance. We discuss the unique linguistic challenges posed by Arabic, such as rich morphology, orthographic ambiguity, dialectal diversity, and code-switching phenomena. The survey covers traditional rule-based and statistical methods and modern deep learning approaches, including subword tokenization and contextual embeddings. Special attention is given to how pre-trained language models like AraBERT and MARBERT interact with pre-processing pipelines, often redefining the balance between explicit text normalization and implicit representation learning. Furthermore, we analyze existing tools, benchmarks, and evaluation metrics, and identify persistent gaps such as dialect adaptation and Romanized Arabic (Arabizi) processing. By mapping current practices and open issues, this review aims to guide researchers and practitioners towards more robust, adaptive, and linguistically-aware Arabic NLP pipelines, ensuring that the data fed into models is as clean, consistent, and semantically meaningful as possible.
A Multi-Feature Fusion Framework for Sentiment Analysis Based on Textual and Affective Signals Alkaabi, Hussein Ala'a; jasim, ali kadhim
International Journal of Artificial Intelligence Research Vol 9, No 2 (2025): December
Publisher : Universitas Dharma Wacana

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29099/ijair.v9i2.1634

Abstract

Sentiment analysis of social media content, particularly on platforms like Twitter, presents significant challenges due to the informal, brief, and context-dependent nature of user-generated text. Traditional lexicon-based and shallow machine learning approaches often fail to capture nuanced sentiment expressions, especially in the presence of slang, abbreviations, sarcasm, and emotionally charged language. To address these limitations, this paper proposes a novel tri-stream feature fusion framework that integrates contextual semantics, sequential dependencies, and affective signals for robust sentiment classification. The framework employs RoBERTa to extract rich contextual embeddings, Bidirectional Long Short-Term Memory (BiLSTM) networks to capture word-order and temporal patterns, and lexicon-based emotion vectors to enhance emotional cue detection. These heterogeneous features are concatenated at the representation level to form a comprehensive feature space, which is subsequently used to predict sentiment polarity via a fully connected neural network classifier. Extensive experiments conducted on the Sentiment140 dataset, comprising 1.6 million labeled tweets, demonstrate that the proposed approach significantly outperforms conventional baselines and recent hybrid models, achieving an accuracy of 92.1%. Additionally, ablation studies and misclassification analyses reveal each feature stream’s complementary contributions and highlight challenges in detecting sarcasm and implicit sentiment. Future work will integrate sarcasm-aware components and external knowledge sources to further enhance model interpretability and robustness.