cover
Contact Name
-
Contact Email
-
Phone
-
Journal Mail Official
-
Editorial Address
-
Location
Kota yogyakarta,
Daerah istimewa yogyakarta
INDONESIA
International Journal of Advances in Intelligent Informatics
ISSN : 24426571     EISSN : 25483161     DOI : 10.26555
Core Subject : Science,
International journal of advances in intelligent informatics (IJAIN) e-ISSN: 2442-6571 is a peer reviewed open-access journal published three times a year in English-language, provides scientists and engineers throughout the world for the exchange and dissemination of theoretical and practice-oriented papers dealing with advances in intelligent informatics. All the papers are refereed by two international reviewers, accepted papers will be available on line (free access), and no publication fee for authors.
Arjuna Subject : -
Articles 330 Documents
Detection of code smells using machine learning techniques combined with data-balancing methods Nasraldeen Alnor Adam Khleel; Károly Nehéz
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.981

Abstract

Code smells are prevalent issues in software design that arise when implementation or design principles are violated. These issues manifest as symptoms or anomalies in the source code. Timely identification of code smells plays a crucial role in enhancing software quality and facilitating software maintenance. Previous studies have shown that code smell detection can be accomplished through the utilization of machine learning (ML) methods. However, despite their increasing popularity, research suggests that the suitability of these methods are not always appropriate due to the problem of imbalanced data. Consequently, the effectiveness of ML models may be negatively affected. This study aims to propose a novel method for detecting code smells by employing five ML algorithms, namely decision tree (DT), k-nearest neighbors (K-NN), support vector machine (SVM), XGboost (XGB), and multi-layer perceptron (MLP). Additionally, to tackle the challenge of imbalanced data, the proposed method incorporates the random oversampling technique. Experiments were conducted in this study using four datasets that encompassed code smells, specifically god-class, data-class, long-method, and feature-envy. The experimental outcomes were evaluated and compared using various performance metrics. Upon comparing the outcomes of our models on both the balanced and original datasets, we found that the XGB model achieved the highest accuracy of 100% for detecting the data class and long method on the original datasets. In contrast, the highest accuracy of 100% was obtained for the data class and long method using DT, SVM, and XGB models on the balanced datasets. According to the empirical findings, there is significant promise in using ML techniques for the accurate prediction of code smells.
Systematic literature review of dermoscopic pigmented skin lesions classification using convolutional neural network (CNN) Erwin Setyo Nugroho; Igi Ardiyanto; Hanung Adi Nugroho
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.961

Abstract

The occurrence of pigmented skin lesions (PSL), including melanoma, are rising, and early detection is crucial for reducing mortality. To assist Pigmented skin lesions, including melanoma, are rising, and early detection is crucial in reducing mortality. To aid dermatologists in early detection, computational techniques have been developed. This research conducted a systematic literature review (SLR) to identify research goals, datasets, methodologies, and performance evaluation methods used in categorizing dermoscopic lesions. This review focuses on using convolutional neural networks (CNNs) in analyzing PSL. Based on specific inclusion and exclusion criteria, the review included 54 primary studies published on Scopus and PubMed between 2018 and 2022. The results showed that ResNet and self-developed CNN were used in 22% of the studies, followed by Ensemble at 20% and DenseNet at 9%. Public datasets such as ISIC 2019 were predominantly used, and 85% of the classifiers used were softmax. The findings suggest that the input, architecture, and output/feature modifications can enhance the model's performance, although improving sensitivity in multiclass classification remains a challenge. While there is no specific model approach to solve the problem in this area, we recommend simultaneously modifying the three clusters to improve the model's performance.
Secure medical image watermarking based on reversible data hiding with Arnold's cat map Aulia Arham; Novia Lestari
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.1029

Abstract

The process of restoring medical images to their original form after the extraction process in application watermarking is crucial for ensuring their authenticity. Inaccurate diagnoses can occur due to distortions in medical images from conventional data embedding applications. To address this issue, reversible data hiding (RDH) method has been proposed by several researchers in recent years to embed data in medical images. After the extraction process, images can be restored to their original form with a reversible data-hiding method. In the past few years, several RDH methods have been rapidly developed, which are based on the concept of difference expansion (DE). However, it is crucial to pay attention to the security of the medical image watermarking method, the embedded data with RDH method can be easily modified, accessed, and altered by unauthorized individuals if they know the employed method. This research suggests a new approach to secure the RDH method through the use of Chaotic Map-based Arnold's Cat Map algorithms on the medical images. Data embedding was performed on random medical images using a DE method. Four gray-scale medical image modalities were used to assess the proposed method's efficacy. In our approach, we can incorporate capacity up to 0.62 bpp while maintaining a visual quality up to 41.02 dB according to PSNR and 0.9900 according to SSIM. The results indicated that it can enhance the security of the RDH method while retaining the ability to embed data and preserving the visual appearance of the medical images.
Analysis and review of the possibility of using the generative model as a compression technique in DNA data storage: review and future research agenda Muhammad Rafi Muttaqin; Yeni Herdiyeni; Agus Buono; Karlisa Priandana; Iskandar Zulkarnaen Siregar
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.1063

Abstract

The amount of data in this world is getting higher, and overwriting technology also has severe challenges. Data growth is expected to grow to 175 ZB by 2025. Data storage technology in DNA is an alternative technology with potential in information storage, mainly digital data. One of the stages of storing information on DNA is synthesis. This synthesis process costs very high, so it is necessary to integrate compression techniques for digital data to minimize the costs incurred. One of the models used in compression techniques is the generative model. This paper aims to see if compression using this generative model allows it to be integrated into data storage methods on DNA. To this end, we have conducted a Systematic Literature Review using the PRISMA method in selecting papers. We took the source of the papers from four leading databases and other additional databases. Out of 2440 papers, we finally decided on 34 primary papers for detailed analysis. This systematic literature review (SLR) presents and categorizes based on research questions, namely discussing machine learning methods applied in DNA storage, identifying compression techniques for DNA storage, knowing the role of deep learning in the compression process for DNA storage, knowing how generative models are associated with deep learning, knowing how generative models are applied in the compression process, and knowing latent space can be formed. The study highlights open problems that need to be solved and provides an identified research direction.
Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic Nurzulaikha Khalid; Shuzlina Abdul-Rahman; Wahyu Wibowo; Nur Atiqah Sia Abdullah; Sofianita Mutalib
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.1367

Abstract

In Malaysia, during the early stages of the COVID-19 pandemic, the negative impact on mental health became noticeable. The public's psychological and behavioral responses have risen as the COVID-19 outbreak progresses. A high impression of severity, vulnerability, impact, and fear was the element that influenced higher anxiety. Social media data can be used to track Malaysian sentiments in the COVID-19 era. However, it is often found on the internet in text format with no labels, and manually decoding this data is usually complicated. Furthermore, traditional data-gathering approaches, such as filling out a survey form, may not completely capture the sentiments. This study uses a text mining technique called Latent Dirichlet Allocation (LDA) on social media to discover mental health topics during the COVID-19 pandemic. Then, a model is developed using a hybrid approach, combining both lexicon-based and Naïve Bayes classifier. The accuracy, precision, recall, and F-measures are used to evaluate the sentiment classification. The result shows that the best lexicon-based technique is VADER with 72% accuracy compared to TextBlob with 70% accuracy. These sentiments results allow for a better understanding and handling of the pandemic. The top three topics are identified and further classified into positive and negative comments. In conclusion, the developed model can assist healthcare workers and policymakers in making the right decisions in the upcoming pandemic outbreaks.
Detecting and monitoring the development stages of wild flowers and plants using computer vision: approaches, challenges and opportunities João Videira; Pedro Dinis Gaspar; Vasco Nuno da Gama de Jesus Soares; João Manuel Leitão Pires Caldeira
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.1012

Abstract

Wild flowers and plants play an important role in protecting biodiversity and providing various ecosystem services. However, some of them are endangered or threatened and are entitled to preservation and protection. This study represents a first step to develop a computer vision system and a supporting mobile app for detecting and monitoring the development stages of wild flowers and plants, aiming to contribute to their preservation. It first introduces the related concepts. Then, surveys related work and categorizes existing solutions presenting their key features, strengths, and limitations. The most promising solutions and techniques are identified. Insights on open issues and research directions in the topic are also provided. This paper paves the way to a wider adoption of recent results in computer vision techniques in this field and for the proposal of a mobile application that uses YOLO convolutional neural networks to detect the stages of development of wild flowers and plants.
Fragile watermarking for image authentication using dyadic walsh ordering Prajanto Wahyu Adi; Adi Wibowo; Guruh Aryotejo; Ferda Ernawan
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.1017

Abstract

A digital image is subjected to the most manipulation. This is driven by the easy manipulating process through image editing software which is growing rapidly. These problems can be solved through the watermarking model as an active authentication system for the image. One of the most popular methods is Singular Value Decomposition (SVD) which has good imperceptibility and detection capabilities. Nevertheless, SVD has high complexity and can only utilize one singular matrix S, and ignore two orthogonal matrices. This paper proposes the use of the Walsh matrix with dyadic ordering to generate a new S matrix without the orthogonal matrices. The experimental results showed that the proposed method was able to reduce computational time by 22% and 13% compared to the SVD-based method and similar methods based on the Hadamard matrix respectively. This research can be used as a reference to speed up the computing time of the watermarking methods without compromising the level of imperceptibility and authentication.
Scientific reference style using rule-based machine learning Helen, Afrida; Pradana, Aditya; Afif, Muhammad
International Journal of Advances in Intelligent Informatics Vol 9, No 3 (2023): November 2023
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v9i3.1056

Abstract

Regular Expressions (RegEx) can be employed as a technique for supervised learning to define and search for specific patterns inside text. This work devised a method that utilizes regular expressions to convert the reference style of academic papers into several styles, dependent on the specific needs of the target publication or conference. Our research aimed to detect distinctive patterns of reference styles using RegEx and compare them with a dataset including various reference styles. We gathered a diverse range of reference format categories, encompassing seven distinct classes, from various sources such as academic papers, journals, conference proceedings, and books. Our approach involves employing RegEx to convert one referencing format to another based on the user's specific preferences. The proposed model demonstrated an accuracy of 57.26% for book references and 57.56% for journal references. We used the similarity ratio and Levenshtein distance to evaluate the dataset's performance. The model achieved a 97.8% similarity ratio with a Levenshtein distance of 2. Notably, the APA style for journal references yielded the best results. However, the effectiveness of the extraction function varies depending on the reference style. For APA style, the model showed a 99.97% similarity ratio with a Levenshtein distance of 1. Overall, our proposed model outperforms baseline machine learning models in this task. This study introduces an automated program that utilizes regular expressions to modify academic reference formats. This will enhance the efficiency, precision, and adaptability of academic publishing.
A deep learning model for detection and classification of coffee-leaf diseases using the transfer-learning technique Mansouri, Nabila; Guessmi, Hanene; Alkhalil, Adel
International Journal of Advances in Intelligent Informatics Vol 10, No 3 (2024): August 2024
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v10i3.1573

Abstract

The early treatment  and detection of plant diseases are important processes, as many diseases affecting crops are highly contagious. Recent advancements in deep learning have helped to provide innovative tools that have not only assisted early detection, but also significantly improved the performance and accuracy of Coffee Leaf Disease (CLD) classification and treatment. However, training a deep learning model from scratch can be both resource and time-consuming. To overcome this challenge, the transfer learning technique can take full advantage of pre-trained  models for more general tasks on extensive datasets o ameliorate the performance of a new, related task using few-shot training. This paper proposes a deep learning model, coupled with transfer learning, for CLD detection that aims to provide high accuracy forecasting of diseases that could affect coffee leaves. Our method involves 195 different pre-trained deep learning models, including real-time models like MobileNet and dense ones like EfficientNet and ResNet for the detection of four different diseases. The findings suggest that the EfficientNetB0 model, with transfer learning, has the most relevent accuracy (99.99%), and thus offers an effective solution for coffee leaf diseases classification of. This result could be used to develop applications that help coffee growers to improve the productivity and quality of their crops through early and accurate detection of coffee plant leaf diseases. Such an Artificial Intelligence based application would provide growers with timely control measures, preventing the spread of disease, and minimizing crop damage.
Hybrid machine learning model based on feature decomposition and entropy optimization for higher accuracy flood forecasting Mohd Khairudin, Nazli; Mustapha, Norwati; Mohd Aris, Teh Noranis; Zolkepli, Maslina
International Journal of Advances in Intelligent Informatics Vol 10, No 1 (2024): February 2024
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/ijain.v10i1.1130

Abstract

The advancement of machine learning model has widely been adopted to provide flood forecast. However, the model must deal with the challenges to determine the most important features to be used in in flood forecast with high-dimensional non-linear time series when involving data from various stations. Decomposition of time-series data such as empirical mode decomposition, ensemble empirical mode decomposition and discrete wavelet transform are widely used for optimization of input; however, they have been done for single dimension time-series data which are unable to determine relationships between data in high dimensional time series.  In this study, hybrid machine learning models are developed based on this feature decomposition to forecast the monthly water level using monthly rainfall data. Rainfall data from eight stations in Kelantan River Basin are used in the hybrid model. To effectively select the best rainfall data from the multi-stations that provide higher accuracy, these rainfall data are analyzed with entropy called Mutual Information that measure the uncertainty of random variables from various stations. Mutual Information act as optimization method helps the researcher to select the appropriate features to score higher accuracy of the model. The experimental evaluations proved that the hybrid machine learning model based on the feature decomposition and ranked by Mutual Information can increase the accuracy of water level forecasting.  This outcome will help the authorities in managing the risk of flood and helping people in the evacuation process as an early warning can be assigned and disseminate to the citizen.