Claim Missing Document
Check
Articles

Found 6 Documents
Search

Counting Bacterial Colony and Reducing noise on Low-Quality Image Using Modified Perona-Malik Diffusion Filter with Sobel Mask Fractional Order Hamdani, Ibnu Mansyur; Anam, Syaiful; Shofianah, Nur; Bustamin, Syamsumar
Jurnal Sisfokom (Sistem Informasi dan Komputer) Vol 12, No 2 (2023): JULI
Publisher : ISB Atma Luhur

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32736/sisfokom.v12i2.1661

Abstract

In the field of microbiology, the counting of bacterial colonies is fundamental and mandatory. This is done to estimate the number of bacterial cells in every 1 milliliter or gram of sample. The counting takes a long time and is tedious, so it requires an accurate and fast counting method. The image quality used is very low and contains noise. Therefore, a preprocessing method is needed to reduce the noise. The Perona-Malik filter method is known to be able to remove noise well. However, it is difficult to determine the appropriate gradient threshold parameter ( ) for each different image. To find the appropriate value of , the original Sobel Mask method and Sobel Mask Fractional-Order are used to estimate the value of . The experimental results show the results of noise reduction using PMD with a value of  from the original Sobel Mask and Sobel Mask Fractional-Order. The results of the accuracy of determining the value of k with the Sobel Mask Fractional-Order (α=1.0) show higher results based on the F-Measure values for samples 1, 2, and 3 respectively 97%, 98%, and 90%.
Hybrid methods to identify ovarian cancer from imbalanced high-dimensional microarray data Sapitri, Ni Kadek Emik; Sa'adah, Umu; Shofianah, Nur
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 14, No 2: April 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v14.i2.pp1173-1182

Abstract

Scientists have used microarray data to identify healthy people and patients with various types of cancer, including ovarian cancer. Ovarian cancer is the most dangerous of all types of cancer that attacks the female reproductive organ. The right combination of methods is needed to identify ovarian cancer from microarray data because that type of data is high-dimensional and imbalanced. This research aims to propose two hybrid methods which are a combination of infinite feature selection (IFS) as features selector with classification and regression tree (CART) as a classifier. IFS can work with two separate scenarios, namely supervised infinite feature selection (SIFS) and unsupervised infinite feature selection (UIFS). This research also compares the performance of the two hybrid methods proposed (SIFS-CART and UIFS-CART) with CART without IFS. The data used is OVA_ovary that has 10937 columns and 1545 rows. The results shows that SIFS-CART achieves maximum performance using 1000 features and UIFS-CART 5000 features. CART without IFS uses all 10935 features. The balanced accuracy results show SIFS-CART can outperform CART without IFS and UIFS-CART. Using less features to get highest balanced accuracy results, SIFS is more effective in performing feature selection on the OVA_ovary dataset compared to UIFS.
Numerical Simulation and Sensitivity Analysis of COVID-19 Transmission Involves Virus in the Environment Azizah, Maratus Sholihatul; Trisilowati; Shofianah, Nur
The Journal of Experimental Life Science Vol. 13 No. 2 (2023)
Publisher : Graduate School, Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21776/ub.jels.2023.013.02.08

Abstract

This paper is aimed to develop a new COVID-19 mathematical model involving viruses in the environment. In this mathematical model, the human population is divided into five subpopulations: susceptible, exposed, infected, hospitalized, and cured individuals. In addition, the model also contains the virus population in the environment. Infection in the model occurs due to interactions between susceptible individual subpopulations and infected individuals and hospitalizations, as well as the spread of the virus in the environment. Based on the results of dynamic analysis, this model has two equilibrium points, the disease-free and endemic equilibrium points. The disease-free equilibrium point always exists, and both equilibrium points are locally asymptotically stable if they meet the Routh-Hurwitz criteria. Model sensitivity analysis was carried out on model parameters that affect the basic reproduction number with the most sensitive parameters are the natural death rate, the recruitment rate, the transmission rate of the virus in the environment, the virus clearance rate, and the rate of wearing PPE (Personal Protective Equipment), as well as the parameter that does not affect the basic reproduction number that is the rate of leaving the recovered population. Numerical simulations performed show results in accordance with the analysis, also from the simulations can be concluded that the increase (or decrease) of the transmission rate of the virus in an environment that has a higher sensitivity index has more significant influences on the basic reproduction number and the number of infected population than the transmission rate of hospitalized individuals. Keywords: Basic Reproduction Number, Dynamics Analysis, Epidemic Models of COVID-19, Local Stability Analysis, Sensitivity Analysis.
Liver Cirrhosis Classification using Extreme Gradient Boosting Classifier and Harris Hawk Optimization as Hyperparameter Tuning Nalasari , Lista Tri; Anam, Syaiful; Shofianah, Nur
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 7 No 2 (2025): April
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v7i2.730

Abstract

This study proposes an early diagnosis model based on Machine Learning for liver cirrhosis classification using the Hepatitis C dataset, which is the leading cause of cirrhosis, from UCI ML. The classification is performed using the XGBoost algorithm because it provides high accuracy and time efficiency based on previous studies. However, these advantages depend on the combination of its hyperparameters set. XGBoost has a large number of hyperparameters, which can be time-consuming for researchers to manually configure. Therefore, this study proposes combining XGBoost with the Harris Hawks Optimization (HHO) algorithm for hyperparameter tuning. HHO is implemented with a hawk population of 40 and maximum iterations set at 25. The proposed XGBoost-HHO model provides an average performance of 99.34% for accuracy, MAR, MAP and 99.33% for Macro F1-score. These performances are achieved with the shortest processing time across 25 experiments compared to other combination models. The performance of the XGBoost-HHO model shows more significant increase in performance and reduction in overfitting compared to the standard XGBoost, SVM, RF models, as well as several other combined models including RF-HHO, SVM-HHO, XGBoost-PSO, and XGBoost-BA. Additionally, based on the feature importance analysis of the XGBoost-HHO algorithm, Alanine Aminotransferase (ALT), Protein, and Gamma-glutamyltransferase (GGT) contribute the most to the classification process, with gain values of 11.21, 9.51, and 7.98, respectively. Overall, the findings of this study show that the XGBoost-HHO algorithm combination provides competitive performance and can serve as an excellent alternative for liver cirrhosis classification in terms of both accuracy and time efficiency.
IDENTIFYING IMPORTANT GENES IN OVARIAN CANCER FROM HIGH-DIMENSIONAL MICROARRAY DATA USING SIFS-CART METHOD Sapitri, Ni Kadek Emik; Sa'adah, Umu; Shofianah, Nur
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 18 No 3 (2024): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol18iss3pp1909-1918

Abstract

Ovarian cancer can be identified from microarray data using machine learning. Many studies only focus on improving the machine learning classification algorithms to achieve higher performance. The purpose of classification is not only to obtain high performance but also to seek new knowledge from the results. This research focuses on both. By using a hybrid Supervised Infinite Feature Selection (SIFS) method with Classification and Regression Tree (CART) or SIFS-CART, this research aims to predict ovarian cancer and identify potential genes for ovarian cancer cases. The data used is the OVA_ovary dataset. SIFS in the best SIFS-CART model reduced 10935 genes in the initial OVA_ovary dataset to 1000 genes. Then, CART was built with these 1000 genes. Based on the balanced accuracy (BA) metric for imbalanced microarray data, the best SIFS-CART model achieves 85.7% BA in training and 83.2% in testing. The optimal CART in the best SIFS-CART model only needs four genes from 1000 selected genes to build it. Those genes are STAR, WT1, PEG3, and ASPN. Based on studies of several pieces of literature in the medical field, it can be concluded that STAR, WT1, and PEG3 play an important role in ovarian cancer cases. However, the relationship between ASPN and ovarian cancer in more detail has not been studied by medical researchers.
Knowledge Discovery from Confusion Matrix of Pruned CART in Imbalanced Microarray Data Ovarian Cancer Classification Sapitri, Ni Kadek Emik; Sa’adah, Umu; Shofianah, Nur
Scientific Journal of Informatics Vol 11, No 1 (2024): February 2024
Publisher : Universitas Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15294/sji.v11i1.50077

Abstract

Purpose: The results of microarray data analysis is important in cancer diagnosis, especially in early stages asymptomatic cancers like ovarian cancer. One of the challenges in analyzing microarray data is the problem of imbalanced data. Unfortunately, research that carries out cancer classification from microarray data often ignores this challenge, so that it doesn’t use appropriate evaluation metrics. It makes the results biased towards the majority class. This study uses a popular evaluation metric “accuracy” and an evaluation metric that is suitable for imbalanced data “balanced accuracy (BA)” to gain information from the confusion matrix regarding accuracy and BA values in case of ovarian cancer classification.Methods: This study use Classification and Regression Tree (CART) as the classifier. CART optimized by pruning. CART optimal is determined from the results of CART complexity analysis and confusion matrix.Results: The confusion matrix and CART interpretations in this research show that CART with low complexity is still able to predict majority class respondents well. However, when none of the data in the minority class was classified correctly, the accuracy value was still quite high, namely 86.97% and 88.03% respectively at the training and testing stages, while the BA value at both stages was only 50%.Novelty: It is very important to ensure that the evaluation metrics used match the characteristics of the data being processed. This research illustrate the difference between accuracy and BA. It concluded that that classification of an imbalanced dataset without doing resampling can use BA as evaluation metric, because based on the results, BA is more fairly to both classes.