Claim Missing Document
Check
Articles

Found 3 Documents
Search

Stereo matching algorithm using deep learning and edge-preserving filter for machine vision Abd Gani, Shamsul Fakhar; Miskon, Muhammad Fahmi; Hamzah, Rostam Affendi; Hamid, Mohd Saad; Kadmin, Ahmad Fauzan; Herman, Adi Irwan
Bulletin of Electrical Engineering and Informatics Vol 13, No 3: June 2024
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/eei.v13i3.5708

Abstract

Machine vision research began with a single-camera system, but these systems had various limitations from having just one point-of-view of the environment and no depth information, therefore stereo cameras were invented. This paper proposes a hybrid method of a stereo matching algorithm with the goal of generating an accurate disparity map critical for applications such as 3D surface reconstruction and robot navigation to name a few. Convolutional neural network (CNN) is utilised to generate the matching cost, which is then input into cost aggregation to increase accuracy with the help of a bilateral filter (BF). Winner-take-all (WTA) is used to generate the preliminary disparity map. An edge-preserving filter (EPF) is applied to that output based on a transform that defines an isometry between curves on the 2D image manifold in 5D and the real line to eliminate these artefacts. The transform warps the input signal adaptively to allow linear 1D filtering. Due to the filter's resistance to high contrast and brightness, it is effective in refining and removing noise from the output image. Based on experimental research employing a Middlebury standard validation benchmark, this approach gives high accuracy with an average non-occluded error of 6.71% comparable to other published methods.
Refining disparity maps using deep learning and edge-aware smoothing filter Abd Gani, Shamsul Fakhar; Miskon, Muhammad Fahmi; Hamzah, Rostam Affendi; Hamid, Mohd Saad; Kadmin, Ahmad Fauzan; Herman, Adi Irwan
Bulletin of Electrical Engineering and Informatics Vol 13, No 3: June 2024
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/eei.v13i3.6480

Abstract

Stereo matching algorithm is crucial for applications that rely on three-dimensional (3D) surface reconstruction, producing a disparity map that contains depth information by computing the disparity values between corresponding points from a stereo image pair. In order to yield desirable results, the proposed stereo matching algorithm must possess a high degree of resilience against radiometric variation and edge inconsistencies. In this article convolutional neural network (CNN) is employed in the first stage to generate the raw matching cost, which is subsequently filtered with a bilateral filter (BF) and applied with cross-based cost aggregation (CBCA) during the cost aggregation stage to enhance precision. Winner-take-all (WTA) strategy is implemented to normalise the disparity map values. Finally, the resulting output is subjected to an edge-aware smoothing filter (EASF) to reduce the noise. Due to its resistance to high contrast and brightness, the filter is found to be effective in refining and eliminating noise from the output image. Despite discontinuities like adiron's lost cup handle or artl's shattered rods, this approach, based on experimental research utilizing a Middlebury standard validation benchmark, yields a high level of accuracy, with an average non-occluded error of 6.79%, comparable to other published methods.
Improved Hybrid GoogLeNet-Based Deep Learning Optimization for Standardized Straw Mushroom Quality Classification in Indonesia Priyatna, Bayu; Abdurahman, Titik Khawa; Miskon, Muhammad Fahmi; Hananto, April Lia; Hananto, Agustia Tia; Rahman, Aviv Yuniar
Journal of Applied Data Sciences Vol 7, No 2: May 2026
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v7i2.1206

Abstract

Deep learning plays a crucial role in modern computer vision due to its ability to automatically extract hierarchical features from large-scale image data. Among various architectures, Convolutional Neural Networks (CNNs) have been extensively utilized for image pattern interpretation, including in agricultural product inspection. Straw mushrooms (Volvariella volvacea) are important agro-industrial commodities in Indonesia; however, their quality assessment still relies on subjective manual evaluation based on the Indonesian National Standard (SNI:01-6945-2003), leading to inconsistency in grading results. To address this limitation, this research proposes an Improved Hybrid GoogLeNet model integrated with a YOLO-based detection framework and hybrid preprocessing to enhance feature clarity and classification robustness. The system is capable of conducting object detection, 3-class morphological quality classification (Pure White, Oval, and Black Spot/Defect), and automatic diameter measurement using calibrated pixel-to-centimeter conversion. Performance evaluation is carried out by benchmarking the proposed model against several popular deep learning architectures including YOLOv5, LeNet, AlexNet, VGGNet, and ResNet. Experimental results demonstrate that the Improved Hybrid GoogLeNet achieves the highest performance with precision of 97.99%, recall of 96.07%, and F1-score of 96.98%, along with low misclassification rates across all classes. These results indicate that the proposed method provides accurate, reliable, and efficient quality assessment that supports standardized automated grading in industrial applications. Therefore, this study contributes to the advancement of intelligent computer vision solutions for digital transformation in the Indonesian mushroom agro-industry.