Claim Missing Document
Check
Articles

Found 4 Documents
Search
Journal : IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Motion Detection and Face Recognition for CCTV Surveillance System Ade Nurhopipah; Agus Harjoko
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 12, No 2 (2018): July
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.18198

Abstract

Closed Circuit Television (CCTV) is currently used in daily life for a variety purpose. Development of the use of CCTV has transformed from a simple passive surveillance into an integrated intelligent control system. In this research, motion detection and facial recognation in CCTV video is done to be a base for decision making to produce automated, effective and efficient integrated system. This CCTV video processing provides three outputs, a motion detection information, a face detection information and a face identification information. Accumulative Differences Images (ADI) used  for motion detection, and Haar Classifiers Cascade used  for facial segmentation. Feature extraction is done with Speeded-Up Robust Features (SURF) and Principal Component Analysis (PCA). The features was trained by Counter-Propagation Network (CPN). Offline tests performed on 45 CCTV video. The test results obtained a motion detection success rate of 92,655%, a face detection success rate of 76%, and a face detection success rate of 60%. The results concluded that the process of faces identification through CCTV video with natural background have not been able to obtain optimal results. The motion detection process is ideal to be applied to real-time conditions. But in combination with face recognition process, there is a significant delay time.
Dataset Splitting Techniques Comparison For Face Classification on CCTV Images Ade Nurhopipah; Uswatun Hasanah
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 14, No 4 (2020): October
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.58092

Abstract

The performance of classification models in machine learning algorithms is influenced by many factors, one of which is dataset splitting method. To avoid overfitting, it is important to apply a suitable dataset splitting strategy. This study presents comparison of four dataset splitting techniques, namely Random Sub-sampling Validation (RSV), k-Fold Cross Validation (k-FCV), Bootstrap Validation (BV) and Moralis Lima Martin Validation (MLMV). This comparison is done in face classification on CCTV images using Convolutional Neural Network (CNN) algorithm and Support Vector Machine (SVM) algorithm. This study is also applied in two image datasets. The results of the comparison are reviewed by using model accuracy in training set, validation set and test set, also bias and variance of the model. The experiment shows that k-FCV technique has more stable performance and provide high accuracy on training set as well as good generalizations on validation set and test set. Meanwhile, data splitting using MLMV technique has lower performance than the other three techniques since it yields lower accuracy. This technique also shows higher bias and variance values and it builds overfitting models, especially when it is applied on validation set.
Behind the Mask: Detection and Recognition Based-on Deep Learning Ade Nurhopipah; Irfan Rifai Azziz; Jali Suhaman
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 16, No 1 (2022): January
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.72075

Abstract

COVID-19 prevention procedures are executed to support public services and business continuity in a pandemic situation. Manual mask use monitoring is not efficient as it requires resources to monitor people at all times. Therefore, this task can be supported by automated surveillance systems based on Deep Learning. We performed mask detection and face recognition for a real-environment dataset. YOLOV3 as a one-stage detector was implemented to simultaneously generate a bounding box of the face area and class prediction. In face recognition, we compared the performance of three pre-trained models, namely ResNet152V2, InceptionV3, and Xception. The mask detection showed promising results with MAP=0.8960 on training and MAP=0.8957 on validation. We chose the Xception model for face recognition because it has equal quality as ResNet152V2 but has fewer parameters. Xception achieved a minimal loss value in the validation of 0.09157 with perfect accuracy on facial images larger than 100 pixels. Overall the system delivers promising results and can identify faces, even those behind the mask.
Exploring Pre-Trained Model and Language Model for Translating Image to Bahasa Ade Nurhopipah; Jali Suhaman; Anan Widianto
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 17, No 4 (2023): October
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.76389

Abstract

In the last decade, there have been significant developments in Image Caption Generation research to translate images into English descriptions. This task has also been conducted to produce texts in non-English, including Bahasa. However, the references in this study are still limited, so exploration opportunities are open widely. This paper presents comparative research by examining several state-of-the-art Deep Learning algorithms to extract images and generate their descriptions in Bahasa. We extracted images using three pre-trained models, namely InceptionV3, Xception, and EfficientNetV2S. In the language model, we examined four architectures: LSTM, GRU, Bidirectional LSTM, and Bidirectional GRU. The database used was Flickr8k which was translated into Bahasa. Model evaluation was conducted using BLEU and Meteor. The performance results based on the pre-trained model showed that EfficientNetV3S significantly gave the highest score among other models. On the other hand, in the language model, there was only a slight difference in model performance. However, in general, the Bidirectional GRU scored higher. We also found that step size in training affected overfitting. Larger step sizes tended to provide better generalizations. The best model was generated using EfficientNetV3S and Bidirectional GRU with step size=4096, which resulted in an average score of BLEU-1=0,5828 and Meteor=0,4520.