Wijaya, Alfarizi
Unknown Affiliation

Published: 1 document
Articles

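Perancangan Sistem Penerjemah Bahasa Isyarat bagi Tunarungu dan Tunawicara Berbasis Pengolahan Citra Digital dan Text-to-Speech (Design of a Sign Language Translation System for Deaf and Speech-Impaired Users Based on Digital Image Processing and Text-to-Speech)
Trianto, Nafil Rizq; Wijaya, Alfarizi; Pardede, Arion; Pandiangan, Daniel; Syahputra, Hermawan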
Teknik: Jurnal Ilmu Teknik dan Informatika, Vol. 6 No. 1 (2026): May
Publisher : LPPM Sekolah Tinggi Ilmu Ekonomi - Studi Ekonomi Modern

DOI: 10.51903/teknik.v6i1.1156

Abstract

Communication is a basic human right, yet a significant gap persists between people who are deaf or speech-impaired and the general public. Many systems have been proposed to translate sign language, but existing models rely mainly on heavy deep learning architectures such as Convolutional Neural Networks (CNNs) or Recurrent Neural Networks (RNNs, including LSTMs). These models often demand high computational power, which introduces latency and limits real-time use on standard devices. This study proposes a lightweight, responsive sign language translation system designed to recognize static alphabet signs (A-Z) and single-character air writing. The system uses MediaPipe for hand tracking; features are extracted by computing the coordinates of the fingertips relative to the wrist, which reduces dependence on raw camera coordinates. Classification is performed by a Support Vector Machine (SVM) with a Radial Basis Function (RBF) kernel, chosen for computational efficiency without sacrificing accuracy. To improve the user experience, the system introduces three novelties: relative feature extraction, an anti-duplication hold mechanism with a 1-second timer that prevents input spamming, and non-blocking audio playback that runs Google Text-to-Speech (gTTS) on a daemon thread so that the webcam feed stays fluid while audio plays. An alternative air-writing mode is also integrated, using geometric heuristics and PyTesseract OCR to read single letters drawn in the air. The results indicate that the proposed system runs quickly and efficiently, helping to bridge the communication barrier with a hardware-friendly approach.
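
The three mechanisms named in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. It assumes the 21-point hand layout that MediaPipe Hands reports (wrist at index 0, fingertips at indices 4, 8, 12, 16 and 20) and uses only the x and y coordinates; the abstract does not say whether depth is used. The names `relative_features`, `HoldGate`, and `speak_async` are hypothetical, and the SVM classifier and the gTTS call are left out: `speak` stands in for whatever blocking function synthesizes and plays the audio.

```python
import threading
import time

WRIST = 0
FINGERTIPS = (4, 8, 12, 16, 20)  # thumb, index, middle, ring, pinky tips


def relative_features(landmarks):
    """Flatten fingertip offsets from the wrist into one feature vector.

    `landmarks` is a sequence of 21 (x, y) pairs (assumed layout, see above).
    Subtracting the wrist makes the vector independent of where the hand
    sits in the camera frame.
    """
    wx, wy = landmarks[WRIST]
    features = []
    for tip in FINGERTIPS:
        x, y = landmarks[tip]
        features.extend((x - wx, y - wy))
    return features


class HoldGate:
    """Accept a label only after it has been held for `hold_seconds`."""

    def __init__(self, hold_seconds=1.0, clock=time.monotonic):
        self.hold_seconds = hold_seconds
        self.clock = clock
        self._label = None
        self._since = 0.0
        self._emitted = False

    def update(self, label):
        """Feed one per-frame prediction; return the label once per hold, else None."""
        now = self.clock()
        if label != self._label:
            # A new label restarts the timer and re-arms the gate.
            self._label, self._since, self._emitted = label, now, False
            return None
        held_long_enough = now - self._since >= self.hold_seconds
        if label is not None and not self._emitted and held_long_enough:
            self._emitted = True  # suppress repeats until the label changes
            return label
        return None


def speak_async(text, speak):
    """Run the blocking `speak(text)` callable on a daemon thread."""
    worker = threading.Thread(target=speak, args=(text,), daemon=True)
    worker.start()
    return worker
```

In a capture loop, each frame's landmarks would pass through `relative_features`, the classifier's predicted letter would go to `HoldGate.update`, and any letter the gate releases would be handed to `speak_async`, so the loop never waits on audio. Because the thread is a daemon, it does not keep the program alive after the main loop exits.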