Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Proceeding of the Electrical Engineering Computer Science and Informatics

Combination of Genetic Algorithm and Brill Tagger Algorithm for Part of Speech Tagging Bahasa Madura Nindian Puspa Dewi; Joan Santoso; Ubaidi Ubaidi; Eka Rahayu Setyaningsih
Proceeding of the Electrical Engineering Computer Science and Informatics Vol 7, No 2: EECSI 2020
Publisher : IAES Indonesia Section

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/eecsi.v7.2034

Abstract

Part of speech (POS) is commonly known as word types in a sentence such as verbs, adjectives, nouns, and so on. Part of Speech (POS) Tagging is a process of marking the word class or part of speech in every word in a sentence. Part of Speech Tagging has an important role to be used as a basis for research in Natural Language Processing. That is why research on Part of Speech Tagging for Bahasa Madura as an effort to preserve and develop the use of regional languages. In this research, POS Tagging is done using the Brill Tagger Algorithm which is combined with the Genetic Algorithm. Brill Tagger is a POS Tagging Algorithm that has the best level of accuracy when implemented in other languages. Genetic Algorithms used in the contextual learner process with consideration in previous studies can increase the speed of the training process so that it is more efficient. The results of this study are then compared with the results of the previous study so that we can find out suitable algorithms used for the development of text processing in Bahasa Madura. From a series of experiments, the average accuracy obtained by using Brill Tagger is 86.4% with the highest accuracy of 86.7%, while using GA Brill Tagger shows an average accuracy of 86.5% with the highest accuracy of 86.6%. Testing by observing OOV (Out of Vocabulary) achieves an average accuracy of 67.7% for Brill Taggers and 64.6% for GA Brill Taggers. Testing by considering multiple POS with Brill Tagger produces an average accuracy of 73.3% while testing using GA Brill Tagger produces an average accuracy of 90.9%. This shows that the accuracy with GA Brill Tagger is better than Brill Tagger, especially if considering multiple POS. This is because GA Brill Tagger can generate rules for handling the existence of multiple POS more than pure Brill Tagger.Part of speech (POS) is commonly known as word types in a sentence such as verbs, adjectives, nouns, and so on. Part of Speech (POS) Tagging is a process of marking the word class or part of speech in every word in a sentence. Part of Speech Tagging has an important role to be used as a basis for research in Natural Language Processing. That is why research on Part of Speech Tagging for Bahasa Madura as an effort to preserve and develop the use of regional languages. In this research, POS Tagging is done using the Brill Tagger Algorithm which is combined with the Genetic Algorithm. Brill Tagger is a POS Tagging Algorithm that has the best level of accuracy when implemented in other languages. Genetic Algorithms used in the contextual learner process with consideration in previous studies can increase the speed of the training process so that it is more efficient. The results of this study are then compared with the results of the previous study so that we can find out suitable algorithms used for the development of text processing in Bahasa Madura. From a series of experiments, the average accuracy obtained by using Brill Tagger is 86.4% with the highest accuracy of 86.7%, while using GA Brill Tagger shows an average accuracy of 86.5% with the highest accuracy of 86.6%. Testing by observing OOV (Out of Vocabulary) achieves an average accuracy of 67.7% for Brill Taggers and 64.6% for GA Brill Taggers. Testing by considering multiple POS with Brill Tagger produces an average accuracy of 73.3% while testing using GA Brill Tagger produces an average accuracy of 90.9%. This shows that the accuracy with GA Brill Tagger is better than Brill Tagger, especially if considering multiple POS. This is because GA Brill Tagger can generate rules for handling the existence of multiple POS more than pure Brill Tagger
Co-Authors Aditya Dwi Aryanto Adriel Ferdianto Afandi, Acxel Derian Agung Dewa Bagus Soetiono Ahdan, Syabith Umar Ahmad Syaifuddin Ali Djamhuri Ananta Tio Putra Andik Jatmiko Anita Guterres Budi Irawan Cahyadi, Billy Kelvianto Chandra, Francisca H. Christian Nathaniel Purwanto Devi Dwi Purwanto Dewi, Nindian Puspa Dipa, Sasra Edwin Pramana Eka Rahayu Setyaningsih Eko Mulyanto Yuniarno Elizabeth Shirley, Stephanie Endang Setyati Esther Irawati S. Esther Irawati Setiawan Eunike Kardinata F.X. Ferdinandus Fachrul Kurniawan Febriantoro, Erfan Ferdinandus, F. X. Francisca Chandra Fujisawa, Kimiya Gunawan Gunawan Gunawan Gunawan Gunawan Gunawan Halim, Kevin Jonathan Hans Juwiantho Hans Keven Budi Prakoso Harianto, Reddy Alexandro Hartarto Junaedi Hartono, Patrick Hendrawan Armanto Heppi Siswanto Herman Budianto Imron, Syaiful Indra Maryati Irawati Setiawan, Esther Jatmiko, Andik Kristian Indradiarta Gunawan Kristina, Natalia Kurniawan S, Putu Widiarsa Langgeng, Yudo Sembodo Hastoro Leonel Hernandez Lim, Ernest Luhfita Tirta Lukman Zaman Machfudin, Mohammad Farid Mauridhi Hery Purnomo Mochamad Hariadi Muhammad Amfahtori Wijarnoko Mustaqin, Farhan Faisal Zainul Nagari, Widean Nindian Puspa Dewi Ong, Hansel Santoso Putra, Bayu Anggara Putu Widiarsa Kurniawan S Rossy P. C. Rully Widiastutik Samuel Budi Wardhana Kusuma Saputra, Daniel Gamaliel Setiawan, Esther Setya Ardhi Soetiono, Agung Dewa Bagus Stefanie Hilda Kusumahadi Surya Sumpeno Sutanto, Patrick Sutanto, Ricky Syaiful Huda Syaiful Imron Tjendika, Patrick Tjwanda Putera Gunawan Tri Septianto Tuesday saka gustaf Ubaidi Ubaidi Ubaidi, Ubaidi Vania, Stella Vu, Tong Nam Tuan Wardoyo, Nikko Riestian Putra Yosi Kristian