Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Proceeding of the Electrical Engineering Computer Science and Informatics

Combination of Genetic Algorithm and Brill Tagger Algorithm for Part of Speech Tagging Bahasa Madura Nindian Puspa Dewi; Joan Santoso; Ubaidi Ubaidi; Eka Rahayu Setyaningsih
Proceeding of the Electrical Engineering Computer Science and Informatics Vol 7, No 2: EECSI 2020
Publisher : IAES Indonesia Section

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/eecsi.v7.2034

Abstract

Part of speech (POS) is commonly known as word types in a sentence such as verbs, adjectives, nouns, and so on. Part of Speech (POS) Tagging is a process of marking the word class or part of speech in every word in a sentence. Part of Speech Tagging has an important role to be used as a basis for research in Natural Language Processing. That is why research on Part of Speech Tagging for Bahasa Madura as an effort to preserve and develop the use of regional languages. In this research, POS Tagging is done using the Brill Tagger Algorithm which is combined with the Genetic Algorithm. Brill Tagger is a POS Tagging Algorithm that has the best level of accuracy when implemented in other languages. Genetic Algorithms used in the contextual learner process with consideration in previous studies can increase the speed of the training process so that it is more efficient. The results of this study are then compared with the results of the previous study so that we can find out suitable algorithms used for the development of text processing in Bahasa Madura. From a series of experiments, the average accuracy obtained by using Brill Tagger is 86.4% with the highest accuracy of 86.7%, while using GA Brill Tagger shows an average accuracy of 86.5% with the highest accuracy of 86.6%. Testing by observing OOV (Out of Vocabulary) achieves an average accuracy of 67.7% for Brill Taggers and 64.6% for GA Brill Taggers. Testing by considering multiple POS with Brill Tagger produces an average accuracy of 73.3% while testing using GA Brill Tagger produces an average accuracy of 90.9%. This shows that the accuracy with GA Brill Tagger is better than Brill Tagger, especially if considering multiple POS. This is because GA Brill Tagger can generate rules for handling the existence of multiple POS more than pure Brill Tagger.Part of speech (POS) is commonly known as word types in a sentence such as verbs, adjectives, nouns, and so on. Part of Speech (POS) Tagging is a process of marking the word class or part of speech in every word in a sentence. Part of Speech Tagging has an important role to be used as a basis for research in Natural Language Processing. That is why research on Part of Speech Tagging for Bahasa Madura as an effort to preserve and develop the use of regional languages. In this research, POS Tagging is done using the Brill Tagger Algorithm which is combined with the Genetic Algorithm. Brill Tagger is a POS Tagging Algorithm that has the best level of accuracy when implemented in other languages. Genetic Algorithms used in the contextual learner process with consideration in previous studies can increase the speed of the training process so that it is more efficient. The results of this study are then compared with the results of the previous study so that we can find out suitable algorithms used for the development of text processing in Bahasa Madura. From a series of experiments, the average accuracy obtained by using Brill Tagger is 86.4% with the highest accuracy of 86.7%, while using GA Brill Tagger shows an average accuracy of 86.5% with the highest accuracy of 86.6%. Testing by observing OOV (Out of Vocabulary) achieves an average accuracy of 67.7% for Brill Taggers and 64.6% for GA Brill Taggers. Testing by considering multiple POS with Brill Tagger produces an average accuracy of 73.3% while testing using GA Brill Tagger produces an average accuracy of 90.9%. This shows that the accuracy with GA Brill Tagger is better than Brill Tagger, especially if considering multiple POS. This is because GA Brill Tagger can generate rules for handling the existence of multiple POS more than pure Brill Tagger
Co-Authors Aditya Dwi Aryanto Adriel Ferdianto Agung Dewa Bagus Soetiono Ahmad Syaifuddin Ali Djamhuri Ananta Tio Putra Andik Jatmiko Anita Guterres Bayu Anggara Putra Budi Irawan Chandra, Francisca H. Christian Nathaniel Purwanto Devi Dwi Purwanto Dewi, Nindian Puspa Dipa, Sasra Edwin Pramana Eka Rahayu Setyaningsih Eko Mulyanto Yuniarno Elizabeth Shirley, Stephanie Endang Setyati Ernest Lim Esther Irawati S. Esther Irawati Setiawan Esther Irawati Setiawan Eunike Kardinata F.X. Ferdinandus Fachrul Kurniawan Febriantoro, Erfan Francisca Chandra Fujisawa, Kimiya Gunawan Gunawan Gunawan Gunawan Gunawan Gunawan Hans Juwiantho Hans Keven Budi Prakoso Hartarto Junaedi Hendrawan Armanto Heppi Siswanto Herman Budianto Imron, Syaiful Indra Maryati Jatmiko, Andik Kristian Indradiarta Gunawan Kristina, Natalia Kurniawan S, Putu Widiarsa Langgeng, Yudo Sembodo Hastoro Leonel Hernandez Luhfita Tirta Lukman Zaman Machfudin, Mohammad Farid Mauridhi Hery Purnomo Miftah Farid Mochamad Hariadi Muhammad Amfahtori Wijarnoko Mustaqin, Farhan Faisal Zainul Nagari, Widean Nikko Riestian Putra Wardoyo Nindian Puspa Dewi Ong, Hansel Santoso Patrick Sutanto Reddy Alexandro Harianto Ricky Sutanto Rossy P. C. Rully Widiastutik Samuel Budi Wardhana Kusuma Saputra, Daniel Gamaliel Setya Ardhi Soetiono, Agung Dewa Bagus Stefanie Hilda Kusumahadi Stella Vania Surya Sumpeno Syabith Umar Ahdan Syaiful Huda Syaiful Imron Tjendika, Patrick Tjwanda Putera Gunawan Tri Septianto Tuesday saka gustaf Ubaidi Ubaidi Ubaidi, Ubaidi Yosi Kristian