Articles

Ontology-based Why-Question Analysis Using Lexico-Syntactic Patterns
A.A.I.N. Eka Karyawati; Edi Winarko; Azhari Azhari; Agus Harjoko
International Journal of Electrical and Computer Engineering (IJECE) Vol 5, No 2: April 2015
Publisher : Institute of Advanced Engineering and Science

DOI: 10.11591/ijece.v5i2.pp318-332

Abstract

This research develops a method for analyzing why-questions. Previous work on why-question analysis has mostly relied on morphological and syntactic approaches without considering expected answer types, and has rarely used a domain ontology to capture the semantics or conceptualization of the content. As a result, semantic mismatches occur and lead to inappropriate answers. The proposed method considers the expected answer types and uses a domain ontology. It adapts the simple bag-of-words model by using semantic entities (i.e., concepts/entities and relations) instead of words to represent a query. The method expands the question with additional semantic entities obtained by executing the SPARQL query constructed from the why-question over the domain ontology. The major contribution of this research is an ontology-based why-question analysis method that considers the expected answer types. Experiments were conducted to evaluate each phase of the proposed method. The results show good performance on all measures used (i.e., precision, recall, undergeneration, and overgeneration). Furthermore, a comparison against two keyword-based baseline methods (the term-based and the phrase-based methods) shows that the proposed method achieves better MRR and P@10 values.
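The two ranking measures named in this abstract, MRR and P@10, can be illustrated with a minimal sketch. The function names, answer IDs, and toy data below are hypothetical and are not taken from the paper; the definitions follow the usual information-retrieval conventions.

```python
from typing import List, Sequence, Set


def reciprocal_rank(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Return 1/rank of the first relevant item, or 0.0 if none is retrieved."""
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs: List[Sequence[str]], gold: List[Set[str]]) -> float:
    """Average the reciprocal rank over all questions."""
    return sum(reciprocal_rank(r, g) for r, g in zip(runs, gold)) / len(runs)


def precision_at_k(ranked: Sequence[str], relevant: Set[str], k: int = 10) -> float:
    """Fraction of the top-k positions that hold a relevant item."""
    return sum(1 for item in ranked[:k] if item in relevant) / k


# Hypothetical toy data: two questions, each with a ranked list of answer IDs.
runs = [["a3", "a1", "a7"], ["b2", "b9", "b4"]]
gold = [{"a1"}, {"b2", "b4"}]

mrr = mean_reciprocal_rank(runs, gold)           # (1/2 + 1/1) / 2
p_at_3 = precision_at_k(runs[1], gold[1], k=3)   # 2 relevant in the top 3
```

P@10 is the same call with `k=10`; the toy lists are shorter only to keep the example readable.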
Nearest Neighbor-Based Indonesian G2P Conversion
Suyanto Suyanto; Agus Harjoko
TELKOMNIKA (Telecommunication Computing Electronics and Control) Vol 12, No 2: June 2014
Publisher : Universitas Ahmad Dahlan

DOI: 10.12928/telkomnika.v12i2.57

Abstract

Grapheme-to-phoneme (G2P) conversion, also known as letter-to-sound conversion, is an important module in both speech synthesis and speech recognition. G2P methods give varying accuracies for different languages even though they are designed to be language independent. This paper discusses a new model for Indonesian G2P based on the pseudo nearest neighbor rule (PNNR). The model introduces a partial orthogonal binary code for graphemes, contextual weighting, and neighborhood weighting. Testing on 9,604 unseen words shows that the model parameters are easy to tune to reach high accuracy. Testing on 123 sentences containing homographs shows that the model can disambiguate homographs if it uses a long graphemic context. Compared to an information gain tree, PNNR gives a slightly higher phoneme error rate, but it can disambiguate homographs.
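As a rough illustration of the pseudo nearest neighbor rule mentioned in this abstract, the sketch below implements the generic rule as it is commonly described: for each class, the k nearest training samples are found, their distances are summed with weights 1/rank, and the class with the smallest weighted sum wins. It does not reproduce the paper's grapheme coding, contextual weighting, or neighborhood weighting; the feature vectors, labels, and function names are hypothetical.

```python
import math
from collections import defaultdict
from typing import Dict, Hashable, List, Sequence, Tuple

Vector = Sequence[float]


def euclidean(a: Vector, b: Vector) -> float:
    """Plain Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pnnr_classify(train: List[Tuple[Vector, Hashable]], query: Vector, k: int = 3) -> Hashable:
    """Pick the class whose k nearest samples have the smallest rank-weighted distance sum."""
    by_class: Dict[Hashable, List[float]] = defaultdict(list)
    for vec, label in train:
        by_class[label].append(euclidean(vec, query))

    best_label, best_score = None, math.inf
    for label, dists in by_class.items():
        nearest = sorted(dists)[:k]
        # The i-th nearest neighbor (1-based) contributes distance / i.
        # Note: a class with fewer than k samples sums fewer terms, which favors it.
        score = sum(d / rank for rank, d in enumerate(nearest, start=1))
        if score < best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical encoded contexts for one grapheme with two candidate phonemes.
train = [
    ([0.0, 0.0], "e"),
    ([0.0, 1.0], "e"),
    ([1.0, 1.0], "schwa"),
    ([1.0, 0.9], "schwa"),
]

label_a = pnnr_classify(train, [0.1, 0.2], k=2)
label_b = pnnr_classify(train, [0.9, 1.0], k=2)
```

In a G2P setting each training vector would encode a grapheme together with its surrounding context, and the label would be the phoneme to emit.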
Lampung Script Recognition Using Convolutional Neural Network
Panji Bintoro; Agus Harjoko
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 16, No 1 (2022): January
Publisher : IndoCEISS in collaboration with Universitas Gadjah Mada, Indonesia.

DOI: 10.22146/ijccs.70041

Abstract

The Lampung script is often used to write words in the Lampung language, which is used by native Lampung people and by people learning the language. The script is difficult to learn because there are many combinations of parent characters and subletters. A convolutional neural network (CNN) is an object recognition method with specific layers, namely convolution layers and pooling layers, that enable effective feature learning. In handwriting recognition tasks such as MNIST character recognition, CNNs perform better than other methods. Given these advantages, a CNN with the DenseNet architecture was chosen as the best architecture for recognizing each Lampung script character. The study has two main processes: preprocessing and recognition. It succeeded in applying a CNN that can recognize Lampung script. The dataset is divided into 4 groups of characters that have different sounds. First, the parent character data reach 98% accuracy. Second, the parent letter data with the above letters reach 98% accuracy. Third, the parent character data with the sub-letters on the side reach 98% accuracy. Fourth, the parent letter data with the lower letters reach 97% accuracy.
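The convolution and pooling layers mentioned in this abstract can be sketched in a few lines of plain Python. This is only an illustration of the two operations, not the DenseNet architecture used in the study; the image, the kernel weights, and the function names are hypothetical, and a trained network would learn its kernels rather than use a hand-picked one.

```python
from typing import List

Matrix = List[List[float]]


def conv2d_valid(image: Matrix, kernel: Matrix) -> Matrix:
    """Slide the kernel over the image (no padding, stride 1) and sum the products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j] for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def max_pool2d(feature: Matrix, size: int = 2) -> Matrix:
    """Keep the maximum of each non-overlapping size x size window."""
    return [
        [
            max(feature[r + i][c + j] for i in range(size) for j in range(size))
            for c in range(0, len(feature[0]) - size + 1, size)
        ]
        for r in range(0, len(feature) - size + 1, size)
    ]


# Hypothetical 5x5 binary "stroke" image with a vertical line in column 2.
image = [[0, 0, 1, 0, 0] for _ in range(5)]
# Hand-picked kernel that responds to a dark-to-bright step from left to right.
kernel = [[-1, 1], [-1, 1]]

feature_map = conv2d_valid(image, kernel)  # 4x4 map: each row is [0, 2, -2, 0]
pooled = max_pool2d(feature_map)           # 2x2 map: [[2, 0], [2, 0]]
```

The pooled map keeps the strong response at the left edge of the stroke while halving the resolution, which is the kind of compact feature a later classification layer would consume.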
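Guava Quality Classification Method Using KNN Based on Color and Texture Features (original title: METODE KLASIFIKASI MUTU JAMBU BIJI MENGGUNAKAN KNN BERDASARKAN FITUR WARNA DAN TEKSTUR)
Taftyani Yusuf Prahudaya; Agus Harjoko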
Jurnal Teknosains Vol 6, No 2 (2017): June
Publisher : Universitas Gadjah Mada

DOI: 10.22146/teknosains.26972

Abstract

Guava (Psidium guajava L.) is a fruit with many health benefits. It also has commercial value in Indonesia and a large market share, which indicates that guava is widely consumed. Currently the sorting process is still done manually, which has many shortcomings: manual classification gives less accurate and inconsistent results because of human carelessness. Grading is essential in the marketing sector. Improper grading is potentially detrimental to farmers because fruit of all qualities is priced the same. A consistent classification system is therefore needed. The system uses image processing to extract the color and texture features of guava, and the K-Nearest Neighbor (KNN) method is used for quality classification. The system classifies guava into four quality classes: super class, class A, class B, and external quality. The KNN takes 7 extracted features as input, namely the average RGB (red, green, and blue) values, the total defect area, and the GLCM values (entropy, homogeneity, and contrast), and outputs one of the 4 quality classes. Test results show that the method is able to classify guava quality. The highest accuracy, 91.25%, is obtained with K = 3.
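The three GLCM texture features listed in this abstract (entropy, homogeneity, and contrast) can be computed as in the sketch below. The abstract does not give the exact formulas or the pixel offset used, so several choices here are assumptions: a non-symmetric co-occurrence matrix for the horizontal neighbor, homogeneity defined with a squared-difference denominator, and entropy in base 2. The tiny image and the function names are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Pair = Tuple[int, int]


def glcm(image: List[List[int]], dx: int = 1, dy: int = 0) -> Dict[Pair, float]:
    """Normalized co-occurrence probabilities of gray-level pairs at offset (dy, dx)."""
    counts: Dict[Pair, int] = {}
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                pair = (image[r][c], image[r2][c2])
                counts[pair] = counts.get(pair, 0) + 1
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()}


def glcm_features(p: Dict[Pair, float]) -> Dict[str, float]:
    """Contrast, homogeneity, and entropy of a normalized GLCM."""
    contrast = sum(v * (i - j) ** 2 for (i, j), v in p.items())
    # Assumed definition; some references divide by 1 + |i - j| instead.
    homogeneity = sum(v / (1 + (i - j) ** 2) for (i, j), v in p.items())
    # Base-2 logarithm is an assumption; the dict holds only nonzero probabilities.
    entropy = -sum(v * math.log2(v) for v in p.values())
    return {"contrast": contrast, "homogeneity": homogeneity, "entropy": entropy}


# Hypothetical 2x3 image with two gray levels.
image = [
    [0, 0, 1],
    [0, 1, 1],
]
features = glcm_features(glcm(image))
```

For this toy image the horizontal pairs are (0,0), (0,1), (0,1), and (1,1), so the probabilities are 0.25, 0.5, and 0.25. Together with the average R, G, B values and the defect area, these three numbers would form the 7-element feature vector passed to the KNN classifier.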
Ontology-Based Sentence Extraction for Answering Why-Question
A. A. I. N. Eka Karyawati; Edi Winarko; Azhari Azhari; Agus Harjoko
Proceeding of the Electrical Engineering Computer Science and Informatics Vol 4: EECSI 2017
Publisher : IAES Indonesia Section

DOI: 10.11591/eecsi.v4.1012

Abstract

Most studies on why-question answering systems use keyword-based approaches. They rarely use a domain ontology to capture the semantics of the document contents, especially to detect the presence of causal relations. Consequently, the word mismatch problem often occurs and the system often retrieves irrelevant answers. To solve this problem, we propose an answer extraction method that uses a semantic similarity measure together with selective causality detection. Selective causality detection is applied because not all sentences belonging to an answer contain causality. The semantic similarity measure is used in the scoring function to obtain graded scores for the presence of semantic annotations in a sentence, instead of binary 0/1 values. The measure is based on the shortest path and the maximum depth of the ontology graph. The evaluation compares the proposed method against a comparable ontology-based method, i.e., sentence extraction using Monge-Elkan with a 0/1 internal similarity function. The proposed method shows relative improvements in MRR (16%, 0.79 vs. 0.68), P@1 (15%, 0.76 vs. 0.66), P@5 (14%, 0.8 vs. 0.7), and recall (19%, 0.86 vs. 0.72).
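The abstract says only that the similarity measure depends on the shortest path and the maximum depth of the ontology graph. One well-known measure of that form is the Leacock-Chodorow similarity, so the sketch below uses a normalized Leacock-Chodorow-style score as an assumption; the paper's actual formula may differ. The toy ontology, concept names, and function names are hypothetical.

```python
import math
from collections import deque
from typing import Dict, List

# Hypothetical toy ontology given as child -> parent (is-a) edges.
PARENT: Dict[str, str] = {
    "flood": "disaster",
    "earthquake": "disaster",
    "disaster": "event",
    "festival": "event",
    "event": "thing",
}


def build_adjacency(parent: Dict[str, str]) -> Dict[str, List[str]]:
    """Treat is-a edges as undirected so paths can go up and down the hierarchy."""
    adj: Dict[str, List[str]] = {}
    for child, par in parent.items():
        adj.setdefault(child, []).append(par)
        adj.setdefault(par, []).append(child)
    return adj


def shortest_path_edges(adj: Dict[str, List[str]], start: str, goal: str) -> int:
    """Breadth-first search for the number of edges on the shortest path."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        for nxt in adj.get(node, []):
            if nxt == goal:
                return dist + 1
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    raise ValueError("concepts are not connected")


def max_depth(parent: Dict[str, str]) -> int:
    """Longest root-to-node chain, counted in nodes."""
    def depth(node: str) -> int:
        d = 1
        while node in parent:
            node = parent[node]
            d += 1
        return d
    return max(depth(n) for n in set(parent) | set(parent.values()))


def path_similarity(parent: Dict[str, str], a: str, b: str) -> float:
    """-log(len / (2 * D)) scaled so identical concepts score 1.0; len counts nodes."""
    length = shortest_path_edges(build_adjacency(parent), a, b) + 1
    two_d = 2 * max_depth(parent)
    return -math.log(length / two_d) / math.log(two_d)


sim_same = path_similarity(PARENT, "flood", "flood")
sim_near = path_similarity(PARENT, "flood", "earthquake")
sim_far = path_similarity(PARENT, "flood", "festival")
```

Unlike a 0/1 match, this score decreases gradually as two concepts move apart in the hierarchy, which matches the graded scoring the abstract describes.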