TELKOMNIKA (Telecommunication Computing Electronics and Control)
Vol 12, No 3: September 2014

A Novel Part-of-Speech Set Developing Method for Statistical Machine Translation

Herry Sujaini (Bandung Institute of Technology)
Kuspriyanto Kuspriyanto (Bandung Institute of Technology)
Arry Akhmad Arman (Bandung Institute of Technology)
Ayu Purwarianti (Bandung Institute of Technology)



Article Info

Publish Date
01 Sep 2014

Abstract

Part of speech (PoS) is one of the features that can be used to improve the quality of statistical-based machine translation. Typically, the language PoS determined based grammar of the language or adopt from other languages PoS. This work aims to formulate a model to developing PoS as linguistic factors to improve the quality of machine translation automatically. The research method using word similarity approach, where we perform clustering of the words contained in a corpus. Further classes will be defined as PoS set obtained for a given language.We evaluated the results of the PoS that defined computational results using machine translation system MOSES as the system by comparing the results of the SMT are using PoS sets generated manually, while the assessment of the system using BLEU method. Language that will be used for evaluation is English as the source language and Indonesian as the target language.

Copyrights © 2014






Journal Info

Abbrev

TELKOMNIKA

Publisher

Subject

Computer Science & IT

Description

Submitted papers are evaluated by anonymous referees by single blind peer review for contribution, originality, relevance, and presentation. The Editor shall inform you of the results of the review as soon as possible, hopefully in 10 weeks. Please notice that because of the great number of ...