Jurnal Kata : Penelitian tentang ilmu bahasa dan sastra
Vol. 6 No. 2 (2022): Jurnal Kata : Penelitian tentang Ilmu Bahasa dan Sastra

CORPUS-BASED TERMS EXTRACTION IN LINGUISTICS DOMAIN FOR INDONESIAN LANGUAGE

Wahyu Maulana (Universitas Sumatera Utara)
Eddy Setia (Universitas Sumatera Utara)



Article Info

Publish Date
30 Oct 2022

Abstract

This research aims to extract the mono-lexical and poly-lexical terms from linguistics domain in Indonesian language. As the terminology and lexicology concept is somehow blurry, this research applies CTT by Cabré to do the terms extraction procedure. The corpus-based terminology method is applied in this research to get the best mono-lexical and poly-lexical terms possible. To compile the general and the specialized corpus in this research, AntConc is applied as an instrument. Even though the result is noisy, further analysis about the term limitation manually makes this research semi-automatic. The result shows that the limitation in language and words structure helps this research to delimit the mono-lexical terms extracted in this research. Furthermore, the mono-lexical terms extracted act as the starting point for poly-lexical terms.

Copyrights © 2022






Journal Info

Abbrev

kata

Publisher

Subject

Languange, Linguistic, Communication & Media

Description

Jurnal Kata : Penelitian tentang ilmu bahasa dan sastra, by ISSN 2502-0706 (Online) Research on linguistics, literature and art. Is a scientific journal that publishes the results of research and thinking in two languages, namely: Indonesian and English. Jurnal Kata is published twice a year in May ...