Linguistik Indonesia
Vol 36, No 1 (2018): Linguistik Indonesia

WORKING WITH A LINGUISTIC CORPUS USING R: AN INTRODUCTORY NOTE WITH INDONESIAN NEGATING CONSTRUCTION

Gede Primahadi Wijaya Rajeg (Monash University)
Karlina Denistia (Eberhard Karls University of Tübingen)
I Made Rajeg (Universitas Udayana)



Article Info

Publish Date
20 Feb 2019

Abstract

This paper demonstrates the use of R for unified data science in corpus linguistics through a series of corpus-based analyses of the Indonesian Negating Construction. The data come from an online-news corpus of c. 17 million word tokens, part of the Indonesian Leipzig Corpora. First, we find that tidak is the most frequent negating form in our corpus. Second, we find that tak has a significantly higher type frequency than tidak for negated predicates with the [ter-X-kan] schema. This finding adds quantitative nuance to a description in an Indonesian reference grammar, which states that (i) in present-day Indonesian tidak is also commonly used to negate ter- predicates, while (ii) the obligatory use of tak to negate ter- predicates belongs to past usage. Lastly, we refine the second finding by applying Distinctive Collexeme Analysis, which shows that, compared to tidak, tak strongly attracts specific verbs, predominantly in the [ter-X-kan] schema; this offers a deeper characterisation of tidak and tak.

Copyright © 2018






Journal Info

Abbrev

linguistik_indonesia

Description

Linguistik Indonesia is published by Masyarakat Linguistik Indonesia (MLI). It is a research journal which publishes various research reports, literature studies and scientific writings on phonetics, phonology, morphology, syntax, discourse analysis, pragmatics, anthropolinguistics, language and ...