M Prihantoro, M
Universitas Diponegoro

CREATING AND PROCESSING A CORPUS
Prihantoro, M
Lingual: Journal of Language and Culture Vol 3, No 4 (2015)
Publisher : Lingual: Journal of Language and Culture

Abstract

This paper describes why corpora and text processing matter. A corpus is a projection of how a language is used by its speakers. Technological support has made corpora easier to maintain and more space-saving, and has allowed their data to be structured electronically. This electronic structuring gives corpus users considerable freedom to access and exploit a corpus for language teaching, analysis, or other specific tasks. The paper demonstrates how to use open-access corpora on the internet, such as the Corpus of Contemporary American English (COCA) and the British National Corpus (BNC). Besides using a corpus, the paper also describes how to build one. For this purpose the writer uses UNITEX, a corpus (text-based) processing software. The software is used to demonstrate the steps of corpus building, from text collection, annotation, and electronic dictionary application to natural-language-based operations such as pattern matching, concordance, and simple extraction. The paper also shows how graph technology may outperform regular expressions, a retrieval method exploited by other corpus processors, in terms of writing output.
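
To make the notions of pattern matching and concordance concrete, the following is a minimal sketch of a keyword-in-context (KWIC) concordance built with regular expressions, the retrieval method the abstract contrasts with graphs. It is not UNITEX's method or interface; the function name `concordance`, the `width` parameter, and the sample text are assumptions chosen for illustration.

```python
import re


def concordance(text, pattern, width=30):
    """Return keyword-in-context (KWIC) lines for every regex match.

    Hypothetical helper for illustration only; not part of UNITEX.
    """
    lines = []
    for m in re.finditer(pattern, text, flags=re.IGNORECASE):
        # Take up to `width` characters of context on each side of the match.
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        # Right-align the left context so the matched keywords line up.
        lines.append(f"{left:>{width}} [{m.group(0)}] {right}")
    return lines


# Illustrative sample text (an assumption, not data from the paper).
sample = "A corpus shows how language is used. Corpora support language teaching."

for line in concordance(sample, r"\bcorp(?:us|ora)\b", width=20):
    print(line)
```

A regular expression like this one only locates and displays matches. Rewriting or annotating the matched text needs extra code around it, which is the kind of output step the abstract says graphs may handle better.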