JAIS (Journal of Applied Intelligent System)
Vol 6, No 1 (2021): Journal of Applied Intelligent System

Keyphrase Extraction on Covid-19 Tweets Based on Doc2Vec and YAKE

Fahri Firdausillah (Universitas Dian Nuswantoro)
Erika Devi Udayanti (Universitas Dian Nuswantoro)



Article Info

Publish Date
10 May 2021

Abstract

Keyword and keyphrase extraction is one of the initial foundations for several text processing operations, such as summarization and document clustering. YAKE is an unsupervised, corpus-independent keyphrase extraction technique; it does not require a corpus or linguistic tools such as NER and POS tagging. However, applying YAKE to microblogging documents such as tweets often yields less representative keyphrases, because each document contains too few words for ranking. This paper addresses the problem by using Doc2Vec to find similar tweets during keyphrase extraction, so that more words are available to the YAKE ranking process. Covid-19-related tweets are used as the dataset, since the topic is currently widely discussed on social media, to show that the proposed approach can improve keyphrase extraction performance.
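
The idea described in the abstract, expanding a short tweet with similar tweets before ranking candidate terms, can be illustrated with a minimal sketch. This is not the authors' implementation: to keep the example dependency-free, bag-of-words cosine similarity stands in for the Doc2Vec similarity lookup, and a plain frequency count stands in for YAKE's scoring. The function names, the stopword list, and the sample tweets are all hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list for the example.
STOPWORDS = {"a", "an", "the", "in", "for", "of", "to", "is", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similar_tweets(target, corpus, top_k=2):
    """Stand-in for the Doc2Vec step: return the top_k most similar tweets."""
    target_vec = Counter(tokenize(target))
    scored = [
        (cosine(target_vec, Counter(tokenize(t))), t)
        for t in corpus
        if t != target
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Drop tweets that share no words with the target.
    return [t for score, t in scored[:top_k] if score > 0]


def rank_terms(text, top_n=3):
    """Stand-in for the YAKE step: rank non-stopword terms by frequency."""
    counts = Counter(t for t in tokenize(text) if t not in STOPWORDS)
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ordered[:top_n]]


def extract_with_expansion(target, corpus, top_k=2, top_n=3):
    """Concatenate the target tweet with its neighbours, then rank terms."""
    expanded = " ".join([target] + similar_tweets(target, corpus, top_k))
    return rank_terms(expanded, top_n)


# Hypothetical sample tweets, not taken from the paper's dataset.
tweets = [
    "vaccine rollout starts today",
    "covid vaccine rollout delayed in the city",
    "new vaccine rollout schedule for covid announced",
    "football match tonight",
]

print(rank_terms(tweets[0]))                      # ranking on the tweet alone
print(extract_with_expansion(tweets[0], tweets))  # ranking after expansion
```

On the tweet alone every term occurs once, so the ranking has nothing to separate candidates; after expansion, terms shared with neighbouring tweets gain weight. In a real pipeline the two stand-in functions would be replaced by a trained Doc2Vec model and a YAKE extractor.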

Copyrights © 2021






Journal Info

Abbrev

JAIS

Description

Journal of Applied Intelligent System (JAIS) is published by LPPM Universitas Dian Nuswantoro Semarang in collaboration with CORIS and IndoCEISS, and focuses on research in intelligent systems. Topics of interest include, but are not limited to: biometrics, image processing, computer vision, ...