Garrett, Michael
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

YouTube Transcripts Word Frequency Measure Smith, Vincent; Garrett, Michael; Harwood, Austin; Shamblin, James
Journal of linguistics, culture and communication Vol 1 No 2 (2023): Journal of Linguistics, Culture, and Communication
Publisher : CV. Rustam

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.61320/jolcc.v1i2.91-99

Abstract

Many YouTube videos provide written audio transcripts which provide information on the language used on YouTube. One important measure relating to language usage is word frequency. Using student-developed software and libraries in R, Python, and Microsoft Excel, the transcripts of one million YouTube videos from the YouTube-8M data set were scraped and analyzed. The word frequency of the YouTube data set was shown to correlate with commonly used word frequency measures from established studies, such as the subtitle word frequency and the HAL word frequency.