Zipf’s law is a theory based on mathematics and linguistics that analyzes and quantifies how words are distributed within a text. It is possible to represent by graphs and statistical analyzes which are the terms that are repeated over so that a ranking of keywords is created. This research found, through the Zipf’s law, variations and uniformities of written academic papers and they presented orally. The oral presentations were inserted in video form on YouTube, it was possible to recover automatically the transcript of the audio. Using a Bash script, texts and transcribed presentations were quantified and organized, thereby creating tag clouds and tables with rankings, facilitating the analysis of the contents. It was possible to identify the spheres of content, identifying common words or not and, mathematically, analyze and compare what was written with what was presented in oral discourse.

Full document

The PDF file did not load properly or your web browser does not support viewing PDF files. Download directly to your device: Download PDF document
Back to Top

Document information

Published on 10/03/15
Accepted on 10/03/15
Submitted on 10/03/15

Volume 24, Issue 2, 2015
DOI: 10.3145/epi.2015.mar.09
Licence: CC BY-NC-SA license

Document Score


Views 0
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?