Predicting TED Talk Ratings from Language and Prosody

05/21/2019
by   Md Iftekhar Tanveer, et al.
0

We use the largest open repository of public speaking—TED Talks—to predict the ratings of the online viewers. Our dataset contains over 2200 TED Talk transcripts (includes over 200 thousand sentences), audio features and the associated meta information including about 5.5 Million ratings from spontaneous visitors of the website. We propose three neural network architectures and compare with statistical machine learning. Our experiments reveal that it is possible to predict all the 14 different ratings with an average AUC of 0.83 using the transcripts and prosody features only. The dataset and the complete source code is available for further analysis.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset