Predicting the Popularity Rating of Thai TV Drama by Text Mining of Social Network

Publisher

NRCT Data Center

Abstract

The objectives of this study were to predict the popularity ratings of Thai TV drama programs with a prediction model built from factors identified and synthesized as affecting those ratings, and to check the accuracy of the model in terms of the Root Mean Square Error (RMSE) of its predictions. The analyzed data were both structured and unstructured. The structured data included the TV channel airing the program, type of drama, on-air time, number of episodes, average time per episode, number of viewers watching already aired programs, number of viewers watching the highlights of already aired programs, and number of viewers listening to program soundtracks. The unstructured data were messages posted on Twitter. The messages were processed by sentiment analysis, and the resulting sentiments were analyzed together with the structured data by multiple regression, yielding predicted popularity ratings. The results show that comments on Thai TV drama programs in social media significantly affected the predicted popularity ratings of those programs. One factor affecting the predicted ratings was ‘message with positive sentiment’. Another factor, the number of viewers watching the highlights of already aired programs, positively affected the popularity ratings when other factors were held fixed. A third factor, the number of viewers watching already aired programs, negatively and significantly affected the popularity ratings (p < 0.05). Finally, the RMSE of the prediction model was 0.717 on the training data set, which contained data from 430,256 people, and 0.41 on the test data set, which contained data from 246,133 people. Our findings may directly benefit Thai TV drama program producers and TV channel administrators in their effort to provide programs that will fully satisfy most viewers.

Keywords: Text Mining, Sentiment Analysis, Multiple Regression, Twitter
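The modeling step described in the abstract (multiple regression on sentiment and viewing features, scored by RMSE on training and test sets) can be illustrated with a minimal sketch. This is not the authors' implementation: the three feature columns, the coefficient values, the noise level, and the 150/50 split are all hypothetical, and the data are synthetic. Only NumPy is assumed.

```python
import numpy as np


def fit_multiple_regression(X, y):
    """Ordinary least squares with an intercept.

    Returns a vector [intercept, b1, ..., bk].
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Prepend a column of ones so the first coefficient is the intercept.
    design = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


def predict(coef, X):
    """Apply fitted coefficients to a feature matrix."""
    X = np.asarray(X, dtype=float)
    return coef[0] + X @ coef[1:]


def rmse(y_true, y_pred):
    """Root Mean Square Error between observed and predicted values."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


# Synthetic, hypothetical data: none of these numbers come from the study.
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([
    rng.uniform(0, 1, n),  # share of tweets with positive sentiment
    rng.uniform(0, 5, n),  # highlight-clip views (arbitrary units)
    rng.uniform(0, 5, n),  # full-episode rerun views (arbitrary units)
])
# Assumed "true" effects, chosen only so the signs echo the abstract.
true_coef = np.array([1.0, 2.0, 0.5, -0.3])
y = true_coef[0] + X @ true_coef[1:] + rng.normal(0, 0.1, n)

# Hold out the last 50 rows as a test set.
X_train, X_test = X[:150], X[150:]
y_train, y_test = y[:150], y[150:]

coef = fit_multiple_regression(X_train, y_train)
train_rmse = rmse(y_train, predict(coef, X_train))
test_rmse = rmse(y_test, predict(coef, X_test))

print("coefficients:", coef)
print("train RMSE:", train_rmse)
print("test RMSE:", test_rmse)
```

Each fitted coefficient is read as the change in predicted rating per unit change in that feature with the other features held fixed, which is the interpretation the abstract uses for the highlight-view and rerun-view factors.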
