Baseline Performance of Pre-trained Models on Movie Genre Classification from Spectrograms

dc.contributor.authorPorawat Visutsak
dc.contributor.authorKavin Treeraphapkajondet
dc.contributor.authorVisaroot Sakphet
dc.contributor.authorWachirawit Nitinuntatip
dc.contributor.authorPawwinkan Satthong
dc.contributor.authorTanajak Tongbai
dc.contributor.authorDuongduen Ongrungruaeng
dc.contributor.authorAtiwitch Juntra
dc.contributor.authorWatcharaporn Aiamlamai
dc.contributor.authorIssares Sungwanna
dc.contributor.authorPrapaporn Phetrak
dc.contributor.authorPonrudee Netisopakul
dc.contributor.authorKeun Ho Ryu
dc.date.accessioned2026-05-08T19:24:46Z
dc.date.issued2025-4-30
dc.description.abstractThis study investigates the use of deep learning for classifying movie genres based on audio spectrograms. We construct a dataset of movie trailers, transform them into spectrograms, and label them by genre. Then, we utilize MATLAB's pre-trained convolutional neural networks (CNNs) for clas- sication, comparing the performance of 9 different architectures, including MobileNet-v2, RestNet-18, DenseNet-201, Places365-GoogLeNet, VGG- 16, VGG-19, Inception-RestNet-v2, Inception-v3, and NASANet-Mobile. We evaluated all models based on their ability to classify movie trailers into ve genres: action, romance, drama, comedy, and thriller. Our results, based on accuracy and F1-score across genres, indicate that VGG16 achieves the highest overall performance with an accuracy of 86.27%, an F1-score of 86.69%, a recall of 86.87%, and a precision of 87.28%. This research demonstrates the potential of leveraging pre-trained CNNs, particularly VGG-16, for efficient and effective audio-based genre classification in movie trailers.
dc.identifier.doi10.37936/ecti-cit.2025192.259990
dc.identifier.urihttps://dspace.kmitl.ac.th/handle/123456789/19756
dc.publisherECTI Transactions on Computer and Information Technology (ECTI-CIT)
dc.subjectGenerative Adversarial Networks and Image Synthesis
dc.subjectMedia Influence and Health
dc.subjectCinema and Media Studies
dc.titleBaseline Performance of Pre-trained Models on Movie Genre Classification from Spectrograms
dc.typeArticle

Files

Collections