A Cross-Modal Transformer Based Model for Box-office Revenue Prediction - Details

Author：

Madongo, Canaan T. (Madongo, Canaan T..) | Tang, Zhongjun (Tang, Zhongjun.) (Scholars：唐中君) | Hassan, Jahanzeb (Hassan, Jahanzeb.)

Abstract：

In　the　dynamic　entertainment　industry,　predicting　a　movie＇s　opening　box　office　revenue　remains　critical　for　filmmakers　and　studios.　To　address　this　challenge,　we　present　a　novel　Cross-modal　transformer　and　a　Hierarchical　Fusion　Neural　Network　(CHFNN)　model　tailored　to　predict　movie　box　office　earnings　based　on　multimodal　features　extracted　from　movie　trailers,　posters,　and　reviews.　The　Cross-modal　Transformer　component　of　the　CHFNN　model　captures　intricate　inter-modal　relationships　by　performing　a　cross-modal　fusion　of　the　extracted　features.　It　employs　self-　attention　mechanisms　to　dynamically　weigh　the　importance　of　each　modality＇s　information.　This　allows　the　model　to　learn　to　focus　on　the　most　relevant　information　from　trailers,　posters,　and　reviews,　adapting　to　the　unique　characteristics　of　each　movie.　The　Hierarchical　Fusion　Neural　Network　within　CHFNN　further　refines　the　fused　features,　enabling　a　deeper　understanding　of　the　inherent　hierarchical　structure　of　multimodal　data.　By　hierarchically　combining　the　cross-　modal　features,　our　model　learns　to　capture　both　global　and　local　interactions,　enhancing　its　predictive　capacity.　We　evaluate　the　performance　of　the　CHFNN　model　on　a　comprehensive　Internet　Movie　Dataset　by　obtaining　metadata　for　50,186　movies　from　the　1990s　to　2022,　which　includes　movie　trailers,　posters,　and　review　data.　Our　results　demonstrate　that　the　CHFNN　model　outperforms　existing　models　in　prediction　accuracy,　achieving　95.80%　prediction　accuracy.　The　CHFNN　model　provides　state-of-the-art　predictive　power　and　offers　interpretability　through　attention　mechanisms,　allowing　insights　into　the　factors　contributing　to　a　movie＇s　box　office　success.

Keyword：

movie trailers movie reviews movie posters cross-modal transformers predictions box-office

Author Community：

[ 1 ] [Madongo, Canaan T.]Beijing Univ Technol, Sch Econ & Management, Beijing Modern Mfg Dev, Beijing, Peoples R China
[ 2 ] [Tang, Zhongjun]Beijing Univ Technol, Sch Econ & Management, Beijing Modern Mfg Dev, Beijing, Peoples R China
[ 3 ] [Hassan, Jahanzeb]Beijing Univ Technol, Sch Econ & Management, Beijing Modern Mfg Dev, Beijing, Peoples R China

Reprint Author's Address：

[Madongo, Canaan T.]Beijing Univ Technol, Sch Econ & Management, Beijing Modern Mfg Dev, Beijing, Peoples R China;;

Email：

ctmadongo@yahoo.co.uk |
tangzhongjun@bjut.edu.cn |
jahanzab.hassan@gmail.com

Show more details

Related Keywords：

Movie Box-Office Revenue Prediction Model by Mining Deep Features from Trailers Using Recurrent Neural Networks
2024，JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY
A movie box office revenue prediction model based on deep multimodal features
2023，Multimedia Tools and Applications
Box-office Revenue Prediction by Mining Deep Features from Movie Posters and Reviews Using Transformers
2023，6th International Conference on Artificial Intelligence and Pattern Recognition, AIPR 2023
Research on automatic sentiment analysis of text movie reviews with machine learning methods
2021，

Source ：

JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY

ISSN： 1798-2340

Year： 2024

Issue： 7

Volume： 15

Page： 822-837

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to