Movie Genre Classification from Plot Summaries using Bidirectional LSTM

Ertugrul A. M., KARAGÖZ P.

12th IEEE International Conference on Semantic Computing (ICSC), California, United States Of America, 31 January - 02 February 2018, pp.248-251 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/icsc.2018.00043
  • City: California
  • Country: United States Of America
  • Page Numbers: pp.248-251
  • Keywords: Movie genre classification, LSTM, Recurrent Neural Networks (RNNs)
  • Middle East Technical University Affiliated: Yes


Movie plot summaries are expected to reflect the genre of movies since many spectators read the plot summaries before deciding to watch a movie. In this study, we perform movie genre classification from plot summaries of movies using bidirectional LSTM (Bi-LSTM). We first divide each plot summary of a movie into sentences and assign the genre of corresponding movie to each sentence. Next, using the word representations of sentences, we train Bi-LSTM networks. We estimate the genres for each sentence separately. Since plot summaries generally contain multiple sentences, we use majority voting for the final decision by considering the posterior probabilities of genres assigned to sentences. Our results reflect that, training Bi-LSTM network after dividing the plot summaries into their sentences and fusing the predictions for individual sentences outperform training the network with the whole plot summaries with the limited amount of data. Moreover, employing Bi-LSTM performs better compared to basic Recurrent Neural Networks (RNNs) and Logistic Regression (LR) as a baseline.