Sentiment analysis is a natural language processing problem where text is understood and the underlying intent is predicted.
Let us learn how you can predict the sentiment of movie reviews as either positive or negative in Python using the Keras deep learning library.
By the end you will know:
The dataset is the Large Movie Review Dataset often referred to as the IMDB dataset.
The Large Movie Review Dataset contains 25,000 highly polar moving reviews (good or bad) for training and the same amount again for testing. The problem is to determine whether a given moving review has a positive or negative sentiment.
The data was collected by Stanford researchers where a split of 50/50 of the data was used for training and test. An accuracy of 88.89% was achieved.
The data was also used as the basis for a Kaggle competition titled “Bag of Words Meets Bags of Popcorn” in late 2014 to early 2015. Accuracy was achieved above 97% with winners achieving 99%.