Recommendation Datasets

Datasets for recommendation systems and sentiment analysis.

IMDB Reviews

The ACL IMDB dataset for sentiment classification. Contains movie reviews labeled as positive or negative.

Dataset edu.stanford.aclimdb

datamaestro.data.ml.Supervised

Large Movie Review Dataset

External link: http://ai.stanford.edu/~amaas/data/sentiment/

Paper http://ai.stanford.edu/~amaas/papers/wvSent_acl2011.pdf

Example usage:

from datamaestro import prepare_dataset

# Load IMDB sentiment dataset
imdb = prepare_dataset("edu.stanford.aclimdb")

# Access training and test data
train = imdb.train
test = imdb.test