Recommendation Datasets
Datasets for recommendation systems and sentiment analysis.
IMDB Reviews
The ACL IMDB dataset for sentiment classification. Contains movie reviews labeled as positive or negative.
-
Dataset edu.stanford.aclimdb
datamaestro.data.ml.Supervised
Large Movie Review Dataset
External link: http://ai.stanford.edu/~amaas/data/sentiment/
Paper http://ai.stanford.edu/~amaas/papers/wvSent_acl2011.pdf
Example usage:
from datamaestro import prepare_dataset
# Load IMDB sentiment dataset
imdb = prepare_dataset("edu.stanford.aclimdb")
# Access training and test data
train = imdb.train
test = imdb.test