Dear All,
the training dataset for the Task #1 of the Semantic Sentiment Analysis challenge of ESWC 2016 has been released.
The dataset contains 1 million product reviews from Amazon.com belonging to 20 domains.
For each domain, 50.000 reviews equally split between positive and negative polarities have been considered.
For easing the running of cross-fold validation, we split, in each domain, both positive and negative reviews in five folds of 5.000 reviews each.
The test set will be released during the month of April and it will contain product reviews randomly extracted from the 20 domains contained in the dataset.
Apart from this training set, you are allowed to use any other resource and knowledge base for building your models.
Please follow the links below for downloading the dataset and for further resources.
Resources (not limited to these ones)
Even if you are not planning to compete in the challenge, please consider to adopt the dataset also for training and evaluating your approaches.
Submissions to the Semantic Sentiment Analysis workshop are welcome.
For any question, do not hesitate to contact the chairs.
Kind regards.