Dataset - B2FIND

NoReC: The Norwegian Review Corpus

While the NoReC dataset was primarily created for training and evaluating models for document-level sentiment analysis, many other use cases are of course possible. The corpus...
Manually sentiment annotated Slovenian news corpus SentiNews 1.0

Between 2 and 6 annotators independently sentiment annotated a stratified random sample of 10,427 documents from the Slovenian news portals 24ur, Dnevnik, Finance, Rtvslo, and...
Automatically sentiment annotated Slovenian news corpus AutoSentiNews 1.0

The corpus contains 256,567 documents from the Slovenian news portals 24ur, Dnevnik, Finance, Rtvslo, and Žurnal24. These portals contain political, business, economic and...
Facebook Data for Sentiment Analysis

Corpus consisting of 10,000 Facebook posts manually annotated on sentiment (2,587 positive, 5,174 neutral, 1,991 negative and 248 bipolar posts). The archive contains data and...
Czech SubLex 1.0

Czech subjectivity lexicon, i.e. a list of subjectivity clues for sentiment analysis in Czech. The list contains 4626 evaluative items (1672 positive and 2954 negative) together...

You can also access this registry using the API (see API Docs).

5 datasets found