Dataset - B2FIND

Text classification model fastText-Trendi-Topics 1.0

The fastText-Trendi-Topics model is a text classification model for categorizing news texts with one of 13 topic labels. It was trained on a set of approx. 36,000 Slovene texts...

Slovenian keyword extraction dataset from SentiNews 1.0

The dataset consists of 7514 Slovenian news articles from the SentiNews 1.0 corpus by Bučar et al. 2017 (http://hdl.handle.net/11356/1110) for which article keywords were available....

Text classification model SloBERTa-Trendi-Topics 1.0

The SloBERTa-Trendi-Topics model is a text classification model for categorizing news texts with one of 13 topic labels. It was trained on a set of approx. 36,000 Slovene texts...

3 datasets found