-
Snow density data from a tundra and forest vegetation zone in the low-Arctic ...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
Snow surface area data from a tundra and forest vegetation zone in the low-Ar...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
Heat- and CO2 flux data from a tundra vegetation zone in the low-Arctic Tasia...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
Soil temperature and water content data from a forest, lowshrub and lichen ve...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
Precipitation data from a tundra vegetation zone in the low-Arctic Tasiapik v...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
Radiation data from a tundra vegetation zone in the low-Arctic Tasiapik valle...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
Meteorological data from a tundra vegetation zone in the low-Arctic Tasiapik ...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
Air temperature and wind speed data from a forest vegetation zone in the low-...
The data collection contains meteorological, soil and snow measurements from two nearby sites in the forest-tundra ecotone in the Tasiapik valley near Umiujaq in northern... -
PCB-Vision: A Multiscene RGB-Hyperspectral Benchmark Dataset of Printed Circu...
The PCB-Vision dataset is a multiscene RGB-Hyperspectral benchmark dataset comprising 53 Printed Circuit Boards (PCBs). The RGB images are... -
Real-World PP Attachment Disambiguation Dataset
This resource contains a German dataset for real-world PP attachment disambiguation. The creation, analysis and experiment results of the dataset are described in the paper: Do... -
Datasets for Dependency Tree Reranking
This resource contains the datasets for dependency tree reranking in 3 languages: English, German and Czech. The creation, analysis and experiment results of the datasets are... -
GECCC Grammar Error Correction Corpus for Czech (2022-09-28)
Grammar Error Correction Corpus for Czech (GECCC) consists of 83 058 sentences and covers four diverse domains, including essays written by native students, informal website... -
GECCC Grammar Error Correction Corpus for Czech
Grammar Error Correction Corpus for Czech (GECCC) consists of 83 058 sentences and covers four diverse domains, including essays written by native students, informal website... -
DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking
We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains... -
Extensions to the Slovene translation of SuperGLUE
SuperGLUE is a benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a public leaderboard. It is comprised of 8... -
Slovenian datasets for contextual synonym and antonym detection
Slovenian datasets for contextual synonym and antonym detection can be used for training machine learning classifiers as described in the MSc thesis of Jasmina Pegan "Semantic... -
Slovenian Word in Context dataset SloWiC 1.0
The SloWiC dataset is a Slovenian dataset for the Word in Context task. Each example in the dataset contains a target word with multiple meanings and two sentences that both... -
Slovene translation of the SQuAD2.0 dataset
Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to... -
Slovene Translation of the Atomic 2020 data set SloATOMIC 2020
The SloATOMIC 2020 corpus contains the Slovene translations of the ATOMIC 2020 data set, a commonsense knowledge graph with 1.33M everyday inferential knowledge tuples about... -
MultiEmo: Multilingual, Multilevel, Multidomain Sentiment Analysis Corpus of ...
MultiEmo is a new benchmark data set for the multilingual sentiment analysis task, covering 11 languages. The collection contains consumer reviews from four domains: medicine,...