-
Dataset for color terms, 2012
This dataset comprises adjective-noun phrases with color terms. -
Scrambled text: training Language Models to correct OCR errors using syntheti...
This data repository contains the key datasets required to reproduce the paper "Scrambled text: training Language Models to correct OCR errors using synthetic data". In addition... -
Combining text and vision in compound semantics: Towards a cognitively plausi...
In the current state-of-the art distributionalsemantics model of the meaning of noun-noun compounds (such aschainsaw, but-terfly, home phone),CAOSS(Marelli... -
Propositional Claim Detection (NLP Datensatz)
Es handelt sich um einen natural language processing (NLP) Trainingsdatensatz. Modelle, die auf diesen Daten trainiert werden, sollen Behauptungen klassifizieren können, die... -
Evidence - Computer-assisted Interactive Extraction of Dictionary Examples fr...
Anonymized models from the expert and lay-user studies conducted in the project Evidence. Each model was train for 50-60 iterations on a specific word class (adjective, verb,... -
Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Rev...
A dataset of aligned scientific paper revisions manually labeled according to their action and intent, and supplemented with the respective peer reviews and human-written edit... -
LLaMA-2 13B Model checkpoints for Fine-Tuning with Divergent Chains of Though...
LLaMA-2 13B Model checkpoints for the paper Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models -
Phi 1.5 Model checkpoints for Fine-Tuning with Divergent Chains of Thought Bo...
Phi 1.5 Model checkpoints for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models -
LLaMA-2 13B-Chat Model checkpoints for Fine-Tuning with Divergent Chains of T...
LLaMA-2 13B-Chat Model checkpoints for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models -
LLaMA-2 7B Model checkpoints for Fine-Tuning with Divergent Chains of Thought...
LLaMA-2 7B Model checkpoints for the paper Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models -
LLaMA-2 70B Model checkpoints for Fine-Tuning with Divergent Chains of Though...
LLaMA-2 70B Model checkpoints for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models -
Outputs for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Thr...
Raw responses from the models, clean answers, post-processed predictions and evaluation results for each model and dataset using in the publication Fine-Tuning with Divergent... -
Constrained C-Test Generation via Mixed-Integer Programming (Supplementary Ma...
This work proposes a novel method to generate C-Tests; a deviated form of cloze tests (a gap filling exercise) where only the last part of a word is turned into a gap. In... -
AMR parse quality prediction [Source Code]
Accuracy prediction for AMR parsing predicts 33 accuracy metrics for a given sentence and its (automatic) AMR parse Abstract (Opitz and Frank, 2019): Semantic proto-role... -
NLP in Diagnostic Texts from Nephropathology [Research Data]
This data set contains all annotated topic word tables from the work "NLP in Diagnostic Texts from Nephropathology", as well as all pre-processed and tf-idf-vectorized text... -
WikiEvents Dataset from January 2020 to December 2022
WikiEvents is a knowledge graph based dataset for NLP and event-related machine learning tasks. This dataset includes RDF data in JSON-LD about events between January 2020 and... -
Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals
This is the resource for the dataset and models released as a part of our EMNLP 2023 paper "Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals" -
Data Linking Workshop 2023: Computer Vision and Natural Language Processing –...
The humanities meet computer science to create new synergies using computer vision and natural language processing. Aim & Scope Historians are increasingly using... -
Data Linking Workshop 2023: Computer Vision and Natural Language Processing –...
The humanities meet computer science to create new synergies using computer vision and natural language processing. Aim & Scope Historians are increasingly using... -
Annotation Curricula to Implicitly Train Non-Expert Annotators
Annotation studies often require annotators to familiarize themselves with the task, its annotation scheme, and the data domain. This can be overwhelming in the beginning,...