-
A Digital Dictionary of Tunis Arabic - TUNICO (ELEXIS)
A corpus-based dictionary, enriched with historical data. The dictionary was not only built on data from the corpus of spoken language that was compiled in the same project, but... -
Dictionary of Viennese Dialect - Jakob (1929) (ELEXIS)
Wörterbuch des Wiener Dialektes (1929). Dictionary of the Viennese dialect with a concise grammar. -
Multilingual comparable corpora of parliamentary debates ParlaMint 3.0
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2022, with the individual corpora... -
Treq Translation Equivalents (ELEXIS)
Data for Treq interface 2.0 derived from the InterCorp parallel corpus release 12. -
Renish Dictionary + Supplements to the Renish Dictionary - RhWB (ELEXIS)
Rheinisches Wörterbuch + Nachträge zum Rheinischen Wörterbuch. With its nine volumes, the Rhenish Dictionary (1928-1971) is the most comprehensive dialect dictionary of West... -
Slovenian manuscript sermons by Ignacij Holzapfel 1.0
This corpus consists of editions of three volumes of sermons written by Ignatius Holzapfel (1799-1866) when he was active as parish priest in Črnomelj and Ribnica. The bulk of... -
A Machine-readable Dictionary of Damascus Arabic - dc-apc-eng (ELEXIS)
This dictionary has been prepared to support the Syrian Textbook prepared at the University of Vienna. See also: https://hdl.handle.net/11022/0000-0007-C093-9 -
Palatinate Dictionary - PfWB (ELEXIS)
Pfälzisches Wörterbuch. The Palatinate Dictionary lists the entire dialectal vocabulary of the Palatinate in use today. -
Slovenian-German Dictionary of Maks Pleteršnik (1894-1895)
The Slovenian-German Dictionary of Maks Pleteršnik was first published in 1894-1895. It contains 103,185 dictionary entries. Beside standard and dialect lexis of the 19th... -
New Idioticon Viennense - Loritza (ELEXIS)
Neues Idioticon Viennense. Digitized version of a historic dialect dictionary of Viennese (1847). -
Ladin-German Dictionary (Letter F) (ELEXIS)
Dizionar Ladin-Deutsch (Letter F). This dataset contains the letter F of the Ladin-German dictionary by Giovanni Mischí. -
Concreteness and imageability lexicon MEGA.HR-Crossling
The lexicon contains concreteness and imageability predictions of words in 77 languages. The resource is built via supervised machine learning, using average human responses... -
The news articles reporting on the 2021 Tokyo Olympics data set OG2021 (resea...
The OG2021 corpus contains multilingual news articles that are reporting on the events happening during the 2021 Tokyo Olympics. The data set was created to evaluate the... -
Lemma list of the German Dictionary elexiko (ELEXIS)
elexiko is an online information system ("dictionary") on contemporary German language (mainly post World War II), which documents, explains and scientifically comments on the... -
The German Dictionary by Jacob and Wilhelm Grimm (first edition) - DWB (ELEXIS)
Deutsches Wörterbuch von Jacob Grimm und Wilhelm Grimm (Erstbearbeitung). Deutsches Wörterbuch by Jacob and Wilhelm Grimm is the largest and most comprehensive dictionary of the... -
Emoji Sentiment Ranking 1.0
A lexicon of 751 emoji characters with automatically assigned sentiment. The sentiment is computed from 70,000 tweets, labeled by 83 human annotators in 13 European languages.... -
Dictionary of the Alsatian Dialects - ElsWB (ELEXIS)
Wörterbuch der elsässischen Mundarten. The dialectal vocabulary of the German-speaking parts of Lorraine is recorded in the one-volume Dictionary of German-Lorraine Dialects. It... -
Multilingual comparable corpora of parliamentary debates ParlaMint 4.0
ParlaMint 4.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and... -
Linguistically annotated multilingual comparable corpora of parliamentary deb...
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2022, with the individual corpora... -
xLiMe Twitter Corpus XTC 1.0.1
The xLiMe Twitter Corpus contains tweets in German, Italian and Spanish manually annotated with part-of-speech, named entities, and message-level sentiment polarity. In total,...