-
POLFIE Bank, an LFG structure bank of Polish: pol-składnica-pargram
The pol-składnica-pargram structure bank was created using POLFIE: an LFG grammar of Polish. This structure bank contains FULL type sentences from Składnica, which were in turn... -
Word embeddings for Polish (KGR10, Fasttext binary) kgr10_fasttext_bin_v1
Distributional language model (binary) for Polish trained on KGR10 using Fasttext (vector dimension: 100). -
Big Data language model - subword - BPE - RAW
Big data language model based on subword units, based on byte pair encoding in RAW format -
AspectEmo 1.0: Multi-Domain Corpus of Consumer Reviews for Aspect-Based Senti...
AspectEmo 1.0 Corpus is an extended version of a publicly available PolEmo 2.0 corpus of Polish customer reviews, that was used in many projects on the use of different methods... -
Dependency parsing models for Polish
PDB-based parsing models are trained on the current version of Polish Depedency Bank with the publicly available parsing systems: MaltParser, MateParser, and UDPipe. -
XLM-RoBERTa events recognition
Event recognition models for the Polish language, based on the XLM-RoBERTa language model. -
Description of nominal lexico-semantic relations in plWordNet 4.0 (Guidelines)
The pdf document contains guidelines of decription of Nouns in the Polish part of plWordNet. -
Big data language model stemmed with BPE in RAW format
Big data language model stemmed with BPE in RAW format -
Big Data language model - subword - SYLLABED - RAW
Big data language model based on syllabes in RAW format -
Polish-Lithuanian Parallel Corpus "2"
New upgraded version of the Polish-Lithuanian Parallel Corpus (http://hdl.handle.net/11321/309) with extra files and features (Including General, Medical, Technical, Legal,... -
Powieść - Lalka
a book in Polish by Bolesław Prus -
POLFIE: an LFG grammar of Polish
POLFIE is an LFG grammar of Polish implemented in the XLE system (Xerox Linguistic Environment). POLFIE has been developed at the Institute of Computer Science, Polish Academy... -
Świgra
Świgra is a parser of Polish generating constituency trees using a DCG style grammar stemming from Marek Świdziński’s grammar “Gramatyka formalna języka polskiego” (1992). The... -
Polish Parliamentary Corpus
The Polish Parliamentary Corpus (PPC) is a large collection of linguistically analysed documents from the proceedings of Polish Parliament, Sejm and Senate. The corpus files are... -
Krokodyl: A hybrid depencency parser of Polish
Krokodyl is an experimental hybrid deep depencency parser of Polish. Krokodyl has been developed at the Institute of Computer Science, Polish Academy of Sciences (IPI PAN)... -
POLFIE-OT: an LFG grammar of Polish with OT marks
POLFIE-OT is a version of POLFIE, an LFG grammar of Polish implemented in the XLE system (Xerox Linguistic Environment), enriched with OT (Optimality Theory) constraints for the... -
Wroclaw Corpus of Consumer Reviews Sentiment (WCCRS)
Wroclaw Corpus of Consumer Reviews is a corpus of Polish reviews annotated with sentiment at the level of the whole text (text) and at the level of sentences (sentence) for the... -
Smyrna
Smyrna is a tool for building and searching own Polish corpora from HTML files. -
Polish-Lithuanian Parallel Corpus
Database