SELEXINI corpus

PID

We present here a large automatically annotated corpus for French. This corpus is divided into two parts: the first from BigScience, and the second from HPLT. The annotated documents from HPLT were selected in order to optimise the lexical diversity of the final corpus SELEXINI.

Identifier
PID http://hdl.handle.net/11234/1-5822
Related Identifier https://selexini.lis-lab.fr/
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-5822
Provenance
Creator Scholivet, Manon; Savary, Agata; Estève, Louis Clément; Candito, Marie; Ramisch, Carlos
Publisher Université Paris-Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique
Publication Year 2024
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language French
Resource Type toolService
Format application/x-gzip; downloadable_files_count: 4
Discipline Linguistics