huntoken - tokenizer and sentence splitter

HunToken is a rule-based tokenizer and sentence boundary detector for Hungarian (and English) texts.
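
To illustrate what rule-based sentence boundary detection means in general, here is a minimal Python sketch. It does not reproduce HunToken's actual rules or interface: the `ABBREVIATIONS` set, the `split_sentences` function, and the two rules (the next word must start with an uppercase letter, and the preceding token must not be a listed abbreviation) are assumptions chosen only for illustration.

```python
import re

# Hypothetical abbreviation list for illustration only; not HunToken's rule set.
ABBREVIATIONS = {"dr.", "prof.", "pl.", "stb.", "e.g.", "i.e."}


def split_sentences(text):
    """Split text at sentence-final punctuation using two simple rules.

    A run of '.', '!' or '?' followed by whitespace is a boundary only if
    (1) the next character is an uppercase letter, and
    (2) the token ending at the punctuation is not a known abbreviation.
    """
    sentences = []
    start = 0
    for match in re.finditer(r"[.!?]+\s+", text):
        # Position just after the punctuation, before the trailing whitespace.
        end_punct = match.start() + len(match.group().rstrip())
        next_char = text[match.end():match.end() + 1]
        last_token = text[start:end_punct].split()[-1].lower()
        if not next_char.isupper() or last_token in ABBREVIATIONS:
            continue  # Not a sentence boundary under these rules.
        sentences.append(text[start:end_punct].strip())
        start = match.end()
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


if __name__ == "__main__":
    for sentence in split_sentences("Dr. Kovács megérkezett. Leült, pl. a székre. Jó napot!"):
        print(sentence)
```

A production tool needs far richer rules (numbers, quotations, ordinals, language-specific abbreviation lists); the sketch only shows the general shape of the approach.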

Identifier
PID http://hdl.handle.net/11372/LRT-1338
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11372/LRT-1338
Provenance
Creator Németh, László; Halácsy, Péter; Kornai, András
Publisher Budapest Technical University Media Research Centre
Contributor Németh, László
Publication Year 2014
Rights GNU Library or "Lesser" General Public License 3.0 (LGPL-3.0); http://opensource.org/licenses/LGPL-3.0; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Resource Type toolService
Format application/x-gzip; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics
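
The Metadata Access URL listed under Identifier is an OAI-PMH GetRecord request. As a hedged sketch, the following Python snippet shows how that URL is assembled from its parts. The function name `get_record_url` and the constant names are hypothetical; the base URL, the `verb`, `metadataPrefix`, and `identifier` parameters, and the handle are taken from the record itself.

```python
from urllib.parse import urlencode

# Base URL and identifier prefix as they appear in the Metadata Access line.
BASE_URL = "http://lindat.mff.cuni.cz/repository/oai/request"
OAI_PREFIX = "oai:lindat.mff.cuni.cz:"


def get_record_url(handle, metadata_prefix="oai_dc"):
    """Build an OAI-PMH GetRecord URL for a handle such as '11372/LRT-1338'."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": OAI_PREFIX + handle,
    }
    # Keep ':' and '/' unescaped so the result matches the URL in the record.
    return BASE_URL + "?" + urlencode(params, safe=":/")


if __name__ == "__main__":
    print(get_record_url("11372/LRT-1338"))
```

Fetching the resulting URL (for example with `urllib.request.urlopen`) should return the Dublin Core metadata as XML, assuming the repository endpoint is still available at this address.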