Universal Dependencies
90 languages
StanfordNLP
53 languages
Polyglot 51 languages
(Toma Tasovac)
Every NLP project can benefit from the fine-tuning of statistical language models on project materials. This is especially true for:
🌘 Cadet
in development
guest ~monolingualism
Interface to add or edit stop words, tokenization rules, lemmata, normalization rules, for base language object
Run as an external recommender for INCEpTION to generate annotation data
Actively update the spaCy model from annotations
Debug and batch train on annotation data
Package and export customized spaCy model
#TODO