We are pleased to announce the release of
### DKPro Core 1.10.0
a collection of interoperable software components for natural language
processing (NLP) based on the Apache UIMA framework.
https://dkpro.github.io/dkpro-core
This is a feature release.
#### Notable changes since DKPro Core 1.9.3
- Added support for Arabic to CoreNlpSegmenter (thanks @Jibun)
- Added support for Token "form" to CoNLL writers (thanks @Jibun)
- Added ability to provide extra non-standard parameters to CoreNlpSegmenter (thanks @Jibun)
- Added ArkTreet POS tagger trainer (thanks @schrieveslaach)
- Added WebAnno TSV3 reader/writer
- Added reader for Leipzig Corpora Collection
- Upgraded to CoreNLP 3.9.1 (stanfordnlp and corenlp modules)
- Upgraded to OpenNLP 1.9.0
- Upgraded to PDFBox 2.0.9 (io-pdf module)
- Upgraded to LanguageTool 4.2
- Upgraded to CogComp 4.0.7 (lbj module)
- Upgraded to Tika 1.18 (io-tika module)
- Improved handling of multi-line annotations in brat module (thanks @parisni)
- Fix discontinuous annotations crashing the brat reader by reading only the first fragment
- Added dataset description for GUM 4.1.0 dataset
- Removed PARAM_INTERN_TAGS
- Improved component metadata
A more detailed overview of the changes in this release can be found here:
https://github.com/dkpro/dkpro-core/issues?q=milestone%3A1.10.0
Thanks for contributions go to: @jibun, @parisni, @schrieveslaach, @jgrivolla
When upgrading, please mind that you should not mix different versions of DKPro Core
components in your projects - they may not be compatible with each other.