Standford CoreNLP包含很多功能,github上有源码,github地址:Stanford CoreNLP,有需要的话可以下载看看。
主要内容在网站上都有描述,原文是这样写的:
Choose Stanford CoreNLP if you need:
- An integrated toolkit with a good range of grammatical analysis tools
- Fast, reliable analysis of arbitrary texts
- The overall highest quality text analytics
- Support for a number of major (human) languages
- Interfaces available for various major modern programming languages
- Ability to run as a simple web service
工具以及对各种语言的支持如下表(英文和中文支持的最好),分别对应:分词,断句,定词性,词元化,分辨命名实体,语法分析,情感分析,同义词分辨等。
Annotator | ar | zh | en | fr | de | es |
---|---|---|---|---|---|---|
Tokenize / Segment | ✔ | ✔ | ✔ | ✔ | ✔ | |
Sentence Split | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
Part of Speech | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
Lemma | ✔ | |||||
Named Entities | ✔ | ✔ | ✔ | ✔ | ||
Constituency Parsing | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
Dependency Parsing | ✔ | ✔ | ✔ | ✔ | ||
Sentiment Analysis | ✔ | |||||
Mention Detection | ✔ | ✔ | ||||
Coreference | ✔ | ✔ | ||||
Open IE | ✔ |