|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.columbia.cs.ref.tool.tokenizer.impl.OpenNLPTokenizer
public class OpenNLPTokenizer
The OpenNLPTokenizer is an implementation of the Tokenizer interface that uses OpenNLP models to split the text into tokens.
Constructor Summary | |
---|---|
OpenNLPTokenizer(java.lang.String path)
Instantiates a new OpenNLP tokenizer. |
Method Summary | |
---|---|
Span[] |
tokenize(java.lang.String text)
Splits the content of a text into several tokens |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public OpenNLPTokenizer(java.lang.String path)
path
- the pathMethod Detail |
---|
public Span[] tokenize(java.lang.String text)
Tokenizer
tokenize
in interface Tokenizer
text
- the content of the text
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |