|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.columbia.cs.api.OpenIEUnsupervisedRelationshipExtractor<D>
public class OpenIEUnsupervisedRelationshipExtractor<D extends Document>
Implementation of the relationship extractor that is based on the unsupervised
learning of KnowItAll. Additionally, this extractor can use a classifier to determine
the confidence that each result of the unsupervised learning is a relationship (ReVerb
approach)
This class uses the original software of ReVerb that can be found in
http://reverb.cs.washington.edu/
To know more about KnowItAll or Reverb please refer to:
Identifying Relations for Open
Information Extraction
Constructor Summary | |
---|---|
OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter,
Tokenizer tokenizer,
POSTagger pos,
Chunker chunker)
Constructor of the Open IE relationship extractor. |
|
OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter,
Tokenizer tokenizer,
POSTagger pos,
Chunker chunker,
weka.classifiers.Classifier cla)
Constructor of the Open IE relationship extractor. |
|
OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter,
Tokenizer tokenizer,
POSTagger pos,
Chunker chunker,
weka.classifiers.Classifier cla,
double threshold)
Constructor of the Open IE relationship extractor. |
|
OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter,
Tokenizer tokenizer,
POSTagger pos,
Chunker chunker,
double threshold)
Constructor of the Open IE relationship extractor. |
Method Summary | |
---|---|
java.util.List<Relationship> |
extractTuples(Document doc)
Implementation of the extractTuples method. |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter, Tokenizer tokenizer, POSTagger pos, Chunker chunker)
splitter
- the sentence splittertokenizer
- the tokenizerpos
- the POS taggerchunker
- the NLP chunkerpublic OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter, Tokenizer tokenizer, POSTagger pos, Chunker chunker, weka.classifiers.Classifier cla)
splitter
- the sentence splittertokenizer
- the tokenizerpos
- the POS taggerchunker
- the NLP chunkercla
- the classifier used to compute the confidence that a given answer is a relationshippublic OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter, Tokenizer tokenizer, POSTagger pos, Chunker chunker, double threshold)
splitter
- the sentence splittertokenizer
- the tokenizerpos
- the POS taggerchunker
- the NLP chunkerthreshold
- the confidence threshold to consider that a given candidate is
a relationship: if the confidence of a candidate is higher than the threshold
then the candidate is considered a relationship otherwise it is discardedpublic OpenIEUnsupervisedRelationshipExtractor(SentenceSplitter splitter, Tokenizer tokenizer, POSTagger pos, Chunker chunker, weka.classifiers.Classifier cla, double threshold)
splitter
- the sentence splittertokenizer
- the tokenizerpos
- the POS taggerchunker
- the NLP chunkercla
- the classifier used to compute the confidence that a given answer is a relationshipthreshold
- the confidence threshold to consider that a given candidate is
a relationship: if the confidence of a candidate is higher than the threshold
then the candidate is considered a relationship otherwise it is discardedMethod Detail |
---|
public java.util.List<Relationship> extractTuples(Document doc)
extractTuples
in interface RelationshipExtractor<Document>
d
- the document that contains the information to be extracted
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |