edu.columbia.cs.api
Class PatternBasedRelationshipExtractor<T extends Relationship,D extends Document>

java.lang.Object
  extended by edu.columbia.cs.api.PatternBasedRelationshipExtractor<T,D>
All Implemented Interfaces:
RelationshipExtractor<D>

public class PatternBasedRelationshipExtractor<T extends Relationship,D extends Document>
extends java.lang.Object
implements RelationshipExtractor<D>

Implementation of the relationship extractor that uses patterns to extract information from a document

Since:
2011-09-27
Version:
0.1
Author:
Pablo Barrio, Goncalo Simoes

Constructor Summary
PatternBasedRelationshipExtractor(java.util.Set<Pattern<T,D>> patterns)
          Constructor of the pattern based relationship extractor.
 
Method Summary
 java.util.List<Relationship> extractTuples(D d)
          Implementation of the extract Tuples method that uses all the patterns available to extract tuples from the input document
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PatternBasedRelationshipExtractor

public PatternBasedRelationshipExtractor(java.util.Set<Pattern<T,D>> patterns)
Constructor of the pattern based relationship extractor. It receives as input a set of patterns to extract relationships from tokenized documents

Parameters:
patterns - a set of patterns to extract relationships from tokenized documents
Method Detail

extractTuples

public java.util.List<Relationship> extractTuples(D d)
Implementation of the extract Tuples method that uses all the patterns available to extract tuples from the input document

Specified by:
extractTuples in interface RelationshipExtractor<D extends Document>
Parameters:
d - a tokenized document from which we will extract information