edu.columbia.cs.cg.prdualrank.pattern.extractor
Interface PatternExtractor<T extends Matchable>

All Known Implementing Classes:
DocumentSearchPatternExtractor, ExtractionPatternExtractor, SearchPatternExtractor, WindowedSearchPatternExtractor

public interface PatternExtractor<T extends Matchable>

This class is used for our implementation of: "Searching Patterns for Relation Extraction over the Web: Rediscovering the Pattern-Relation Duality" . Y. Fang and K. C.-C. Chang. In WSDM, pages 825-834, 2011. For further information, WSDM 2011 Conference Website .

Description

Gives the interface for any pattern extractor in PRDualRank. The word "any" relies on all the different kind of Patterns that can be implemented.

Since:
2011-10-07
Version:
0.1
Author:
Pablo Barrio, Goncalo Simoes
See Also:
WSDM 2011 Conference Website

Method Summary
 java.util.Map<Pattern<T,TokenizedDocument>,java.lang.Integer> extractPatterns(TokenizedDocument document, Relationship relationship, java.util.List<Relationship> matchingRelationships)
          Extract specific patterns from the document in the parameter list for the specified relationship and other matching relationships in the same document.
 

Method Detail

extractPatterns

java.util.Map<Pattern<T,TokenizedDocument>,java.lang.Integer> extractPatterns(TokenizedDocument document,
                                                                              Relationship relationship,
                                                                              java.util.List<Relationship> matchingRelationships)
Extract specific patterns from the document in the parameter list for the specified relationship and other matching relationships in the same document. The definition of matching used in this project is based on the EntityMatchers contained in the specified relationship.

Parameters:
document - the document to be processed.
relationship - the relationship that the extractor is trying to generate patterns for.
matchingRelationships - the relationships contained in 'document' that match the specified relationship.
Returns:
the map