edu.columbia.cs.cg.prdualrank.pattern.extractor
Interface PatternExtractor<T extends Matchable>
- All Known Implementing Classes:
- DocumentSearchPatternExtractor, ExtractionPatternExtractor, SearchPatternExtractor, WindowedSearchPatternExtractor
public interface PatternExtractor<T extends Matchable>
This class is used for our implementation of:
"Searching Patterns for Relation Extraction over the Web: Rediscovering the Pattern-Relation Duality" . Y. Fang and K. C.-C. Chang. In WSDM, pages 825-834, 2011.
For further information, WSDM 2011 Conference Website .
Description
Gives the interface for any pattern extractor in PRDualRank. The word "any" relies on all the different kind of Patterns that can be implemented.
- Since:
- 2011-10-07
- Version:
- 0.1
- Author:
- Pablo Barrio, Goncalo Simoes
- See Also:
- WSDM 2011 Conference Website
extractPatterns
java.util.Map<Pattern<T,TokenizedDocument>,java.lang.Integer> extractPatterns(TokenizedDocument document,
Relationship relationship,
java.util.List<Relationship> matchingRelationships)
- Extract specific patterns from the document in the parameter list for the specified relationship and other matching relationships in the same document. The definition of matching used in
this project is based on the EntityMatchers contained in the specified relationship.
- Parameters:
document
- the document to be processed.relationship
- the relationship that the extractor is trying to generate patterns for.matchingRelationships
- the relationships contained in 'document' that match the specified relationship.
- Returns:
- the map