edu.columbia.cs.cg.prdualrank.pattern.extractor.impl
Class DocumentSearchPatternExtractor<T extends Document>

java.lang.Object
  extended by edu.columbia.cs.cg.prdualrank.pattern.extractor.SearchPatternExtractor<Document>
      extended by edu.columbia.cs.cg.prdualrank.pattern.extractor.impl.DocumentSearchPatternExtractor<T>
All Implemented Interfaces:
PatternExtractor<Document>

public class DocumentSearchPatternExtractor<T extends Document>
extends SearchPatternExtractor<Document>

This class is used for our implementation of: "Searching Patterns for Relation Extraction over the Web: Rediscovering the Pattern-Relation Duality" . Y. Fang and K. C.-C. Chang. In WSDM, pages 825-834, 2011. For further information, WSDM 2011 Conference Website .

Description

This class defines the behavior of the generation of search pattern as stated in Algorithm PatternSearch(To,S,E) in Figure 9 on Section 5 and Definition 1 in Section 3.1.

For an efficient implementation (different to what is described in the paper, but discussed with the authors) refer to WindowedSearchPatternExtractor.

Since:
2011-10-07
Version:
0.1
Author:
Pablo Barrio, Goncalo Simoes
See Also:
WSDM 2011 Conference Website , WindowedSearchPatternExtractor

Constructor Summary
DocumentSearchPatternExtractor(int ngram, int numberOfPhrases)
          Instantiates a new document search pattern extractor.
 
Method Summary
 
Methods inherited from class edu.columbia.cs.cg.prdualrank.pattern.extractor.SearchPatternExtractor
extractPatterns
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DocumentSearchPatternExtractor

public DocumentSearchPatternExtractor(int ngram,
                                      int numberOfPhrases)
Instantiates a new document search pattern extractor.

Parameters:
ngram - the maximum size of ngrams to be calculated in order to generate the search patterns.
numberOfPhrases - the maximum number of ngrams to be combined in the search pattern generation.