edu.columbia.cs.cg.prdualrank.pattern.extractor.impl
Class DocumentSearchPatternExtractor<T extends Document>
java.lang.Object
edu.columbia.cs.cg.prdualrank.pattern.extractor.SearchPatternExtractor<Document>
edu.columbia.cs.cg.prdualrank.pattern.extractor.impl.DocumentSearchPatternExtractor<T>
- All Implemented Interfaces:
- PatternExtractor<Document>
public class DocumentSearchPatternExtractor<T extends Document>
- extends SearchPatternExtractor<Document>
This class is used for our implementation of:
"Searching Patterns for Relation Extraction over the Web: Rediscovering the Pattern-Relation Duality". Y. Fang and K. C.-C. Chang. In WSDM, pages 825-834, 2011.
For further information, see the WSDM 2011 Conference Website.
Description
This class defines how search patterns are generated, as stated in Algorithm PatternSearch(To,S,E) in Figure 9 of Section 5 and in Definition 1 of Section 3.1.
For a more efficient implementation (different from what is described in the paper, but discussed with the authors), refer to WindowedSearchPatternExtractor.
- Since:
- 2011-10-07
- Version:
- 0.1
- Author:
- Pablo Barrio, Goncalo Simoes
- See Also:
- WSDM 2011 Conference Website, WindowedSearchPatternExtractor
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
DocumentSearchPatternExtractor
public DocumentSearchPatternExtractor(int ngram,
int numberOfPhrases)
- Instantiates a new document search pattern extractor.
- Parameters:
ngram
- the maximum size of ngrams to be calculated in order to generate the search patterns.
numberOfPhrases
- the maximum number of ngrams to be combined in the search pattern generation.
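To illustrate how the two constructor parameters might interact, here is a minimal, self-contained sketch. It is not this class's actual implementation: the class `SearchPatternSketch` and its methods `ngrams` and `patterns` are hypothetical names, and the sketch assumes that a search pattern is an unordered combination of n-gram phrases, which is a reading of the parameter descriptions above and not a confirmed detail of the library or the paper.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of the documented package.
final class SearchPatternSketch {

    // Collects every contiguous n-gram of size 1..maxNgram from the tokens
    // (the role assumed here for the "ngram" parameter).
    static List<String> ngrams(List<String> tokens, int maxNgram) {
        List<String> result = new ArrayList<>();
        for (int size = 1; size <= maxNgram; size++) {
            for (int start = 0; start + size <= tokens.size(); start++) {
                result.add(String.join(" ", tokens.subList(start, start + size)));
            }
        }
        return result;
    }

    // Builds every combination of 1..maxPhrases phrases
    // (the role assumed here for the "numberOfPhrases" parameter).
    static List<List<String>> patterns(List<String> phrases, int maxPhrases) {
        List<List<String>> result = new ArrayList<>();
        combine(phrases, 0, maxPhrases, new ArrayList<>(), result);
        return result;
    }

    // Recursive helper: extends the current combination with phrases at
    // index >= from, until no more phrases may be added.
    private static void combine(List<String> phrases, int from, int remaining,
                                List<String> current, List<List<String>> out) {
        if (!current.isEmpty()) {
            out.add(new ArrayList<>(current));
        }
        if (remaining == 0) {
            return;
        }
        for (int i = from; i < phrases.size(); i++) {
            current.add(phrases.get(i));
            combine(phrases, i + 1, remaining - 1, current, out);
            current.remove(current.size() - 1);
        }
    }

    public static void main(String[] args) {
        List<String> phrases = ngrams(List.of("was", "born", "in"), 2);
        System.out.println(phrases);
        System.out.println(patterns(phrases, 2).size());
    }
}
```

Under these assumptions, the number of candidate patterns grows combinatorially with both parameters, which may be one reason an alternative such as WindowedSearchPatternExtractor is offered.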