edu.columbia.cs.ref.tool.segmentator
Interface DocumentSegmentator
- All Known Implementing Classes:
- SimpleSegmentDocumentSegmentator
public interface DocumentSegmentator
The Interface DocumentSegmentator.
Document segmentators are used to divide the content of a document as sections.
A Section correspond to a natural division of the text (e.g., chapters, paragraphs)
- Since:
- 2011-09-27
- Version:
- 0.1
- Author:
- Pablo Barrio, Goncalo Simoes
Method Summary |
java.util.List<Segment> |
segmentate(java.lang.String documentContent)
This method receives as input the content of a document as a String and returns
a division of the document as a List of sections |
segmentate
java.util.List<Segment> segmentate(java.lang.String documentContent)
- This method receives as input the content of a document as a String and returns
a division of the document as a List of sections
- Parameters:
documentContent
- the content of a document
- Returns:
- division of the content into sections