edu.columbia.cs.ref.tool.segmentator
Interface DocumentSegmentator

All Known Implementing Classes:
SimpleSegmentDocumentSegmentator

public interface DocumentSegmentator

The Interface DocumentSegmentator. Document segmentators are used to divide the content of a document as sections. A Section correspond to a natural division of the text (e.g., chapters, paragraphs)

Since:
2011-09-27
Version:
0.1
Author:
Pablo Barrio, Goncalo Simoes

Method Summary
 java.util.List<Segment> segmentate(java.lang.String documentContent)
          This method receives as input the content of a document as a String and returns a division of the document as a List of sections
 

Method Detail

segmentate

java.util.List<Segment> segmentate(java.lang.String documentContent)
This method receives as input the content of a document as a String and returns a division of the document as a List of sections

Parameters:
documentContent - the content of a document
Returns:
division of the content into sections