A method of document segmentation. Specifically, one embodiment of the present
invention discloses a method of document segmentation that computes a plurality
of projection profiles of pixel intensities, over a range of angles, for a
document containing a plurality of text lines. A plurality of slope values for a plurality
of discrete distances perpendicular to said range of angles is calculated for the
plurality of projection profiles. A set of maximum absolute slope values is selected
from the plurality of slope values. Text lines of a first type and a second type are
identified by setting a threshold slope value. Absolute slope values greater than
the threshold slope value indicate the plurality of text lines of said first type.
Absolute slope values less than the threshold slope value indicate the plurality
of text lines of said second type.
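
The steps summarized above can be illustrated with a minimal sketch. It is one possible reading of the abstract, not the claimed implementation: the function names, the use of NumPy, the binning of pixels by rounded perpendicular distance, the use of adjacent-bin differences as slope values, and the treatment of values exactly equal to the threshold are all assumptions made for illustration.

```python
import numpy as np


def projection_profile(image, angle_deg):
    """Sum pixel intensities along lines at angle_deg (assumed helper).

    Each pixel is assigned to a bin by its rounded perpendicular distance
    from a line through the image origin at the given angle.
    """
    theta = np.deg2rad(angle_deg)
    ys, xs = np.indices(image.shape)
    dist = ys * np.cos(theta) - xs * np.sin(theta)
    # Shift so the smallest distance maps to bin 0 (bincount needs ints >= 0).
    bins = np.round(dist - dist.min()).astype(int)
    return np.bincount(bins.ravel(), weights=image.ravel().astype(float))


def slope_values(profile):
    """Slope of the profile between adjacent discrete distances."""
    return np.diff(profile)


def max_abs_slope_per_angle(image, angles_deg):
    """Largest absolute slope of the projection profile at each angle."""
    return np.array([
        np.abs(slope_values(projection_profile(image, a))).max()
        for a in angles_deg
    ])


def classify_by_threshold(abs_slopes, threshold):
    """Label values above the threshold 'first', all others 'second'.

    The abstract does not say how a value equal to the threshold is
    treated; placing it in the second type is an assumption.
    """
    abs_slopes = np.asarray(abs_slopes, dtype=float)
    return np.where(abs_slopes > threshold, "first", "second")


# Synthetic page: two horizontal bands of "ink" on a blank background.
page = np.zeros((40, 60))
page[10:13, :] = 1.0
page[25:28, :] = 1.0

angles = [-5.0, 0.0, 5.0]
peaks = max_abs_slope_per_angle(page, angles)
labels = classify_by_threshold(peaks, threshold=30.0)
```

On this synthetic page the profile taken at 0 degrees has the sharpest edges, because each band of ink falls into only a few distance bins, while at the tilted angles the same ink is smeared across several bins and the slopes flatten. The threshold of 30.0 is an arbitrary value chosen for the example.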