The present invention is directed to systems and methods for isolating
sentence boundaries between sentences in text. Sentences of the
normalized document feeds or source text are separated by determining
boundaries between individual sentences, by a Bayesian algorithm, that
has been seeded with rule frequencies, developed from a previous training
phase, that employed a text of sentences with marked boundaries between
the sentences.