A machine translation method includes receiving source text in a first
language and retrieving text fragments in a target language from a
library of bi-fragments to generate a target hypothesis. Each bi-fragment
includes a text fragment from the first language and a corresponding text
fragment from the second language. Some of the bi-fragments are modeled
as elastic bi-fragments where a gap between words is able to assume a
variable size corresponding to a number of other words to occupy the gap.
The target hypothesis is evaluated with a translation scoring function
which scores the target hypothesis according to a plurality of feature
functions, at least one of the feature functions comprising a gap size
scoring feature which favors hypotheses with statistically more probable
gap sizes over hypotheses with statically less probable gap sizes.