A method and apparatus for generating an index entry for a record in a
semi-structured database involves analysing each field to identify an
entry within each field and to identify a sequence of characters having a
format corresponding to a predetermined format. Thereafter, the method
and apparatus operate to generate an index entry for the identified
entry, and for at least one field, define any characters not identified
as an entry as a free text entry.