An XML (eXtensible Markup Language) schema to define a list of terms to be
recognized as semantic categories is disclosed. Using an instance of the
XML schema, a user may easily define terms to be recognized as semantic
categories without having to compile a complete recognizer plug-in. The
XML schema may be written using any editing tool or XML schema editing
tool to create an instance of the schema. An instance of the semantic
category list schema is also referred to as a semantic category list
file. Typically, the user completes all of the properties of the XML
schema and specifies the list of terms to recognize directly in the XML
schema. In addition to specifying terms directly in the XML schema, the
user may use a binary representation of the list of terms because of size
constraints. The user may also define actions in the semantic category
list file.