A new application programming language is provided which is based on user
interaction with any device which a user is employing to access any type
of information. The new language is referred to herein as a
"Conversational Markup Language (CML). In a preferred embodiment, CML is
a high level XML based language for representing "dialogs" or
"conversations" the user will have with any given computing device. For
example, interaction may comprise, but is not limited to, visual based
(text and graphical) user interaction and speech based user interaction.
Such a language allows application authors to program applications using
interaction-based elements referred to herein as "conversational
gestures." The present invention also provides for various embodiments of
a multimodal browser capable of supporting the features of CML in
accordance with various modality specific representations, e.g., HTML
based graphical user interface (GUI) browser, VoiceXML based speech
browser, etc.