Systems and methods are provided for performing focus detection, referential
ambiguity resolution, and mood classification in accordance with multi-modal input
data, under varying operating conditions, in order to provide an effective conversational
computing environment for one or more users.