A multimodal, multilanguage mobile device which can be employed to enhance
note taking and/or annotation of a document, and gaming. Input data types
such as optical character recognition (OCR), speech, handwriting, and
visual information (e.g., image and/or video), etc., can be fused to
generate rich documents with a multidimensional level of data to provide
an increased level of context over conventional documents. Such
architecture can be utilized by students for homework management, as well
as entertainment (e.g., gaming).