A system and method for switching between the command-mode of operation and the
text-mode of operation in voice-recognition systems. The system operates using
voice-recognition software and a mechanical switch which together enable the system
to differentiate between spoken commands that are used to operate the system and
spoken words that are inserted as text into a software application. The mechanical
switch can be added to a conventional computer mouse, hand-held or desk-mounted
microphone, personal digital assistant such as a Palm Pilot, or any other apparatus
either used with or having a microphone. Alternatively, the system and method can
be implemented using software alone without a mechanical switch, i.e., an electronic
switch, wherein a specialized word or words are used to command the system to switch
between the command-mode and the text-mode. Such an electronic switch can be implemented
in a mouse, microphone or electronic organizer in addition to or in lieu of any
mechanical switch such apparatus may contain.