A system and method that integrates automated voice recognition technology
and speech-to-text technology with automated translation and closed
captioning technology to provide translations of "live" or "real-time"
television content is disclosed. It converts speech to text, translates
the converted text to other languages, and provides captions through a
single device that may be installed at the broadcast facility. The device
accepts broadcast quality audio, recognizes the speaker's voice, converts
the audio to text, translates the text, processes the text for multiple
caption outputs, and then sends multiple text streams out to caption
encoders and/or other devices in the proper format. Because it automates
the process, it dramatically reduces the cost and time traditionally
required to package television programs for broadcast into foreign or
multi-language U.S. markets.