Disclosed is a system and method for generating a spoken dialog service
from website data. Spoken dialog components typically include an
automatic speech recognition module, a language understanding module, a
dialog management module, a language generation module and a
test-to-speech module. These components are capable of being
automatically trained from processed website data. A website analyzer
converts a website into structured text data set and a structured task
knowledge base. The website analyzer further extracts linguistic items
from the website data. The dialog components are automatically trained
from the structured text data set, structured task knowledge base and
linguistic items.