An integrated and interactive real-time media creation system and method
are disclosed that visually link the order and execution of media events
to a scrolling on-screen text script that is spoken. As the user speaks,
the on-screen text script scrolls under user control. Adjacent to the
script text are visual images which represent associated media events.
The visual images maintain a constant relative spatial relationship with
the text. When the text reaches a predetermined region of the screen, the
user speaks it aloud and the associated media events are triggered at that
time. The invention allows a single user with a single personal
computer to easily self-produce a multimedia presentation of undetermined
and variable length, featuring media events appropriately inserted in
real-time without the assistance of a director or other operator.
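
The triggering behavior described above can be illustrated with a minimal sketch. It is not the disclosed implementation: the names ScriptLine, Prompter, scroll_by and the pixel values are hypothetical, chosen only to show how media events anchored to script lines might fire when their line scrolls into a fixed region of the screen.

```python
from dataclasses import dataclass, field


@dataclass
class ScriptLine:
    """One line of script text plus the media events displayed beside it."""
    text: str
    media_events: list = field(default_factory=list)  # hypothetical event names


class Prompter:
    """Hypothetical scrolling script; events fire when a line reaches trigger_y."""

    def __init__(self, lines, line_height=40, screen_height=400, trigger_y=100):
        self.lines = lines
        self.line_height = line_height      # assumed fixed spacing between lines
        self.screen_height = screen_height  # lines enter from the bottom edge
        self.trigger_y = trigger_y          # the "predetermined region" (assumed)
        self.scroll = 0                     # total pixels scrolled by the user
        self.next_index = 0                 # first line that has not yet fired

    def line_y(self, index):
        # Text and its event icons share this position, so their relative
        # spatial relationship stays constant while scrolling.
        return self.screen_height + index * self.line_height - self.scroll

    def scroll_by(self, pixels):
        """Advance the script under user control; return events triggered now."""
        self.scroll += pixels
        triggered = []
        while (self.next_index < len(self.lines)
               and self.line_y(self.next_index) <= self.trigger_y):
            triggered.extend(self.lines[self.next_index].media_events)
            self.next_index += 1  # each line fires at most once
        return triggered
```

In this sketch the user's scroll input is the only clock: a line's events fire the first time that line reaches the trigger position, so the presentation can run for any length and at any speaking pace without a separate operator.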