A system and process for highlighting the current speaker on an on-going
basis in each frame of a low frame-rate video of an event having multiple
people in attendance, such as a video teleconference, are presented. In
general, this is accomplished by identifying the attendee who is currently
speaking, periodically and at a rate substantially faster than the video
frame rate, and for each frame of the video updating the frame to
highlight the attendee currently speaking. More particularly, an A/V
source provides video and audio data streams to a client computing
device, with current speaker data embedded into the audio stream via
audio watermarking techniques. The client device extracts the current
speaker data from the audio stream, and then renders and displays the
video while using the current speaker data to periodically update the
frame being displayed to highlight the current speaker.
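
The client-side flow described above can be illustrated with a minimal sketch. It is not the disclosed implementation: the abstract does not specify a watermarking scheme, so simple least-significant-bit (LSB) embedding is swapped in as a stand-in, and the names `embed_speaker_id`, `extract_speaker_id`, `display_updates`, `BITS_PER_ID`, and the one-byte speaker ID are all hypothetical. Audio is assumed to arrive as blocks of integer PCM samples, with several blocks per video frame.

```python
from typing import List, Tuple

# Assumption: a speaker ID fits in one byte and is carried once per audio block.
BITS_PER_ID = 8


def embed_speaker_id(samples: List[int], speaker_id: int) -> List[int]:
    """Source side: hide speaker_id in the LSBs of the first 8 samples."""
    if not 0 <= speaker_id < 2 ** BITS_PER_ID:
        raise ValueError("speaker_id does not fit in BITS_PER_ID bits")
    if len(samples) < BITS_PER_ID:
        raise ValueError("audio block too short to carry the ID")
    out = list(samples)
    for i in range(BITS_PER_ID):
        bit = (speaker_id >> i) & 1
        out[i] = (out[i] & ~1) | bit  # overwrite only the lowest bit
    return out


def extract_speaker_id(samples: List[int]) -> int:
    """Client side: read the ID back from the LSBs of the first 8 samples."""
    return sum((samples[i] & 1) << i for i in range(BITS_PER_ID))


def display_updates(
    audio_blocks: List[List[int]], blocks_per_frame: int
) -> List[Tuple[int, int]]:
    """Return one (frame_index, speaker_id) pair per audio block.

    Because several audio blocks arrive per video frame, the frame on
    screen is re-highlighted more often than new frames are delivered.
    """
    return [
        (i // blocks_per_frame, extract_speaker_id(block))
        for i, block in enumerate(audio_blocks)
    ]


# Usage: four audio blocks, two per video frame; the speaker changes
# from attendee 2 to attendee 3 while frame 1 is still on screen.
blocks = [embed_speaker_id([100] * 16, s) for s in (1, 1, 2, 3)]
updates = display_updates(blocks, blocks_per_frame=2)
```

In this sketch the highlight changes at the audio block rate rather than the video frame rate, which is the property the abstract relies on. A practical system would presumably need a watermark that survives lossy audio compression, which plain LSB embedding does not.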