A multi-point video conferencing system and method are provided for managing the display
of images of multiple participating locations and the bandwidth allocated for a multi-point
video conference. The multi-point video conferencing system detects an audio signal associated
with an image not currently displayed to each participating location in the multi-point
video conference. When the number of active display windows is less than the maximum
number of active display windows, a first active display window is activated and
the image is displayed in the first active display window. When the number of
active display windows is not less than the maximum number, a second active display
window of the active display windows is inactivated, a third active display window
is activated, and the image is displayed in the third active display window.
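
The window-management logic described above can be sketched as follows. This is a minimal illustration only, not the claimed implementation: the class name `DisplayWindowManager`, the method `on_audio_detected`, and the participant identifiers are hypothetical, and the choice of which window to inactivate (here, the one whose participant was heard least recently) is an assumption, since the text does not say how the second active display window is selected.

```python
from collections import OrderedDict


class DisplayWindowManager:
    """Tracks which participant images occupy active display windows."""

    def __init__(self, max_windows: int):
        if max_windows < 1:
            raise ValueError("max_windows must be at least 1")
        self.max_windows = max_windows
        # Keys are participant ids, ordered from least to most recently heard.
        self._active = OrderedDict()

    def on_audio_detected(self, participant: str):
        """Handle an audio signal; return the inactivated participant, or None."""
        if participant in self._active:
            # Image is already displayed: only refresh its recency.
            self._active.move_to_end(participant)
            return None
        evicted = None
        if len(self._active) >= self.max_windows:
            # Not below the maximum: inactivate one window first.
            # Assumption: pick the participant heard least recently.
            evicted, _ = self._active.popitem(last=False)
        # Activate a window and display the new image in it.
        self._active[participant] = True
        return evicted

    def displayed(self):
        """Return the participants currently shown, oldest first."""
        return list(self._active)


manager = DisplayWindowManager(max_windows=2)
manager.on_audio_detected("site_a")   # below maximum: new window activated
manager.on_audio_detected("site_b")   # below maximum: new window activated
manager.on_audio_detected("site_c")   # at maximum: one window is inactivated first
```

Because only the number of active windows is capped, a cap of this kind also bounds how many video streams must be delivered at once, which is one plausible way the display policy could tie into bandwidth allocation.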