Some embodiments provide an architecture for establishing
multi-participant video conferences. This architecture has a central
distributor that receives video images from two or more participants.
From the received images, the central distributor generates composite
images that the central distributor transmits back to the participants.
Each composite image includes a set of sub-images, where each sub-image
belongs to one participant. In some embodiments, the central distributor
saves network bandwidth by removing each particular participant's image
from the composite image that the central distributor sends to the
particular participant. In some embodiments, the images received from each
participant are arranged in the composite image in a non-interleaved manner.
For instance, in some embodiments, the composite image includes at most
one sub-image for each participant, and no two sub-images are
interleaved.
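
The compositing behavior described above can be illustrated with a minimal sketch. It is not the implementation of any particular embodiment. It assumes that each participant's image is a small grid of pixel values and that sub-images are stacked vertically, which is only one possible non-interleaved layout. The names Frame, composite_for, and distribute are hypothetical and introduced only for illustration.

```python
from typing import Dict, List

# Assumption: a frame is a list of pixel rows; a stand-in for real video data.
Frame = List[List[int]]


def composite_for(recipient: str, frames: Dict[str, Frame]) -> Frame:
    """Build the composite image sent to one recipient.

    Each other participant contributes at most one sub-image, and the rows
    of a sub-image stay contiguous, so no two sub-images are interleaved.
    """
    composite: Frame = []
    for participant in sorted(frames):
        if participant == recipient:
            # Omit the recipient's own image to save network bandwidth.
            continue
        # Append the whole sub-image as one contiguous block of rows.
        composite.extend(list(row) for row in frames[participant])
    return composite


def distribute(frames: Dict[str, Frame]) -> Dict[str, Frame]:
    """Central distributor: produce one composite image per participant."""
    return {recipient: composite_for(recipient, frames) for recipient in frames}


if __name__ == "__main__":
    received = {
        "A": [[1, 1], [1, 1]],
        "B": [[2, 2], [2, 2]],
        "C": [[3, 3], [3, 3]],
    }
    for recipient, image in distribute(received).items():
        print(recipient, image)
```

In this sketch, the composite image for participant A contains only the sub-images of B and C, each as an unbroken block of rows. An actual distributor would operate on encoded video frames rather than lists of integers.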