A method of decoding, composing and rendering a scene. First information is obtained,
the first information including a part of an MPEG-4 BIFS scene description stream
and at least one coded MPEG-4 media stream. The first information is decoded by
invoking a BIFS scene decoder and one or more specific media decoders that are
required by the scene. Second information is obtained, the second information including
a second part of a BIFS scene description stream that contains a reference to an
external application. The second information is decoded by invoking the BIFS scene
decoder and an external application decoder. An integrated scene is composed, the
integrated scene including one or more decoded MPEG-4 media objects and one or
more external application objects specified in the decoded scene description streams.
The composed integrated scene is rendered on a display.
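
The flow described above can be illustrated with a minimal sketch. It is a toy model under stated assumptions, not an MPEG-4 implementation: the names SceneNode, decode_bifs, MEDIA_DECODERS, decode_external_app, compose and render are all hypothetical, a scene description part is represented as a plain list of (kind, reference) pairs rather than a binary BIFS stream, and "decoding" simply wraps a payload in a label.

```python
from dataclasses import dataclass


@dataclass
class SceneNode:
    kind: str  # "media" or "external_app" (hypothetical node kinds)
    ref: str   # media stream id, or a reference to an external application


def decode_bifs(part):
    """Stand-in for the BIFS scene decoder: turns one part of a scene
    description into a list of scene nodes."""
    return [SceneNode(kind, ref) for kind, ref in part]


# Stand-ins for the specific media decoders required by the scene.
MEDIA_DECODERS = {
    "video": lambda payload: f"video[{payload}]",
    "audio": lambda payload: f"audio[{payload}]",
}


def decode_external_app(ref):
    """Stand-in for the external application decoder."""
    return f"app[{ref}]"


def compose(nodes, media_streams):
    """Build the integrated scene: one decoded object per scene node."""
    objects = []
    for node in nodes:
        if node.kind == "media":
            codec, payload = media_streams[node.ref]
            objects.append(MEDIA_DECODERS[codec](payload))
        elif node.kind == "external_app":
            objects.append(decode_external_app(node.ref))
        else:
            raise ValueError(f"unknown node kind: {node.kind}")
    return objects


def render(objects):
    """Stand-in for rendering on a display: returns a text 'frame'."""
    return " | ".join(objects)


# First information: part of a scene description plus coded media streams.
first_part = [("media", "v1"), ("media", "a1")]
media_streams = {"v1": ("video", "frame0"), "a1": ("audio", "chunk0")}

# Second information: a second part that references an external application.
second_part = [("external_app", "viewer")]

scene_nodes = decode_bifs(first_part) + decode_bifs(second_part)
integrated_scene = compose(scene_nodes, media_streams)
frame = render(integrated_scene)
print(frame)
```

In this sketch both parts of the scene description pass through the same scene decoder, and the choice between a media decoder and the external application decoder is made per node when the integrated scene is composed.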