Apparatus and methods for efficiently and flexibly providing caption data
(e.g., closed captioning) to subscribers of a content-based network, such
as Internet protocol television (IPTV) subscribers. In one
exemplary embodiment, the apparatus includes a server that extracts and
encapsulates caption data in real time and transports it to client devices
over the network, where one or more applications running on the client
devices decode and display the caption data consistent with the
multimedia (audio/video) content with
which it is associated. In one variant, instant messaging (IM)
infrastructure is used to authenticate clients and receive and display
the caption data via a separate transport process. Server-side and
client-side apparatus adapted for caption data receipt, decoding, and
display are also
disclosed.
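
The extract, encapsulate, transport, and display flow summarized above can be illustrated with a minimal sketch. It is not the disclosed implementation: the CaptionPacket type, the JSON/base64 wire format, the program_id and pts_ms fields, and the captions_due helper are all assumptions introduced here for illustration only. The sketch shows a server wrapping already-extracted caption bytes with a presentation time, and a client unwrapping them and selecting the captions that are due at the current playback position.

```python
import base64
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class CaptionPacket:
    """Hypothetical unit of caption data sent from server to client."""
    program_id: str   # assumed identifier of the associated program
    pts_ms: int       # assumed presentation time of the caption, in ms
    payload: bytes    # raw caption bytes already extracted from the stream


def encapsulate(packet: CaptionPacket) -> bytes:
    """Server side: wrap extracted caption bytes for transport."""
    message = {
        "program_id": packet.program_id,
        "pts_ms": packet.pts_ms,
        "payload": base64.b64encode(packet.payload).decode("ascii"),
    }
    return json.dumps(message).encode("utf-8")


def decapsulate(wire: bytes) -> CaptionPacket:
    """Client side: recover the caption packet from the wire format."""
    message = json.loads(wire.decode("utf-8"))
    return CaptionPacket(
        program_id=message["program_id"],
        pts_ms=message["pts_ms"],
        payload=base64.b64decode(message["payload"]),
    )


def captions_due(packets, playback_ms):
    """Return packets whose presentation time has been reached, oldest first."""
    due = [p for p in packets if p.pts_ms <= playback_ms]
    return sorted(due, key=lambda p: p.pts_ms)


if __name__ == "__main__":
    sent = [
        CaptionPacket("prog-1", 2000, b"second line"),
        CaptionPacket("prog-1", 1000, b"first line"),
        CaptionPacket("prog-1", 9000, b"later line"),
    ]
    # Simulate transport: encapsulate on the server, decapsulate on the client.
    received = [decapsulate(encapsulate(p)) for p in sent]
    for packet in captions_due(received, playback_ms=2500):
        print(packet.pts_ms, packet.payload.decode("ascii"))
```

Carrying a presentation time with each packet is one way to keep captions aligned with the audio/video content when the caption data travels over a transport separate from the content itself; the abstract does not specify how that alignment is achieved.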