A document processing apparatus is disclosed that has a capability of
outputting video data related to document data. When electronic document
data including information (video tag) specifying video data is input,
video data related to that electronic document data is detected. The
video data related to the electronic document data is output in
synchronization with or independently of the output of the electronic
document data thereby presenting to a user not only the document but also
the video data related to the document.