An image processing apparatus includes a scanner for reading out
documents, a first extraction unit for extracting text contained in
document images, a second extraction unit for extracting at least one Web
address from the text, a fetch unit for obtaining at least one Web page
corresponding to the Web address, a first generation unit for generating
a concatenated image by concatenating the document images with the Web
page, and a second generation unit for generating an index indicating a
corresponding relationship between the document images and the Web page
in the concatenated image.