A system and a method for publishing a newspaper page or other data
through a Web page, to make it available more easily through a network
such as the Internet. The data is automatically converted by first
rendering the newspaper page into a digital format; then converting the
digital format to a basic internal publishing format; and then publishing
the data in a number of different possible publishing formats, including
a mark-up language document such as a Web page. Features include
arranging content according to relationships within the information,
which are determined by analyzing the page as distinct objects. Object
types include titles, articles, and pictures. Objects may be categorized,
and the objects in each category are preferably compressed using a
different image format.
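
The conversion flow described above can be illustrated with a minimal sketch. It assumes the page has already been rendered and split into objects. The names PageObject, FORMAT_BY_CATEGORY, categorize, to_internal and publish_html are hypothetical, as are the two categories and the GIF/JPEG assignment; the description only states that each category preferably uses a different image format.

```python
from dataclasses import dataclass
from html import escape
from typing import Optional

# Assumed mapping from object category to image format (illustrative only).
FORMAT_BY_CATEGORY = {"text": "gif", "photo": "jpeg"}


@dataclass
class PageObject:
    kind: str                     # "title", "article", or "picture"
    name: str                     # identifier used to build the image file name
    parent: Optional[str] = None  # name of the object this one relates to


def categorize(obj):
    """Assign a category; here pictures are 'photo', everything else 'text'."""
    return "photo" if obj.kind == "picture" else "text"


def to_internal(objects):
    """Build a simple internal record per object, choosing an image format
    from the object's category."""
    records = []
    for obj in objects:
        category = categorize(obj)
        records.append({
            "kind": obj.kind,
            "name": obj.name,
            "parent": obj.parent,
            "category": category,
            "image": f"{obj.name}.{FORMAT_BY_CATEGORY[category]}",
        })
    return records


def img_tag(rec):
    """Emit an <img> element for one internal record."""
    return f'<img src="{escape(rec["image"])}" alt="{escape(rec["name"])}">'


def publish_html(records):
    """Publish the internal records as a Web page, placing each top-level
    object together with the objects related to it."""
    lines = ["<html><body>"]
    for rec in records:
        if rec["parent"] is not None:
            continue  # related objects are emitted under their parent
        lines.append(f'<div class="{escape(rec["kind"])}">')
        lines.append("  " + img_tag(rec))
        for child in records:
            if child["parent"] == rec["name"]:
                lines.append("  " + img_tag(child))
        lines.append("</div>")
    lines.append("</body></html>")
    return "\n".join(lines)


page = [
    PageObject("article", "story1"),
    PageObject("title", "headline1", parent="story1"),
    PageObject("picture", "photo1", parent="story1"),
]
records = to_internal(page)
html_page = publish_html(records)
```

Other publishing formats could be produced from the same internal records by adding further publish functions alongside publish_html, which is the reason for keeping the intermediate representation separate from the output.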