A method for determining logical components of a portable document format
(PDF) document is disclosed. The method includes separating the document
into a plurality of layers. A PDF document is created for each of the
plurality of layers. The method also includes determining a logical
structure for each layer. The logical structures of the plurality of
layers are combined to determine the logical components of the PDF
document.