Methods and apparatus, including computer program products, to process an
electronic document that includes a non-coded representation of
characters of text. Based on text coding information that identifies the
characters of the non-coded representation, a coded representation is
generated and associated with the non-coded representation. In the coded
representation, each identified character has a code value. Each code
value is associated with a glyph that has no semantic relation with the
identified character. A visual representation of the non-coded
representation can be displayed, and the coded representation can be used
to identify or search characters in the visual representation.