Analyze in detailed the PDF Document Structure

Published: 16th November 2011
Views: N/A
Ask About This Article Print Republish This Article


PDF document structure,

PDF document structure is the logic structure of PDF files contents, It reflects the files in the body between the indirect object hierarchy relationship. PDF document structure is a tree structure. The tree root node is the root of PDF file object. There are four root nodes under the tree: page (tree), tree mix bookmarks tree (tree), clues tree), tree (name tree (take) supply. if. in a page tree, all of the pages in the leaves of the tree object node, it allows the user to bookmark name according to the content of the document. Because the bookmark can have levels, can use to organize the directory, and so sometimes document and bookmark tree called directory trees. The tree will clue clues and articles from block (clues to a head) tree structure according to organize management.

PDF (Portable Document format) has been developed by Adobe company, it is a readable file format which no matter what type of the computer you use. A PDF document contains a PDF document head and supporting data. a PDF document contains one or more pages, each page contains any combination of text, graphics, images which have nothing to do with equipment and resolution, that’s called the page description. The document may also contain some information that exists only in electronic books, such as hypertext links, voice and animation, etc. In addition to PDF documents, also contains a few other information, such as: documents used in PDF version number of the regulation of the file, the structure of important position.

The characteristics of the PDF format



PDF is Posts crypt technology as the foundation of the document format, rather than page description language (des creation brief language), it has removed the uncertainty that may occur when Compiling. It can convert any page to PDF document that generated by all software. completely embedding the text, graphics, image links of the original document into PDF document. You can choose the text page to be embedded to PDF when conversion. even if a Chinese PDF documents are also available in Chinese font not installed the pure English system of the right, the real to open print text exchange network without borders. PDF also can be converted into containing the font EPS (Encapsulated Posts cript), and the conversion of the document after document can again group EPS version or other software, but for myself, when I want to alter, copy, and edit PDF files, I always use mac PDF editor to crack the PDF document production.

In order to better understand the PDF files, the PDF files can be divided into four parts. The first part is the object, the PDF is a group of basic object types. Most of these types of language use and Posts crypt corresponding data types. PDF support a lot of basic data types: Boolean type, number, string literal, name, arrays, dictionary and flow, another is empty object. In PDF file, often give some object gives a label for other object call, this label object is called indirect object.



The second part is the PDF document structure is PDF. PDF files, which determines how the object in PDF files are stored, how to access and how to update.

The third part is a PDF document architecture. PDF document structure specifies how to use the basic objects type to illustrate the PDF document components, including: pages, annotated, hypertext links, font, etc.

The fourth part is PDF page description. Page description refers to the page contains any combinations of the text, graphics which have nothing to do with equipment and resolution.

1. High compatibility

PDF is compatible document text image data, or independently in various computer platforms and applications of high compatibility document format, can use all sorts of platform PDF documents between generic Binary (Binary) or ASCII coding, realize the real cross-platform homework, it can convey to almost any platform.

3. Protective

PDF document allow to set password and many other protection methods, in order to prevent illegally use. For example using the password is allowed to read, print, copy, annotation or modification

2. page independence

Each page of the Posts crypt documents are interdependent, this means that must process all previous pages before jumping to a page. But PDF document format does not have the limitation. .

Can directly reading PDF files on any one page, need not consider other pages. Because of the PDF documents each page and other pages is irrelevant to single page for the unit.

Author works as the professional magazine writer and reviewer which has always been devoting himself to review and research the best technical news and digital products like PPT to DVD software, recently he announced that he will launch the PDF to ePub converter to the world, please keep reading!


This article is free for republishing
Source: http://linkjehion.articlealley.com/analyze-in-detailed-the-pdf-document-structure-2389293.html

Report this article Ask About This Article Print Republish This Article


Loading...
More to Explore
 

Ask a Professional Online Now
27 Experts are Online. Ask a Question, Get an Answer ASAP.
Type your question here...
Optional:
Select...