Package org.apache.pdfbox.pdfparser
package org.apache.pdfbox.pdfparser
The pdfparser package contains classes to parse PDF documents and objects within the document.
-
ClassDescriptionThis class is used to contain parsing logic that will be used by both the PDFParser and the COSStreamParser.PDF-Parser which first reads startxref and xref tables in order to know valid objects and parse only these objects.This will parse a PDF 1.5 object stream and extract all of the objects from the stream.This will parse a PDF byte stream and extract operands and such.This will parse a PDF 1.5 (or better) Xref stream and extract the xref information from the stream.This class will collect all XRef/trailer objects and creates correct xref/trailer information after all objects are read using startxref and 'Prev' information (unused XRef/trailer objects are discarded).The XRefType of a trailer.