PDF Analysis Tools
document analysis, PDF
A set of utilities for advanced PDF document analysis. Existing PDF-to-HTML converters aim for a DOM or HTML representation that looks as close as possible to the original document. The goal of the PDF Analysis Tools is different: to produce an output document that has the same logical structure as the source. To do this, the tools implement several algorithms that detect common graphical patterns in the source PDF document and represent them with standard HTML elements and CSS constructs. The resulting document may not display exactly like the source PDF, but it is better suited to further analysis and editing.
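To make the idea of mapping a graphical pattern to an HTML element concrete, here is a minimal sketch in Python. It is not the algorithm used by the PDF Analysis Tools; the `Line` type, the `BULLETS` set, the `tolerance` parameter, and the `lines_to_html` function are all hypothetical names chosen for illustration. It assumes the text lines have already been extracted from the PDF with their left-edge positions, and it treats consecutive lines that start with a bullet glyph at the same indent as one `<ul>` list.

```python
from dataclasses import dataclass
from html import escape

# Hypothetical set of glyphs treated as list bullets in this sketch.
BULLETS = ("•", "-", "*")


@dataclass
class Line:
    x: float   # left edge of the text line, in page units (assumed input)
    text: str  # text content of the line


def lines_to_html(lines, tolerance=1.0):
    """Map bullet-prefixed lines with a shared indent to <ul>/<li>, others to <p>."""
    out = []
    list_x = None  # indent of the currently open list, or None if no list is open
    for line in lines:
        text = line.text.strip()
        is_item = len(text) > 1 and text[0] in BULLETS
        # Close the open list when the pattern (bullet + same indent) breaks.
        if list_x is not None and not (is_item and abs(line.x - list_x) <= tolerance):
            out.append("</ul>")
            list_x = None
        if is_item:
            if list_x is None:
                out.append("<ul>")
                list_x = line.x
            # Drop the bullet glyph; the <li> element now carries that meaning.
            out.append(f"<li>{escape(text[1:].strip())}</li>")
        else:
            out.append(f"<p>{escape(text)}</p>")
    if list_x is not None:
        out.append("</ul>")
    return "\n".join(out)


if __name__ == "__main__":
    page = [
        Line(72, "Features"),
        Line(90, "• Fast"),
        Line(90, "• Small"),
        Line(72, "Done"),
    ]
    print(lines_to_html(page))
```

The output keeps the logical structure (a paragraph, a two-item list, a paragraph) and discards the exact coordinates, which is the trade-off described above: less visual fidelity in exchange for markup that is easier to analyze and edit.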
Free software, released under the terms of the GNU GPL.