pdf2xml is a converter based on the Xpdf library (http://www.foolabs.com/xpdf/home.html). It converts the information contained in a PDF file into XML. First, install Xpdf and libxml2 (see the documentation).
Hervé Déjean
Xerox Research Centre Europe
http://www.xrce.xerox.com/About-XRCE/People/Herve-Dejean
Features
- PDF to XML conversion
- text extraction
- vector graphics instruction extraction
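Because the converter emits plain XML rather than a styled page, the extracted text is usually easier to read programmatically than in a browser. The following is a minimal sketch of pulling the text back out of such a file with Python's standard library. The sample document and its element names (`DOCUMENT`, `PAGE`, `TEXT`, `TOKEN`) are hypothetical stand-ins for illustration, not a description of the actual output schema; the function itself does not depend on any particular element names.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample standing in for converter output.
# The real element and attribute names may differ.
SAMPLE = """<DOCUMENT>
  <PAGE number="1">
    <TEXT><TOKEN>Hello</TOKEN><TOKEN>world</TOKEN></TEXT>
  </PAGE>
</DOCUMENT>"""


def extract_text(xml_string):
    """Return all non-empty text fragments in document order."""
    root = ET.fromstring(xml_string)
    # itertext() walks every element's text and tail in document order;
    # whitespace-only fragments (indentation) are discarded.
    return [t.strip() for t in root.itertext() if t.strip()]


if __name__ == "__main__":
    print(" ".join(extract_text(SAMPLE)))
```

For a file on disk, `ET.parse(path).getroot().itertext()` could be used in the same way.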
License
GNU General Public License version 2.0 (GPLv2)
User Reviews
- The link to the SVN code is not working. I want to integrate this functionality into my Java project; please provide a valid link.
- Thanks, very good project!
- Used on the IRS f1040.pdf to produce f1040.xml; however, when viewed in Firefox, Firefox indicated it had no styling, so it didn't look anything like the PDF file when viewed in Adobe Reader.
- Very useful, a must-have program. Great job!
- Simple, no fuss. Works for all types.