pdftohtml

Yesterday I came across pdftohtml, a small command-line tool that converts pdf files to html. You can use the -c option to generate an impressively complex html version that looks almost exactly like the original pdf, but the default conversion with the -noframes option is more usefull for me. The result is a simple html file you can quickly convert to semantically correct markup by hand.

Note: If you were looking at this page in a more modern browser like Mozilla Firefox or Safari, it would look and work better and faster.
Find out more…