pdftohtml
Yesterday I came across pdftohtml, a small command-line tool that converts pdf files to html. You can use the -c option to generate an impressively complex html version that looks almost exactly like the original pdf, but the default conversion with the -noframes option is more usefull for me. The result is a simple html file you can quickly convert to semantically correct markup by hand.