I do it using Poppler in Cygwin:
pdftohtml -noframes input.pdf && sed -i 's| | |g' ${_%.pdf}.html