Re: pdfttotext in pure lua?
[
Date Prev][
Date Next][
Thread Prev][
Thread Next]
[
Date Index]
[
Thread Index]
- Subject: Re: pdfttotext in pure lua?
- From: Dirk Laurie <dirk.laurie@...>
- Date: 2016年10月23日 22:03:08 +0200
2016年10月23日 20:26 GMT+02:00 Dietmar Segbert <didi.segbert@arcor.de>:
> is there a module in pure lua, that converts a pdf-file to a text-file?
I once spent a great deal of time, without gettinga s far as I wanted
to, on a pure Lua program that produces Markdown starting from the
XML output given by "pdftohtml -xml".
Among the difficulties are: recognizing page headers and footers;
reassembling words hyphenated at the end of a line; handling
footnotes and citations; recognizing tabular input; etc.
All that makes me doubt very strongly that the desired module exists.