Ubuntu includes a nice little utility, pdftotext, which can, indeed,
change a pdf to text. One rather long pdf went very nicely into text,
though some of the spacing leaves a great deal to be desired.
On thew other hand, attempting to do the same thing with a text done in
the Cyrillic alphabet produces a file containing nothing but the
punctuation. it's necessary to use the -enc option to turn it to
cyrillic, but the result I get is this:
/Desktop/Zash$ pdftotext -enc cyrillic ./zash1.pdf
Error: Couldn't find unicodeMap file for the 'cyrillic' encoding
Error: Couldn't get text encoding
Anyone know what I'm doing wrong?
Anyone care?(:))
JimW
Received on Tue Jun 27 23:42:07 2006
This archive was generated by hypermail 2.1.8 : Fri Sep 08 2006 - 23:26:38 CST