Just spent a horrible long time trying to do pdftohtml. No matter what
I did, no matter how many pages I limited it to, all I ever got was a
very beautiful version of the Forward to the piece.
JimW
robm wrote:
> Pdf2html might help, I used it to automate MSDS web pages (strangely
> the chemical suppliers logo always got inverted...weird)
>
>
> jimw wagner wrote:
>
>> Lance Levsen wrote:
>>
>>> jimw wagner wrote:
>>>
>>>
>>>> Nathan Koch wrote:
>>>>
>>>>> i think its all your text.
>>>>>
>>>
>>>
>>>
>>>> Perhaps I didn't make it clear. I could blow up the pdfs quite
>>>> nicely,
>>>> but it's hard to put my computer into my backpack to carry with me
>>>> when
>>>> I go out.
>>>>
>>>> I did notice that I wrote " What I'i is if" ; I don't know how that
>>>> made it past the spell checker. For those who can't read my mind,
>>>> that
>>>> should be "What I'd like is if".
>>>>
>>>> JimW
>>>>
>>>
>>> So you're just looking for the raw text? Do you have xpdf installed? If
>>> so then you probably have the 'pdftotext' binary installed too.
>>>
>>> The downside is that special chars aren't converted all that nicely.
>>>
>>> Cheers,
>>> lance
>>>
>>>
>> The real problem is that this is in two columns, and
>> pdftoanythingelse doesn't do columns. (Unless I've missed something
>> important somewhere). All my past attempts have resulted in the text
>> being put together all jumbled, one column, with a line from column 1
>> followed by a line from column 2 and so on.
>>
>> If I could change this from pdf to columnar text, I'd be quite
>> willing to go through and insert all the special characters.
>>
>> JimW
>>
>
>
> --
> "Those who do not understand Unix are condemned to reinvent it, poorly."
> (Henry Spencer, 1987)
>
> To unsubscribe, send a message with the word "unsubscribe" (without the
> quotes) in the body to linux-request@slg.org
> Archives are at http://list.slg.org/
>
Received on Tue Feb 13 22:18:17 2007
This archive was generated by hypermail 2.1.8 : Tue Feb 13 2007 - 22:18:27 CST