[Israel.pm] pdf2txt ps2txt

Shlomo Yona shlomo at cs.haifa.ac.il
Thu Nov 4 01:52:43 PST 2004

On Thu, 4 Nov 2004, Offer Kaye wrote:

> 1. Have you tried to look at the output of "strings"? Depending on
> your locale and terminal abilities, it might actually generate
> something worth looking at :-)

Naa.. pure junk.

> 2. There is this project:
> http://pdftohtml.sourceforge.net/
> It might not preserve the nikud, but since it converts to XML (or
> HTML), it might work, at least partially.

The purpose of my efforts is to get the text with the

I'm looking into this now.
Hopefully, it will be useful.

> 3. ps2html might work:
> http://www.csd.uch.gr/~nikop/thesis.html

I'll check this out too.

> 4. Scribus:
> http://www.scribus.org.uk/
> is an Open Source Desktop Publishing system for Linux. I included it
> in the list of possible tools because the site says that "Other
> features include PDF Import, EPS import/export, Unicode text including
> right to left scripts such as Arabic and Hebrew."
> So it might be useful to you.

Thanks for all the information. I'll check this one too.

Shlomo Yona
shlomo at cs.haifa.ac.il
