Hebrew search in PDFs is backwards?

Hebrew search in PDFs is backwards?

Matitiahu Allouche matial at il.ibm.com
Sun Jan 17 23:41:34 IST 2010


PDF's objective is to reflect the exact appearance of text.  For Hebrew, 
it means that the text is stored in visual order.  If your PDF viewer 
accepts user input in logical order (which is the case in Windows and 
Linux), it should transform search arguments (captured from a user dialog) 
from logical to visual order before performing the search.

Shalom (Regards),  Mati
           Bidi Architect
           Globalization Center Of Competency - Bidirectional Scripts
           IBM Israel
           Phone: +972 2 5888802    Fax: +972 2 5870333    Mobile: +972 52 
2554160




From:
Gadi Cohen <dragon at wastelands.net>
To:
linux-il at cs.huji.ac.il
Date:
17/01/2010 09:02 م
Subject:
Hebrew search in PDFs is backwards?
Sent by:
linux-il-bounces at cs.huji.ac.il



Hi

I've exported a PDF from OpenOffice 3.1 (with tags).
I can search for Hebrew text in Evince or Okular, but it searches 
backwards.
E.g.  If I have the words SHIR and RISHON, if I start typing "RI" it
will match the end of SHIR.

I have no idea if this is a problem with OpenOffice, Evince/Okular or
just the PDF standard.
Any pointers?

Thanks
Gadi

P.S.  Maybe someone can save me some more headache.
I see that (finally!) a PDF export won't randomly insert Hindi
characters in ooMath objects.
This was committed to CWS ooo32gsl09 and is targetted for 3.2.

I also see that 3.2rc2 is based on "OOO320_m9".

Can I hope that since they both start with "ooo32" and end with "9" it
will have the fix? :)

I find the versioning scheme quite complicated.

More on the Hindi bug here:

http://qa.openoffice.org/issues/show_bug.cgi?id=87669

-- 
Gadi Cohen aka Kinslayer <dragon at wastelands.net> www.wastelands.net
Freelance admin/coding/design HABONIM DROR linux/fantasy enthusiast
KeyID 0x93F26EF5: 256A 1FC7 AA2B 6A8F 1D9B 6A5A 4403 F34B 93F2 6EF5


_______________________________________________
Linux-il mailing list
Linux-il at cs.huji.ac.il
http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.huji.ac.il/pipermail/linux-il/attachments/20100117/a61edbce/attachment.html>


More information about the Linux-il mailing list