Hebrew search in PDFs is backwards?
Matitiahu Allouche
matial at il.ibm.com
Sun Jan 17 23:41:34 IST 2010
The purpose of PDF is to reproduce the exact appearance of text. For Hebrew,
this means that the text is stored in visual order. If your PDF viewer
accepts user input in logical order (which is the case on Windows and
Linux), it should convert the search string (captured from the search dialog)
from logical to visual order before performing the search.
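
To illustrate the conversion described above, here is a minimal Python sketch. It is not what Evince or Okular actually do, and it is not a full implementation of the Unicode Bidirectional Algorithm: it assumes a short search term in a right-to-left context, reverses it, and then restores the internal order of embedded left-to-right runs (Latin letters and digits). The function name logical_to_visual is made up for this example, and mirrored characters such as brackets are not handled.

```python
import unicodedata


def _is_rtl(ch):
    # Hebrew letters report "R"; Arabic letters report "AL".
    return unicodedata.bidirectional(ch) in ("R", "AL")


def _is_ltr_run_char(ch):
    # Latin letters ("L") and digits ("EN", "AN") keep their internal
    # left-to-right order even inside right-to-left text.
    return unicodedata.bidirectional(ch) in ("L", "EN", "AN")


def logical_to_visual(query):
    """Naively reorder a short RTL search term from logical to visual order.

    Hypothetical helper for illustration only; a real viewer would need
    the full Unicode Bidirectional Algorithm (UAX #9).
    """
    if not any(_is_rtl(ch) for ch in query):
        return query  # pure LTR input: nothing to reorder

    chars = list(reversed(query))  # reverse the whole term
    out = []
    i = 0
    while i < len(chars):
        if _is_ltr_run_char(chars[i]):
            # Find the end of this LTR run and flip it back.
            j = i
            while j < len(chars) and _is_ltr_run_char(chars[j]):
                j += 1
            out.extend(reversed(chars[i:j]))
            i = j
        else:
            out.append(chars[i])
            i += 1
    return "".join(out)
```

A viewer following this approach would pass the typed term through such a function and then run an ordinary substring search against the text extracted from the PDF, assuming that text really is in visual order.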
Shalom (Regards), Mati
Bidi Architect
Globalization Center Of Competency - Bidirectional Scripts
IBM Israel
Phone: +972 2 5888802 Fax: +972 2 5870333 Mobile: +972 52 2554160
From: Gadi Cohen <dragon at wastelands.net>
To: linux-il at cs.huji.ac.il
Date: 17/01/2010 09:02 PM
Subject: Hebrew search in PDFs is backwards?
Sent by: linux-il-bounces at cs.huji.ac.il
Hi
I've exported a PDF from OpenOffice 3.1 (with tags).
I can search for Hebrew text in Evince or Okular, but it searches
backwards.
E.g., if I have the words SHIR and RISHON and I start typing "RI", it
will match the end of SHIR.
I have no idea if this is a problem with OpenOffice, Evince/Okular or
just the PDF standard.
Any pointers?
Thanks
Gadi
P.S. Maybe someone can save me some more headache.
I see that (finally!) a PDF export won't randomly insert Hindi
characters in ooMath objects.
This was committed to CWS ooo32gsl09 and is targeted for 3.2.
I also see that 3.2rc2 is based on "OOO320_m9".
Can I hope that since they both start with "ooo32" and end with "9" it
will have the fix? :)
I find the versioning scheme quite complicated.
More on the Hindi bug here:
http://qa.openoffice.org/issues/show_bug.cgi?id=87669
--
Gadi Cohen aka Kinslayer <dragon at wastelands.net> www.wastelands.net
Freelance admin/coding/design HABONIM DROR linux/fantasy enthusiast
KeyID 0x93F26EF5: 256A 1FC7 AA2B 6A8F 1D9B 6A5A 4403 F34B 93F2 6EF5