<div dir="ltr">Alan<br><br>Thanks for clarifying.<br>It makes sense that the problem with reading RTF files produced by the Gilboa agent system was more basic and not related to Hebrew, since other Hebrew RTF files are readable in OO.<br>
<br>Seems my solution to use Office 2K or Windows Reader was reasonable after all.<br>Danny<br><br><div class="gmail_quote">On Mon, Jul 13, 2009 at 2:38 PM, Alan Yaniger <span dir="ltr"><<a href="mailto:alan@tkos.co.il">alan@tkos.co.il</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">The problem with OO's RTF import of text boxes is a known bug:<br>
<br>
see <a href="http://qa.openoffice.org/issues/show_bug.cgi?id=95665" target="_blank">http://qa.openoffice.org/issues/show_bug.cgi?id=95665</a><br>
<br>
<br>
This is part of a general problem with OOo's import of RTF drawing objects, and not restricted to Hebrew.<br><font color="#888888">
<br>
<br>
Alan</font><div><div></div><div class="h5"><br>
<br>
<br>
Ehud Karni wrote:<br>
<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
On Fri, 10 Jul 2009 09:38:45 Micha Silver wrote:<br>
<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Ehud Karni wrote:<br>
<br>
<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
I use `catdoc', which works quite well (for both *doc and *rtf).<br>
`catdoc' is available as a package for Centos and Debian.<br>
<br>
</blockquote>
Thanks for the tip, but I can't get any sensible output. I ran:<br>
<br>
catdoc -a -d8859-8 invoice150711.rtf | fribidi --charset ISO8859-8 --width=80 --rtl<br>
and I get 188 empty lines. :-(<br>
<br>
</blockquote>
<br>
After Micha sent me his RTF file, I found out that it contains<br>
"text boxes", not plain text.<br>
<br>
Using OpenOffice does not help; it shows an (almost) empty page.<br>
<br>
Filter your RTF with the sed command below; it will drop the boxes.<br>
Then run catdoc as above or use OpenOffice (I used `ooviewdoc'); both<br>
will show you the data (ooviewdoc preserves more of the original layout).<br>
BTW, when viewed with M$word, the filtered file shows empty boxes.<br>
<br>
Ehud.<br>
<br>
<br>
sed -e "s/{...do.dobxpage.dobypara.dodhgt8192.dptxbx.dptxbxmar0{/{/g" \<br>
-e "s/}.dpx[0-9]*.dpy[0-9]*.dpxsize[0-9]*.dpysize[0-9]*.dplinehollow0}/}/g"<br>
<br>
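As a rough sketch, the same filter can be wrapped in a shell function that reads RTF on standard input and writes the filtered RTF to standard output. The name `strip_textboxes' is made up for illustration, and the patterns assume the text-box group is written with exactly the control words found in Micha's file; other RTF producers may emit different ones.<br>
<br>
```shell
#!/bin/sh
# Sketch only: strip_textboxes is a hypothetical helper name,
# not part of sed, catdoc or OpenOffice.
# Reads RTF on stdin, writes RTF with the text-box wrappers removed to stdout.
strip_textboxes() {
    # Each "." in the patterns matches any single character; here it stands
    # in for the RTF backslash (and the "*" in the group opener), which
    # avoids backslash escaping inside the shell quotes.
    # 1st expression: replace the text-box group opener with a plain "{".
    # 2nd expression: replace the position/size trailer with a plain "}".
    sed -e "s/{...do.dobxpage.dobypara.dodhgt8192.dptxbx.dptxbxmar0{/{/g" \
        -e "s/}.dpx[0-9]*.dpy[0-9]*.dpxsize[0-9]*.dpysize[0-9]*.dplinehollow0}/}/g"
}
```
<br>
Pipe the original file through the function, save the result under a new name, and then run catdoc on the saved file as above.<br>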
<br>
<br>
--<br>
Ehud Karni Tel: +972-3-7966-561 /"\<br>
Mivtach - Simon Fax: +972-3-7976-561 \ / ASCII Ribbon Campaign<br>
Insurance agencies (USA) voice mail and X Against HTML Mail<br>
<a href="http://www.mvs.co.il" target="_blank">http://www.mvs.co.il</a> FAX: 1-815-5509341 / \<br>
GnuPG: 98EA398D <<a href="http://www.keyserver.net/" target="_blank">http://www.keyserver.net/</a>> Better Safe Than Sorry<br>
<br>
_______________________________________________<br>
Linux-il mailing list<br>
<a href="mailto:Linux-il@cs.huji.ac.il" target="_blank">Linux-il@cs.huji.ac.il</a><br>
<a href="http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il" target="_blank">http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il</a><br>
<br>
</blockquote>
<br>
<br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br>Danny Lieberman<br>-------------------------------------------------------------------------------------------------<br>Protect your data: <a href="http://www.software.co.il">http://www.software.co.il</a><br>
Twitter: <a href="http://twitter.com/onlyjazz">http://twitter.com/onlyjazz</a><br>Skype: dannyl50<br>Warsaw:+48-79-609-5964<br>Israel: +972 8 9701485<br>Mobile: +972 - 54 447 1114<br>
</div>