watch hebrew?
Shachar Shemesh
shachar at shemesh.biz
Wed Jul 1 15:13:05 IDT 2009
Shachar Shemesh wrote:
> Yuval Hager wrote:
>> if I try to run
>> $ watch "echo שלום123"
>> I just get the '123' on the page, Hebrew characters stripped.
>>
>> man watch says:
>>
>>> Non-printing characters are stripped from program output.
>>> Use "cat -v" as part of the command pipeline if you want to see
>>> them.
>>>
>>
>> but if I try
>> $ watch "echo שלום123|cat -v"
>> I get something like:
>> M-WM-)M-WM-^\M-WM-^UM-WM-^]123
>>
>> Any way to see Hebrew characters under watch?
>>
>>
> This works for iso-8859-8 locale, but not for UTF-8 locale. I think
> it's a bug in both watch (for treating these characters as
> unprintables) and in cat (for the same reason).
>
> I do believe they have problems dealing with multi-byte characters. I
> have nothing to suggest besides sending a bug report.
>
> Shachar
Okay,
Narrowed it down some more.
The culprit is the use of "isprint" to decide whether a character is
printable. The problem is that isprint's interface makes it impossible
for it to know whether a multi-byte character is printable: it is handed
a single byte at a time, never the whole character. More details can be
seen at
http://mail.nl.linux.org/linux-utf8/2002-02/msg00086.html
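
To make the isprint limitation concrete, here is a minimal sketch in C. It
is not the code from watch or cat; the function names
count_unprintable_bytes and count_unprintable_chars are made up for this
illustration. The first function tests one byte at a time with isprint,
the second decodes whole multi-byte characters with mbrtowc and tests them
with iswprint, which is one possible way around the problem.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>
#include <wchar.h>
#include <wctype.h>

/* Hypothetical helper: byte-wise check. isprint() only ever sees one
 * byte, so each byte of a multi-byte UTF-8 sequence is judged alone. */
size_t count_unprintable_bytes(const char *s)
{
    size_t bad = 0;

    assert(s != NULL);
    for (; *s != '\0'; s++) {
        if (!isprint((unsigned char)*s))
            bad++;
    }
    return bad;
}

/* Hypothetical helper: character-wise check. Each multi-byte sequence is
 * decoded with mbrtowc() according to the current LC_CTYPE locale, and
 * the resulting wide character is tested with iswprint().
 * Returns (size_t)-1 if the input is not valid in the current locale. */
size_t count_unprintable_chars(const char *s)
{
    mbstate_t st;
    size_t bad = 0;
    size_t left;

    assert(s != NULL);
    left = strlen(s);
    memset(&st, 0, sizeof st);

    while (left > 0) {
        wchar_t wc;
        size_t n = mbrtowc(&wc, s, left, &st);

        if (n == (size_t)-1 || n == (size_t)-2)
            return (size_t)-1;   /* invalid or truncated sequence */
        if (n == 0)
            break;               /* embedded NUL; not expected here */
        if (!iswprint((wint_t)wc))
            bad++;
        s += n;
        left -= n;
    }
    return bad;
}
```

For the string from the original question, the four Hebrew letters take
eight bytes in UTF-8, and the byte-wise function counts all eight as
unprintable. The character-wise function only gives a useful answer after
the program has called setlocale(LC_CTYPE, "") from <locale.h> under a
UTF-8 locale; in that case I would expect it to report no unprintable
characters for the same string, but I have not checked this against every
libc.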
Shachar
--
Shachar Shemesh
Lingnu Open Source Consulting Ltd.
http://www.lingnu.com