watch hebrew?

watch hebrew?

Shachar Shemesh shachar at shemesh.biz
Wed Jul 1 15:13:05 IDT 2009


Shachar Shemesh wrote:
> Yuval Hager wrote:
>> if I try to run
>> $ watch "echo שלום123"
>> I just get the '123' on the page, Hebrew characters stripped.
>>
>> man watch says:
>>   
>>>       Non-printing characters are stripped from program output.
>>>       Use "cat -v" as part of the  command  pipeline if you want to see
>>>       them. 
>>>     
>>
>> but if I try
>> $ watch "echo שלום123|cat -v"
>> I get something like:
>> M-WM-)M-WM-^\M-WM-^UM-WM-^]123
>>
>> Any way to idea Hebrew characters under watch?
>>
>>   
> This works for iso-8859-8 locale, but not for UTF-8 locale. I think 
> it's a bug in both watch (for treating these characters as 
> unprintables) and in cat (for the same reason).
>
> I do believe they have problems dealing with multi-byte characters. I 
> have nothing to suggest besides sending a bug report.
>
> Shachar
Okay,

Narrowed it down some more.

The culprit is the use of "isprint" to figure out whether a character is 
printable or not. The problem is that the interface for isprint has an 
interface that makes it impossible for it to know whether the character 
is printable or not. More details can be seen at 
http://mail.nl.linux.org/linux-utf8/2002-02/msg00086.html

Shachar

-- 
Shachar Shemesh
Lingnu Open Source Consulting Ltd.
http://www.lingnu.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.huji.ac.il/pipermail/linux-il/attachments/20090701/e8238831/attachment.html>


More information about the Linux-il mailing list