encoding hebrew text

encoding hebrew text

Uri Even-Chen uri at speedy.net
Wed Dec 16 01:09:29 IST 2009


OK, I found a solution.  I opened the file with both notepad and
notepad++, then I changed the encoding to windows-1255 in notepad++,
then I copied all the contents to notepad and saved in utf-8.  It
works.  I'm attaching the result.

Thanks!
Uri Even-Chen
Mobile Phone: +972-50-9007559
E-mail: uri at speedy.net
Blog: http://www.speedy.net/uri/blog/



On Wed, Dec 16, 2009 at 12:55 AM, Matitiahu Allouche <matial at il.ibm.com> wrote:
>
> OK, your file is encoded in windows-1255.  What is the problem?  Open it in Notepad and save it as UTF-8.  You will get a BOM at the beginning.  If this is bad for you, edit the file with any editor except Notepad and remove the first 3 bytes.
>
>
> Shalom (Regards),  Mati
>           Bidi Architect
>           Globalization Center Of Competency - Bidirectional Scripts
>           IBM Israel
>           Phone: +972 2 5888802    Fax: +972 2 5870333    Mobile: +972 52 2554160
>
>
>
> From:
> Uri Even-Chen <uri at speedy.net>
> To:
> Tom Goren <motnerog at gmail.com>
> Cc: linux-il <linux-il at cs.huji.ac.il>
> Date: 16/12/2009 00:47
> Subject: Re: encoding hebrew text
> Sent by: linux-il-bounces at cs.huji.ac.il
> ________________________________
>
>
> OK, I'm attaching an example file with a few lines in hebrew (encode
> it in windows-1255 to read the hebrew).  I checked now and I can
> convert the file to windows-1255 encoding, the problem was that it was
> already in utf-8 and that's why I couldn't convert it.  but I still
> need to encode it in utf-8 with hebrew enabled, so I will be able to
> read the file with notepad and notepad++.
>
> Uri.
>
>
> On Wed, Dec 16, 2009 at 12:21 AM, Tom Goren <motnerog at gmail.com> wrote:
> > could you perhaps attach an example of such a file?
> >
> > it would make it easier to recommend the appropriate conversion for you to
> > make (in my opinion it should eventually all be utf8).
> >
> > i know that notepad++ should be sufficient.
> >
> > tom.
> >
> > 2009/12/15 Uri Even-Chen <uri at speedy.net>
> >>
> >> Hi people,
> >>
> >> I have a problem with encoding hebrew text files on windows.  I used
> >> notepad to edit these files, now I'm using notepad++ (by the way, I
> >> highly recommend notepad++ on windows).  the problem is, hebrew text
> >> appears as gibberish (ëøèéñ etc.).  I tried different encodings,
> >> eventually with using windows-1255 as the character set, I can read
> >> the hebrew in notepad++, but I can't convert it to utf-8.  Also, one
> >> of my files I can't read the hebrew at all, even with windows-1255
> >> encoding.  I need help to fix the hebrew encoding and convert the
> >> files to utf-8.  Am I right that utf-8 is the best solution for
> >> hebrew?
> >>
> >> Thanks,
> >> Uri Even-Chen
> >> Mobile Phone: +972-50-9007559
> >> E-mail: uri at speedy.net
> >> Blog: http://www.speedy.net/uri/blog/
> >>
> >> _______________________________________________
> >> Linux-il mailing list
> >> Linux-il at cs.huji.ac.il
> >> http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il
> >
> >
> [attachment "1.txt" deleted by Matitiahu Allouche/Israel/IBM] _______________________________________________
> Linux-il mailing list
> Linux-il at cs.huji.ac.il
> http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il
>
>
-------------- next part --------------
   זיכרון נייד חדשני Sandisk Cruzer Micro U3 2GB.
   דגם: SDCZ6-2048-E10WT 
   מספר מכירה: 11648541  
   מחיר המוצר כולל דמי משלוח: 139 ₪


More information about the Linux-il mailing list