convert old DOS Hebrew encoding
Shachar Shemesh
shachar at shemesh.biz
Thu Oct 28 15:00:36 IST 2010
On 28/10/10 14:48, Tzadik Vanderhoof wrote:
> I have a binary data file, in a format used by a relatively ancient
> program, which I am trying to convert into something sane. With the
> help of a Hex editor I have basically worked out the file format
> except that it contains Hebrew characters with an odd encoding.
>
> All characters are 8 bits. The "standard" 27 consonants (including
> "final" consonants) go from hex *80* to *9A*. Then there are *vowels*
> that seem to start around hex *9B* or so (I'm guessing right after the
> standard consonants end). Then there are *"dotted" consonants* that
> seem to start at hex *E0*.
>
> If I remember correctly, I think this is some sort of DOS encoding
> (perhaps connected to the old "Hebrew chip"). Does anyone have a table
> of this character mapping or a tool that will translate this mapping
> into a more normal Hebrew encoding like Unicode?
>
> Thanks so much!
>
> --
And I should add:
iconv -f cp862 < file > output
--
Shachar Shemesh
Lingnu Open Source Consulting Ltd.
http://www.lingnu.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.huji.ac.il/pipermail/linux-il/attachments/20101028/cd5edee4/attachment-0001.html>
More information about the Linux-il
mailing list