VIM removal of Unicode ('bomb')

VIM removal of Unicode ('bomb')

Shachar Shemesh shachar at shemesh.biz
Wed Feb 4 20:06:20 IST 2009


Noam Rathaus wrote:
> I am using:
> VIM - Vi IMproved 6.1 (2002 Mar 24, compiled Jan 15 2003 08:05:27)
>
> And I have a few unicode characters (unicode encoding 'bomb'/marker) at the 
> beginning of the file that I want to remove.
> 0000000 bbef 3cbf 4421 434f 5954 4550 6820 6d74
>
> I m referring to 0xbb, 0xef, 0x3c, 0xbf
Supplementing my previous answer, your stated byte order is wrong. The 
order is 0xef 0xbb 0xbf. The first three bytes of the file are the BOM. 
The fourth byte, 0x3c, encodes a "<", and is not part of the meta-data.

Shachar



More information about the Linux-il mailing list