Preparing to convince to shift to non-propriety documents formats

Yedidyah Bar-David linux-il at didi.bardavid.org
Mon Feb 20 11:29:41 IST 2012


On Mon, Feb 20, 2012 at 10:40:58AM +0200, Nadav Har'El wrote:
> On Sun, Feb 19, 2012, Dotan Cohen wrote about "Re: Preparing to convince to shift to non-propriety documents formats":
> > Undocumented? Which file format is that? All the .doc and .docx
> > formats are documented, even the older binary formats.
> 
> Where is the ".doc" format documented?
> 
> I once wrote a tool to extract the text in MS Office files (for a search
> engine). It was a really annoying reverse-engineering-like
> trial-and-error process, and I could hardly find any documentation.
> The PowerPoint format (.ppt) was particularly odd.
> 
> What documentation do you refer to?

According to Wikipedia, the .doc format is only partially documented. I have
not followed the links in the article:
http://en.wikipedia.org/wiki/DOC_(computing)#Specification
-- 
Didi
