Preparing to convince to shift to non-proprietary document formats
Yedidyah Bar-David
linux-il at didi.bardavid.org
Mon Feb 20 11:29:41 IST 2012
On Mon, Feb 20, 2012 at 10:40:58AM +0200, Nadav Har'El wrote:
> On Sun, Feb 19, 2012, Dotan Cohen wrote about "Re: Preparing to convince to shift to non-proprietary document formats":
> > Undocumented? Which file format is that? All the .doc and .docx
> > formats are documented, even the older binary formats.
>
> Where is the ".doc" format documented?
>
> I once wrote a tool to extract the text in MS Office files (for a search
> engine). It was a really annoying reverse-engineering-like
> trial-and-error process, and I could hardly find any documentation.
> The PowerPoint format (.ppt) was particularly odd.
>
> What documentation do you refer to?
According to Wikipedia, the .doc format is partially documented. I did not
follow the links in that section:
http://en.wikipedia.org/wiki/DOC_(computing)#Specification
--
Didi