Preparing to convince to shift to non-propriety documents formats

Preparing to convince to shift to non-propriety documents formats

Nadav Har'El nyh at math.technion.ac.il
Mon Feb 20 10:40:58 IST 2012


On Sun, Feb 19, 2012, Dotan Cohen wrote about "Re: Preparing to convince to shift to non-propriety documents formats":
> Undocumented? Which file format is that? All the .doc and .docx
> formats are documented, even the older binary formats.

Where is the ".doc" format documented?

I once wrote a tool to extract the text in MS Office files (for a search
engine). It was a really annoying reverse-engineering-like
trial-and-error process, and I could hardly find any documentation.
The PowerPoint format (.ppt) was particularly odd.

What documentation do you refer to?

-- 
Nadav Har'El                        |                    Monday, Feb 20 2012, 
nyh at math.technion.ac.il             |-----------------------------------------
Phone +972-523-790466, ICQ 13349191 |I'm experiencing both amnesia and deja
http://nadav.harel.org.il           |vu. I think I've forgotten this before!



More information about the Linux-il mailing list