DOCX should not be confused with [[DOC]], the format used by earlier versions of Microsoft Office.

+

An executable file is used to perform tasks according to encoded instructions. Executable files are sometimes also referred to as binaries, although binaries can technically be considered a subclass of executable files.

−

= Container Format =

+

There are multiple families of executable files:

+

* Scripts; e.g. shell scripts, batch scripts (.bat)

+

* DOS and Windows executable files (.exe), which can be in various formats such as MZ, PE/COFF and NE

+

* ELF

+

* Mach-O
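
The families listed above can often be told apart by the first few bytes of the file. The following is a minimal sketch, not a complete identifier: the function name <code>guess_executable_family</code> is hypothetical, and the signatures it checks (<code>#!</code> for scripts with an interpreter line, <code>MZ</code>, <code>\x7fELF</code>, and the four thin Mach-O magic values) are well-known values that are assumed here rather than taken from this article. Batch scripts have no signature, and fat (universal) Mach-O files are deliberately left out because their magic collides with Java class files.

```python
# Thin Mach-O magic numbers as they appear on disk (32/64-bit, both byte orders).
MACHO_MAGICS = {
    b"\xfe\xed\xfa\xce",  # 32-bit, big-endian
    b"\xce\xfa\xed\xfe",  # 32-bit, little-endian
    b"\xfe\xed\xfa\xcf",  # 64-bit, big-endian
    b"\xcf\xfa\xed\xfe",  # 64-bit, little-endian
}


def guess_executable_family(header: bytes) -> str:
    """Guess the executable family from the leading bytes of a file.

    Illustrative only: a matching signature does not prove the file is valid.
    """
    if header.startswith(b"#!"):
        return "script"
    if header.startswith(b"MZ"):
        # MZ is the DOS header; NE and PE/COFF files also start with it.
        return "MZ"
    if header.startswith(b"\x7fELF"):
        return "ELF"
    if header[:4] in MACHO_MAGICS:
        return "Mach-O"
    return "unknown"
```

To apply it to a file, read the first four bytes in binary mode and pass them to the function. An <code>MZ</code> result only narrows the file down to the DOS/Windows family; distinguishing NE from PE/COFF requires parsing further into the header.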

−

DOCX is a container format: a [[ZIP archive]] file containing [[XML]] and binary files. The content can be analysed without modifying the file by unzipping it (e.g. with WinZip) and examining the contents of the archive.

The file _rels/.rels contains information about the structure of the document. It contains paths to the metadata information as well as the main XML document that contains the content of the document itself.

Metadata is usually stored in the folder docProps. Two or more XML files are stored inside that folder: app.xml, which stores metadata extracted from the Word application itself, and core.xml, which stores metadata from the document itself, such as the author name and the last time it was printed.

Another folder contains the actual content of the document. In a Word (.docx) document this folder is named word. An XML file called document.xml is the main document, containing most of the content of the document itself.
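
The layout described above can be inspected read-only with a few lines of code. The following is a minimal sketch using Python's standard <code>zipfile</code> and <code>xml.etree.ElementTree</code> modules; the function name <code>docx_overview</code> is hypothetical, and it assumes the archive contains a <code>_rels/.rels</code> member whose entries are <code>Relationship</code> elements carrying <code>Type</code> and <code>Target</code> attributes.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def docx_overview(source):
    """Return (member names, {relationship type: target path}).

    `source` may be a path or a binary file-like object. The archive is
    only read, never modified.
    """
    with zipfile.ZipFile(source) as archive:
        names = archive.namelist()
        rels_root = ET.fromstring(archive.read("_rels/.rels"))

    targets = {}
    for element in rels_root.iter():
        # Match on the local name so the sketch does not depend on the
        # exact XML namespace used by the package.
        if element.tag == "Relationship" or element.tag.endswith("}Relationship"):
            targets[element.get("Type")] = element.get("Target")
    return names, targets
```

Calling <code>docx_overview("report.docx")</code> (the file name is a placeholder) would be expected to list members such as those under docProps and word, with the relationship targets pointing at the metadata files and the main document. A missing <code>_rels/.rels</code> raises <code>KeyError</code>, which is itself a useful signal that the file is not a well-formed package.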

Office Open XML is an open XML standard developed by Microsoft for word processing documents, spreadsheets, presentations and charts. The OOXML standard was submitted to the ISO for approval. After initially rejecting it over technical concerns, the ISO approved a modified version as ISO/IEC 29500:2008. Microsoft intended to use the OOXML standard for its Office suite. However, Office does not support the standard that the ISO approved; it only supports the version that the ISO originally rejected [http://arstechnica.com/microsoft/news/2010/04/iso-ooxml-convener-microsofts-format-heading-for-failure.ars]. As of Office 2010, Microsoft has still not brought its software into compliance with the standard.

+

=== Mach-O ===

+

* [http://en.wikipedia.org/wiki/Mach-O Wikipedia: Mach-O]

−

For most purposes OOXML may be considered a subset of DOCX (DOCX contains additional features, like OLE serialization).

Revision as of 17:11, 18 May 2014


An executable file is used to perform tasks according to encoded instructions. Executable files are sometimes also referred to as binaries, although binaries can technically be considered a subclass of executable files.

There are multiple families of executable files:

Scripts; e.g. shell scripts, batch scripts (.bat)

DOS and Windows executable files (.exe), which can be in various formats such as MZ, PE/COFF and NE