Note: I determined to learn LaTeX around 5 years ago. At that time, a friend of mine gave me a book about how to use LaTeX. I also found some resources on the Internet. But due to limited use for my research and work, I do not have a chance to focus on it. LaTeX has so many advantages over other word processors like Microsoft Word. The relationship between LaTeX and Word is similar to that between keyboard and mouse. Mouse makes the operation easier, but keyboard is much more important and necessary. For common users, Word could satisfy their needs, but the equations and batch operations need LaTeX. In previous pages here, I insert some equations with LaTeX codes supported by WordPress. Because people around me do not use LaTeX very often, I had to find a way to convert my LaTeX files into Microsoft Word to exchange my ideas about research and work. Google found the useful information for me. In this blog, I forwarded an overview on converters between LaTeX and PC word processors for your and my own references.

I maintain these pages because I need converters between LaTeX and PC Textprocessors for my work and I want to share the information with others who need it. Because I maintain them in my spare time (uh, what is spare time?), I can not answer individual questions.

This list is as good or as bad as its support, and it needs YOUR support to update and supplement this list. Please supplement if you know more and/or better ones. There are some more converters on the CTAN sites, but the following seem to be most promising for conversion to and from the current versions of wordprocessors.

Note that this FAQ list contains information about converters ONLY between LaTeX and PC word processors. Converters to and from other formats may have own FAQ lists – e.g. see the link for converters to and from HTML.

General Remarks

Before looking for a converter, stop and think about a principal question:

What do you want to be converted in which way?

Do you want to convert the document structure, i.e. a heading should remain a heading, a list should remain a list etc., no matter how it will look like in the target format?
Or do you want to convert the appearance, i.e. how it looks like, no matter how it is represented in the target format?
Or do you want a mixture of both?
For using SGML as an intermediate format, you would have to specify the translation rules yourself (as far as I understood). This makes sense, and explains why different people have very different opinions about which converter best fits their needs: They simply have different demands and expectations on what should be converted and how.
So, not only practically there is no converter which is good for everyone and every purpose, but this is even principally impossible because there are no well-defined requirements which a converter should meet.

An additional problem is that TeX/LaTeX can be extended by an unlimited number of macros. Unless the converter contains a full-scale TeX system, it can at best support the publicly available macro commands, not the ones privately written by individual users. Practically you can expect that it supports the standard LaTeX commands and perhaps a few more widely used packages. The only converter which uses a full-scale TeX system is TeX4ht.

So keep this in mind when looking through the following list of converters, try yourself and decide what you need.

There are several ways to convert.

To illustrate these, let me restrict it to the Microsoft Word case:

directly type or paste LaTeX code into Word

use a Word import filter

use a Word macro: load LaTeX file as plain text, then search for LaTeX markup and replace the markup by formatting, special characters and equations.

TexPort – a commercial TeX/LaTeX to WordPerfect and Microsoft Word converter for PC.

TeX4ht – a free LaTeX to html or XML converter for PC and Unix produces html which is good for loading into Word. TeX4ht relies on other software, it needs at least a full TeX system.

There are also converters to Powerpoint and to FrameMaker (see further below).

Directly type or paste LaTeX code into Word

All of these only allow typing or pasting LaTeX coded equations in Word, not LaTeX coded text elements.

“Aurora” (formerly named Ribbit) can now convert a LaTeX coded equation (which must be placed on the Windows clipboard) to Word. The converter is still experimental and as such has a number of limitations, some of which will be addressed in future releases. The converter’s output will generally need some manual touching up to achieve the level of fidelity on a par with the original document.
The other functionality of Aurora, which was the only functionality of its predecessor “Ribbit”, is letting you enter LaTeX equations in word processors such as MS Word or in Powerpoint. One can enter the equation in LaTeX markup, and the formatted equation is inserted as an object.
See homepage (external link) (Shareware)
Aurora needs a working LaTeX installation. If there is no, it will install a micro version of MiKTeX.

“LaTeX in Word”: See homepage (external link) (Freeware, GPL).
It allows to enter LaTeX equations in word processors such as MS Word in LaTeX markup, and the formatted equation is inserted in the Wordprocessor as a png bitmap. It needs a server which performs the conversion. Server installation files are available from the download page (external link).

Use a Word import filter

TeX2Word: Shareware, 99$ (45$ academic).
Current version: 2.5, released April 2008.
Support for more document styles and packages will be available with future versions. You can also supply support for document styles, packages and user defined macros by yourself (needs TeX programming knowledge).
Needs:
* MS Windows 95 or later (NT4, ME, Win2000, WinXP),
* MS Word 95 or later (97, 2000, XP) and
* MathType (external link) 4 or later (full version of the Equation Editor which comes with MS Word).
See homepage (external link)

ltx2word, by myself: LaTeX to WinWord 6, WinWord 7(95) and WinWord 97 converter, written as Word macros. No tables. Not compatible with Word 2000 and up, no further development.
See more detailed page.

Use a Word add-in

The following are no full converters but only allow typing or pasting LaTeX code into Word:

“Aurora” (formerly named Ribbit) can now convert LaTeX code (which must be placed on the Windows clipboard) to Word. The converter is still experimental and as such has a number of limitations, some of which will be addressed in future releases. The converter’s output will generally need some manual touching up to achieve the level of fidelity on par with the original document.
The other functionality of Aurora, which was the only functionality of its predecessor “Ribbit”, is letting you enter LaTeX equations in word processors such as MS Word or in Powerpoint. One can enter the equation in LaTeX markup, and the formatted equation is inserted as an object.
See homepage (external link) (Shareware)
Aurora needs a working LaTeX installation. If there is no, it will install a micro version of MiKTeX.

“LaTeX in Word”: It allows to enter equations in word processors such as MS Word in LaTeX markup, and the formatted equation is inserted in the Wordprocessor as a png bitmap. It needs a server which performs the conversion.
See homepage (external link) (Freeware, GPL)
Server installation files are available from the download page (external link).

Use a PowerPoint add-in

TexPoint enables the easy use of Latex symbols and formulas in Powerpoint presentations. See homepage (external link). (Shareware)
Latest version requires PowerPoint2000, does not work with earlier versions of PowerPoint.

HTML as intermediate format

LaTeX to HTML

cost free unless otherwise stated

Because HTML is a structured format, the conversion between HTML and LaTeX is rather straightforward. However there remain the limitations of HTML compared to LaTeX, i.e. there are many elements in LaTeX which can not (yet?) be represented in HTML. Converters from LaTeX to HTML are:

Recommended if you have TeX installed or don’t mind to install it: TeX4ht (external link) is a highly configurable TeX-based converter to hypertext. It comes with a built-in default setting for plain TeX, LaTeX and TeXinfo, and it generates html with accompanying css stylesheet, xhtml, or xml. The converter needs a full TeX installation, but this gives the advantage that TeX’s full support for macros and styles is available (with only few exceptions).
Equations are converted to either bitmaps or MathML. There are some different MathML flavors around which can be chosen by an option. (Following description is partially copied from the TeX4ht web site.)
The special commandoolatex
is available for producing xml compatible with OpenOffice (and probably also StarOffice). The output of the command oolatex <filename> is a zipped file with same name and a “.sxw” extension (containing the document in xml format which does not suffer the limitations of html). For this to work, TeX4ht needs a zip program which is not included in the TeX4ht distribution. It can e.g. be downloaded from http://www.info-zip.org/ (external link).
The resulting .sxw file can directly be opened in OpenOffice, converted equations are editable in OpenOffice’s own equation editor. OpenOffice can save the document also in MS Word 97/2000/XP format, but many equations are not correctly converted to Word. This is known to the OpenOffice developers for long, but yet not fixed.
A command of the formhtlatex filename "html,word" "symbol/!"
asks for HTML output tuned toward MicroSoft Word. Such a format, however, relies on bitmaps for mathematical formulas.
Conversion to bitmaps additionally needs Ghostscript and ImageMagic or netpbm.
For more information see http://www.cse.ohio-state.edu/~gurari/TeX4ht/mn3.html (external link).

TTH (external link): LaTeX-to-HTML converter which translates LaTeX into HTML 4.0 markup. Formulae are also translated into standard html markup. (Free for non-commercial applications.)
A sister of tth, TtM (external link), converts formulae to MathML (Linux version for free, Windows version must be paid).

ltoh (external link): LaTeX-to-HTML converter which is highly customizable, i.e. you can define how the LaTeX macros which are used in your document are to be translated. Requires that the input file conforms to LaTeX2e (see documentation). It was last updated 1979, and it seems that the homepage is no longer available, so look on CTAN in …/support/ltoh/ .

Some converters are available from CTAN (external link) (“Comprehensive TeX Archive Network”), e.g. in …/support/latex2html.
(The … stands for a host specific base directory, which often is either “/pub/tex” or “/tex-archive”)

OpenOffice can also import html, but much better is using tex4ht for lossless conversion to native OpenOffice format.

Other intermediate formats

There are ways to use SGML as intermediate format, and others have used it successfully. Having had a quick look at it, I found it rather complicated, especially it seems that you have to define the translation rules yourself. So I did not put more effort in trying to use it. If anyone can give me a ready-to-use cookbook solution, I will include it here.

An upcoming format is XML, a subset of which can be exported and imported by Microsoft Office 2000 and up, OpenOffice uses it as its native format, and the browser programmers are working on implementing XML. It actually is an instance of SGML. As it is more powerful than HTML, conversion from LaTeX to XML would lose much less information than conversion from LaTeX to HTML. There are good chances that it could be used as a general exchange format in the future. TeX4ht already has scripts for converting to XML (TEI or DOCBOOK). MS Word 2000 and earlier can not import XML, for these target systems convert to html+css using the xwtex and xwlatex scripts. MS Word 2003 can export and import XML, but I haven’t yet tested whether it can import the TEI or DOCBOOK files produced by TeX4ht.
The most successful path is using TeX4ht to convert to the OpenOffice format (.sxw, which actually is a zip compressed archive containing the document and vector graphics as XML and the bitmap graphics as bitmap files) and open this in OpenOffice. One could stop there, as OpenOffice is publicly available, or go on and save from OpenOffice as a “MS Word 97/2000/XP” file.

Most astonishing, one could also use PDF as intermediate format. Generating PDF from LaTeX is straightforward if you have a full TeX implementation installed. If you have the full commercial version of Adobe Acrobat 7, you can open the pdf and “save as” e.g. rtf (saving “as Word doc” actually generates an rtf file, too), xml, or plain text. Or you can use other commercial software to convert PDF to Word, just do a www search for “pdf to word” to get several hits. In this path of conversion however the document strucure and probably some formatting will be lost.

Finally, you can use OCR software to convert any printed document to word or plain text. To avoid the inaccuracy introduced by printing to paper and scanning, you can convert the TeX output to ps or pdf, convert this to a bitmap (using ghostscript), and feed this bitmap into the ocr software.

it must remain intact and the contents unchanged; if you’d like to have something changed, contact me (texconvfaq “at” gmx.de). Reformatting (e.g. from HTML to some other presentation format) is granted as long as the contents are unchanged.

you may NOT ask money for it except a reasonable cost for media and distribution