Projects
Infrastructure libraries
- librevenge
- A base for all the other libraries, providing document interfaces and common types. It also allows export to several formats: plain text, HTML, CSV and SVG.
Export libraries
- libodfgen
- ODF export library.
- libepubgen
- EPUB export library.
- librvngabw
- AbiWord export library.
Import libraries
- libwpd
- Corel WordPerfect import library.
- libwpg
- Corel WordPerfect Graphics import library.
- libwps
- Microsoft Works import library.
- libmwaw
- A library for import of many legacy Mac document formats.
- libabw
- AbiWord import library.
- libcdr
- CorelDRAW import library.
- libmspub
- Microsoft Publisher import library.
- libvisio
- Microsoft Visio import library.
- libetonyek
- Apple Keynote/Pages/Numbers import library.
- libfreehand
- Aldus/Macromedia/Adobe FreeHand import library.
- libe-book
- A library for import of many e-book formats.
- libpagemaker
- Adobe PageMaker import library.
- libzmf
- Zoner Callisto/Draw import library.
- libstaroffice
- A library for import of legacy StarOffice binary formats.
All these libraries also provide command-line tools for conversion to common output formats such as plain text, XHTML, SVG or CSV. In addition, the writerperfect project (available as a subproject of libwpd) provides command-line tools for conversion to ODF.
Our wiki contains a more detailed overview of the libraries.
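As a rough illustration of how the command-line converters mentioned above might be driven from a script, here is a minimal Python sketch. It assumes that libwpd's `wpd2text` tool is installed and on the PATH, and that it takes the input document as its only argument and writes the converted text to standard output. The helper names `build_command` and `convert` are hypothetical and not part of any of the libraries.

```python
import shutil
import subprocess
from typing import List, Optional


def build_command(tool: str, document: str) -> List[str]:
    """Return the argument list for running one converter on one document."""
    # Assumption: the tool takes the input path as its only argument.
    return [tool, document]


def convert(document: str, tool: str = "wpd2text") -> Optional[str]:
    """Run the converter and return what it printed to standard output.

    Returns None when the tool cannot be found on PATH, so callers can
    fall back to another converter or report a missing dependency.
    """
    if shutil.which(tool) is None:
        return None
    result = subprocess.run(
        build_command(tool, document),
        capture_output=True,
        text=True,
        encoding="utf-8",  # assumption: the tool emits UTF-8 text
        check=True,        # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

Calling `convert("letter.wpd")` would then return the plain-text rendering of a WordPerfect document, or `None` if the tool is not installed. The other libraries ship tools that could be passed in the same way through the `tool` parameter, provided they follow the same calling convention.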
Introspection tools
- OLEToy
- A visualizer for many binary formats. The internal structure of a format is described programmatically in Python code.
- colupatr
- A hex editor with support for breaking lines and adding comments.
- mso-dumper
- Dumper tool for Microsoft Office binary file formats.
Sample document repositories
- SoftMaker Office
- TextMaker .tmd, PlanMaker .pmdx and Presentations .prd files
Ideas for future development
Import
- Software602 602Text and 602Tab
- Serif PagePlus
- QuarkXPress
The above list is just a selection: we keep a complete list of suggested file formats in our wiki.