1,747 bytes added ,  15:05, 8 August 2015
moves pandoc to the top of the list
This page is about '''all''' tools for converting content from one document/file format to another. See [[wp:File_format]] for information about file formats. We examine mostly office documents and wiki file formats; there are many other conversions we could discuss if multimedia formats and the like were included. There are very useful tools and discussions about converting between [[MediaWiki]] wikitext and other target [[wiki]] syntaxes, [[XML]], XHTML, DocBook, [[OpenDocument]], Portable Document Format (PDF) and more.
==Office Suites and Formats==
Following the information at http://www.xml.com/lpt/a/1638, create a local macro and run it from the command line:
<source lang="bash">
ooffice2 -invisible "macro:///Standard.MyConverters.SaveAsOOO(/home/greg/projects/slides/executive.ppt)"
</source>
(Note that OOO is just shorthand for OpenOffice.Org)
==Wiki formats==
 
=== Mediawiki DTD ===
http://meta.wikimedia.org/wiki/Wikipedia_DTD
 
=== pandoc ===
 
[http://code.google.com/p/pandoc/ Pandoc] is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library. It can read markdown and (subsets of) reStructuredText, HTML, and LaTeX, and it can write markdown, reStructuredText, HTML, LaTeX, ConTeXt, DocBook XML, OpenDocument XML, GNU Texinfo, RTF, ODT, MediaWiki markup, groff man pages, and [[Presentation|S5 HTML slide shows]].
 
Or more simply, Pandoc rocks the free world! Because Pandoc can write MediaWiki markup, we used it in the [[Html2Wiki]] extension.
 
To convert an HTML document to MediaWiki syntax, run a command such as:
<source lang="bash">
pandoc --from html --to mediawiki foo.html --output foo.wiki.txt
</source>
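
To convert a whole directory of HTML files, the same command can be wrapped in a loop. The following is only a minimal sketch, assuming pandoc is installed and on the PATH. The helper names wiki_name and convert_all are hypothetical, and the .wiki.txt naming scheme is simply carried over from the example above.

```shell
# Hypothetical helper: map foo.html to foo.wiki.txt
# (naming scheme assumed from the single-file example above).
wiki_name() {
    printf '%s\n' "${1%.html}.wiki.txt"
}

# Hypothetical helper: convert every .html file in a directory
# (defaults to the current directory) to MediaWiki markup.
convert_all() {
    dir="${1:-.}"
    for f in "$dir"/*.html; do
        # Skip the literal pattern when no .html file matches.
        [ -e "$f" ] || continue
        pandoc --from html --to mediawiki "$f" --output "$(wiki_name "$f")"
    done
}
```

Calling convert_all with a directory path writes one .wiki.txt file next to each .html file found there; files with other extensions are left alone.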
 
=== Wiki To PDF ===
In 2008 it was announced that wikis have gone print-on-demand (http://wikimediafoundation.org/wiki/Press_releases/Wikis_Go_Printable). Using the [http://www.mediawiki.org/wiki/Extension:Collection Collection extension], you can [[Special:Book|create "Books"]] (collections of wiki articles) that you can share, convert to PDF, and even print on demand at a high-quality press. On each article on this wiki you should see a "PDF version" link in the toolbox and a "Create A Book" section in the navigation bar. More information is available at [[Collections]].
=== Wiki To XML ===
OpenOffice Writer has an export filter that lets you author in OpenOffice and then save your document in wiki format.
 
== Other ==
=== html to pdf ===
[http://wkhtmltopdf.org/index.html wkhtmltopdf] is an LGPLv3 tool that renders HTML into PDF and various image formats using the Qt WebKit rendering engine.
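
A basic invocation takes the input document followed by the output file. The wrapper below is a minimal sketch, assuming wkhtmltopdf is installed. The function names pdf_name and html_to_pdf are hypothetical, as is the choice to derive the PDF name from the HTML file name.

```shell
# Hypothetical helper: derive report.pdf from report.html.
pdf_name() {
    printf '%s\n' "${1%.html}.pdf"
}

# Hypothetical wrapper: render one HTML file to PDF.
# The second argument (output file) is optional.
html_to_pdf() {
    if ! command -v wkhtmltopdf >/dev/null 2>&1; then
        echo "wkhtmltopdf not found on PATH" >&2
        return 127
    fi
    # wkhtmltopdf takes the input first, then the output file.
    wkhtmltopdf "$1" "${2:-$(pdf_name "$1")}"
}
```

For example, calling html_to_pdf with report.html would write report.pdf in the same directory, while passing a second argument overrides the output name.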
 
*{{@todo}} update this page since it was last touched in 2009
[[Category:Wiki]]