Pages

Saturday, 1 February 2014

wordpress-to-markdown

wordpress-to-markdown

This script uses the standard exported XML file from WordPress, and creates a folder/file structure that contains all of the blog posts, converted to markdown format. It will also download all of the images.
Instructions for exporting your information from WordPress can be found here.
The folder structure was designed after my blog. I like the structure because it groups the files for the post with the post itself. If you want a different format, you'll need to modify the script.
/2013/11/this-is-a-post/index.html.md
/2013/11/this-is-a-post/image-for-the-post.jpg

Works on my box

This is highly experimental at best. It was developed for my own use to do a one time conversion from WordPress to markdown for a static generator such as DocPad. It is designed to be used one time and then throw away.

Technical Details

This uses xml2js to parse the XML easily, and then uses to-markdown to convert the HTML post content into Markdown.

Requirements

  • You must have Node.js installed and available in the folder you wish to run the script from.
  • The folder the script is is must contain a WordPress export file called "export.xml". The file name is hard-coded.
from https://github.com/ytechie/wordpress-to-markdown