ExpressionEngine CMS

This is an archived forum and the content is probably no longer relevant, but is provided here for posterity.


Transfer MS Word sections into EE blog entries

July 20, 2008 9:57pm

  • #1 / Jul 20, 2008 9:57pm

    DavidHarperTwo

    63 posts

    Hi,
    I am looking to re-purpose year-old paid content (contained in MS Word .doc files) into free content on my EE site (blog entries).

    Current paid content is about 800 MS Word pages, which divides/maps cleanly into almost 600 blog entries. Each entry is marked with its own (header) style in MS Word.

    So, I’d love to find an efficient way to transfer 600 style-demarcated MS Word text blocks (though many have images inside) into EE blog entries. If it were a few, I’d cut and paste, of course. I would appreciate any ideas on this. Thanks, David Harper

  • #2 / Jul 22, 2008 4:06pm

    Adrienne L. Travis

    213 posts

    Hm. If i were doing this, i’d do something like the following (this assumes your weblog is already created to hold the entries!):

    1. Export the whole shebang to HTML (images will end up in an images folder)
    2. Use Dreamweaver or another tool to clean up Word’s entirely abysmal HTML output
    3. Upload the images into /images/uploads/ on your server
    4. Do some regex-fu to change all the image paths to point to http://www.yourdomain.com/images/uploads/ , and to add some sort of simple (invented) XML tags around each whole entry and the title and body of each entry (something like <entry><title></title><body></body></entry>, for each entry)
    5. Put an XML wrapper tag of some sort (<blogentries></blogentries>) around the whole document (because valid XML can only have one root element)
    6. Use Andrew’s (spectacular!) XMLGrab plugin to pull in all the entries.

    ...I’m not sure this is really “efficient”, but it’s better than cut-and-paste, and it’s the best i can think of. I’m betting it’d take about 3 or 4 hours total.
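
As a rough illustration of steps 4 and 5 above, here is a minimal Python sketch. It is not part of the original advice and makes several assumptions: the cleaned-up HTML marks each entry title with one heading tag (`h2` here, purely as a guess), image references are plain `<img src="...">` attributes with double quotes, and the function names are invented for this example. The entry body is XML-escaped so the output stays well-formed; whether XMLGrab prefers escaped HTML or some other form is something to check against its own documentation.

```python
import re
from xml.sax.saxutils import escape

# Assumed upload location, taken from the placeholder URL in step 4.
UPLOAD_URL = "http://www.yourdomain.com/images/uploads/"


def rewrite_image_paths(html, base_url=UPLOAD_URL):
    """Point every <img src="..."> at base_url, keeping only the file name."""
    def repl(match):
        filename = match.group(2).replace("\\", "/").rsplit("/", 1)[-1]
        return match.group(1) + base_url + filename + match.group(3)

    return re.sub(r'(<img\b[^>]*?\bsrc=")([^"]*)(")', repl, html,
                  flags=re.IGNORECASE)


def split_entries(html, heading_tag="h2"):
    """Split HTML into (title, body) pairs.

    Assumes each entry starts with one heading_tag element and its body
    runs until the next heading_tag (or the end of the document).
    """
    pattern = re.compile(
        r"<{0}\b[^>]*>(.*?)</{0}>(.*?)(?=<{0}\b|\Z)".format(heading_tag),
        re.IGNORECASE | re.DOTALL,
    )
    entries = []
    for match in pattern.finditer(html):
        # Strip any inline markup Word left inside the heading itself.
        title = re.sub(r"<[^>]+>", "", match.group(1)).strip()
        body = match.group(2).strip()
        entries.append((title, body))
    return entries


def build_xml(entries):
    """Wrap entries in the invented tags from steps 4 and 5."""
    parts = ["<blogentries>"]
    for title, body in entries:
        parts.append(
            "<entry><title>{}</title><body>{}</body></entry>".format(
                escape(title), escape(body)
            )
        )
    parts.append("</blogentries>")
    return "\n".join(parts)


def convert(html, heading_tag="h2"):
    """Run the whole pipeline on one cleaned-up HTML string."""
    return build_xml(split_entries(rewrite_image_paths(html), heading_tag))
```

Calling `convert()` on the cleaned HTML text and saving the result would give a single XML file to feed to the import step. Real Word output is messy enough that the regexes will likely need adjusting after a look at a few exported entries.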

  • #3 / Jul 22, 2008 4:15pm

    DavidHarperTwo

    63 posts

    Thanks Adrienne, that sounds pretty killer. I don’t know how to do it, but I’d pay a premium even if it took 10 hours b/c it would save me a lot of time. If you know anybody who would do this, let me know. And thanks for taking time to offer the steps! David
