?

Log in

No account? Create an account
Pablo Garuda
Give Me The Facts
Word to Wiki 
1st-Jul-2008 02:47 am
Bushido Winnie
There is an enormous amount of knowledge locked up in PDF and Word documents, which could be shared on wikis such as Appropedia, if only it could be easily converted.

Take Word documents. There are a few ways to convert them, with more or less messy results. OpenOffice 2.3 and above has an export-to-MediaWiki function, but it's broken on my machine (I'm still looking for help), and reviews I found online are not very impressive.

Perhaps more promising: The Word2Wiki macro conversions are now more complete, as footnotes can be converted. See ReadFootNotes - at the moment it's 2 extra macros (one by a programmer friend who I badgered into writing it, and one by me). But someone with Windows and Word and a modest knowledge of Visual Basic should be able to make a single macro out of everything on that page, that hopefully will do a very good job of converting the text.

See also:
Porting PDF files to MediaWiki
Comments 
1st-Jul-2008 10:31 am (UTC)
Anonymous
A couple of things:

Word docs:

1. convert your Word doc to HTML (can be done in Open office)

2. Run through HTML tidy to clean up html

3. Use http://diberri.dyndns.org/wikipedia/html2wiki/ perl script to output media wiki syntax. Done!


PDF

1. use http://pdftohtml.sourceforge.net/ or xpdf to convert to html

2. clean up with htmltidy

3. HTML2Wiki

Done!

Sam Rose
2nd-Jul-2008 05:14 pm (UTC) - Promising
Thanks heaps Sam! These look promising. I've downloaded from http://www.appropedia.org/CSE_exclusion_list_test_page - when I get some time (or find a helpful geek) I'll work out how to install it.

I still would like to see the macro developed further, as a one-click solution that does all the fancy stuff like footnotes/references.
This page was loaded Aug 16th 2018, 11:55 pm GMT.