Automate pdf to html conversion for MOODLE

So guys, I have a problem. Am working on a Moodle site and the bulk of my work has been conversion of modules [Learning Material] which are in *.pdf into *.html files. The said modules are divided into sections About Course, About Author, Learning Activities etc sections of which go into different html files. I have resulted into manually having to convert this which is very tedious and repetitive to say the least. Is there a way to automate this? Say a way to parse the pdf file to produce the different html files that I produce from each pdf. Please suggest ways I can do this. Thanks, -- With Kind Regards James Wachira Nairobi .ke twitter: jwaciira | yahoo: jwaciira | gtalk: jwaciira | skype: jwaciira

http://pdftohtml.sourceforge.net/ On Tue, Mar 23, 2010 at 5:15 PM, James Wachira <jwaciira.lists@gmail.com>wrote:
So guys,
I have a problem. Am working on a Moodle site and the bulk of my work has been conversion of modules [Learning Material] which are in *.pdf into *.html files. The said modules are divided into sections About Course, About Author, Learning Activities etc sections of which go into different html files. I have resulted into manually having to convert this which is very tedious and repetitive to say the least. Is there a way to automate this? Say a way to parse the pdf file to produce the different html files that I produce from each pdf.
Please suggest ways I can do this.
Thanks,
-- With Kind Regards James Wachira Nairobi .ke twitter: jwaciira | yahoo: jwaciira | gtalk: jwaciira | skype: jwaciira
_______________________________________________ Skunkworks mailing list Skunkworks@lists.my.co.ke http://lists.my.co.ke/cgi-bin/mailman/listinfo/skunkworks ------------ Skunkworks Server donations spreadsheet
http://spreadsheets.google.com/ccc?key=0AopdHkqSqKL-dHlQVTMxU1VBdU1BSWJxdy1f... ------------ Skunkworks Rules http://my.co.ke/phpbb/viewtopic.php?f=24&t=94 ------------ Other services @ http://my.co.ke

On Tue, Mar 23, 2010 at 5:15 PM, James Wachira <jwaciira.lists@gmail.com> wrote:
I have a problem. Am working on a Moodle site and the bulk of my work has been conversion of modules [Learning Material] which are in *.pdf into *.html files. The said modules are divided into sections About Course, About Author, Learning Activities etc sections of which go into different html files. I have resulted into manually having to convert this which is very tedious and repetitive to say the least. Is there a way to automate this? Say a way to parse the pdf file to produce the different html files that I produce from each pdf.
See http://itextpdf.com/ which can be used from the command line and also has an API. But pdf to html is not always a good idea since PDF is primarily meant for print media. How were the original PDF files created ? if you find the original source files from which the PDFs were created -- you may find it easier to convert the source files to HTML.
participants (3)
-
Ashok Hariharan
-
James Wachira
-
Rad!