Guys,
I have tried most of this tools available online. Thing is they convert the whole pdf into one large html file.
The process requires me to convert one Module[the pdf file] into separate html files [usually 7 or 8] from different sections of the module[pdf file] and maintain the formatting on the pdf e.g paragraphs, ordered lists, unordered lists, images, bod text etc.
So the online solutions [at least the ones I've come across] do not help

Please have a look at the attached pdf.

Thanks


On Tue, Mar 23, 2010 at 5:39 PM, James Wachira <jwaciira.lists@gmail.com> wrote:
Guys,
I have tried most of this tools available online. Thing is they convert the whole pdf into one large html file.
The process requires me to convert one Module[the pdf file] into separate html files [usually 7 or 8] from different sections of the module[pdf file] and maintain the formatting on the pdf e.g paragraphs, ordered lists, unordered lists, images, bod text etc.
So the online solutions [at least the ones I've come across] do not help

Please have a look at the attached pdf.

Thanks

On Tue, Mar 23, 2010 at 5:24 PM, Ashok Hariharan <ashok@parliaments.info> wrote:
On Tue, Mar 23, 2010 at 5:15 PM, James Wachira <jwaciira.lists@gmail.com> wrote:
> I have a problem.
> Am working on a Moodle site and the bulk of my work has been conversion of
> modules [Learning Material] which are in *.pdf into *.html files.
> The said modules are divided into sections About Course, About Author,
> Learning Activities etc sections of which go into different html files.
> I have resulted into manually having to convert this which is very tedious
> and repetitive to say the least.
> Is there a way to automate this? Say a way to parse the pdf file to produce
> the different html files that I produce from each pdf.
>


See http://itextpdf.com/ which can be used from the command line and
also has an API.
But pdf to html is not always a good idea since PDF is primarily meant
for print media.
How were the original PDF files created ? if you find the original
source files from which the
PDFs were created -- you may find it easier to convert the source files to HTML.
_______________________________________________
Skunkworks mailing list
Skunkworks@lists.my.co.ke
http://lists.my.co.ke/cgi-bin/mailman/listinfo/skunkworks
------------
Skunkworks Server donations spreadsheet
http://spreadsheets.google.com/ccc?key=0AopdHkqSqKL-dHlQVTMxU1VBdU1BSWJxdy1fbjAwOUE&hl=en
------------
Skunkworks Rules
http://my.co.ke/phpbb/viewtopic.php?f=24&t=94
------------
Other services @ http://my.co.ke



--
With Kind Regards
James Wachira
Nairobi .ke
twitter: jwaciira | yahoo: jwaciira | gtalk: jwaciira | skype: jwaciira





--
With Kind Regards
James Wachira
Nairobi .ke
twitter: jwaciira | yahoo: jwaciira | gtalk: jwaciira | skype: jwaciira