atom feed12 messages in org.oasis-open.lists.docbookRe: DOCBOOK: converting to docbook
FromSent OnAttachments
jonathonAug 13, 2002 2:26 am 
Rizwan VirkAug 13, 2002 6:22 am 
Pradeep PadalaAug 13, 2002 7:54 am 
Georges SchmitzAug 13, 2002 8:31 am 
Bob StaytonAug 13, 2002 9:38 am 
Pradeep PadalaAug 13, 2002 10:05 am 
Pradeep PadalaAug 13, 2002 10:08 am 
David CramerAug 13, 2002 10:22 am 
Pradeep PadalaAug 13, 2002 2:19 pm.bin
Rizwan VirkAug 13, 2002 3:21 pm 
Georges SchmitzAug 14, 2002 4:13 am 
Marc BrierleyAug 15, 2002 11:44 am 
Subject:Re: DOCBOOK: converting to docbook
From:Bob Stayton (bo@caldera.com)
Date:Aug 13, 2002 9:38:00 am
List:org.oasis-open.lists.docbook

On Tue, Aug 13, 2002 at 02:26:24AM -0700, jonathon wrote:

All:

I have roughly 10 000 documents of various formats [ plain ASCII, TeeX, DocBook, HTML 4.01, XHTML 1.0 word, wordperfect, pdf and a couple of others. ]

Can anybody point me to something that will easilly convert these to docbook, and preserve some/most of their current formatting?

I'm not looking forward to doing the conversion manually.

If I had that problem, I would convert as many of them as I could to HTML, run 'tidy' to clean up the HTML, and then run the DocParse tool from www.commmandprompt.com to convert them to DocBook. DocParse is not free, but it is not expensive either.

For your PDF documents, I'd look for the source document that generated the PDF. It is tough (impossible?) to convert PDF.