Re: Cheicken and eggs scenario for structred writing

Subject: Re: Cheicken and eggs scenario for structred writing
From: Yves Barbion <yves -dot- barbion -at- gmail -dot- com>
To: Stuart Burnfield <slb -at- westnet -dot- com -dot- au>
Date: Fri, 26 Jul 2013 09:14:52 +0200

On Fri, Jul 26, 2013 at 4:14 AM, Stuart Burnfield <slb -at- westnet -dot- com -dot- au>wrote:

> (...)
>
> Start experimenting with a conversion process. It would be a mountain of
> work to manually paste your Word content into skeleton DITA topics. There
> won't be a simple single-step 'Save As DITA' but you should be able to at
> least semi-automate the process. For example:
> - Save a Word document as XML. This creates a text file with Word
> formatting mapped to generic tags. Use scripts, macros or regular
> expressions to convert all generic XML tag s <foo> to actual DITA tag <bar>
> and to delete unwanted MS Word fluff.
>

[Yves] >>> Instead of saving the Word file as (Microsoft) XML, you could
also try Eliot Kimber's Word-to-DITA transformation Framework:

http://dita4publishers.sourceforge.net/d4p-user-guide/user_docs/d4p-users-guide/word2dita/word2dita-intro.html#chapter-id

It does require a bit of setup, but you can then actually "save" your Word
files as DITA. You can download the framework from sourceforge.net:

http://sourceforge.net/projects/dita4publishers/files/

Another option is to:

1. Open the Word files in FrameMaker.
2. Use mif2go (www.mif2go.com) to set up your conversion (basically a style
to element mapping), for example:

- heading* = title
- bulleted list = ul li
- instruction = cmd
- heading 4 = section

3. Save the FM file as DITA.


> - Use MS Excel as an intermediate step to convert large tables to DITA.
>

[Yves] >>> That's not really required. Some DITA editors, such as oXygen
XML Author and FrameMaker 11, have a "Smart Paste" function, which handle
tables very nicely, even more complex tables with merged cells.

>
> As Chris says, you can't magically add structure information that isn't
> there in the original, but you can automate a lot of the grunt work.
>

[Yves] >>> Well, actually, you can, depending on the type of structure
information which needs to be added. Metadata is very important if you use
DITA. In a topic, you can add metadata in the <prolog> element, which can
contain things like the name of the author, the creation and modification
date of the topic, the product name, the version of the topic etc. MIF2Go
can add this prolog to each topic during the conversion.

Still, you will have to check your content after the conversion and
restructure it (a bit, YMMV). For example, suppose you have this paragraph
in the original text:

"Click on Preview. You should see that this output creates a PDF of the 3D
view only."

After conversion, you will get this:

<step><cmd>Click on Preview. You should see that this output creates a PDF
of the 3D view only.</cmd></step>

This is *valid* DITA, but not well-structured yet. You need to restructure
and refactor this to get something like this:

<step><cmd>Click <uicontrol>Preview</uicontrol>.</cmd>
<stepresult>You should see that this output creates a PDF of the 3D view
only.</stepresult>

Conversion can be automated; restructuring/refactoring cannot because you
actually have to read the text and then decide that, in this case, the
second sentence is the result of the instruction in the first sentence. In
other cases, however, the second sentence may be an example (stepxmp), a
tip (note type="tip"), or just some more information about the instruction
(info).

Cheers

--
Yves Barbion
www.scripto.nu


^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
New! Doc-to-Help 2013 features the industry's first HTML5 editor for authoring.

Learn more: http://bit.ly/ZeOZeQ

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

You are currently subscribed to TECHWR-L as archive -at- web -dot- techwr-l -dot- com -dot-

To unsubscribe send a blank email to
techwr-l-leave -at- lists -dot- techwr-l -dot- com


Send administrative questions to admin -at- techwr-l -dot- com -dot- Visit
http://www.techwhirl.com/email-discussion-groups/ for more resources and info.

Looking for articles on Technical Communications? Head over to our online magazine at http://techwhirl.com

Looking for the archived Techwr-l email discussions? Search our public email archives @ http://techwr-l.com/archives


References:
Re: Cheicken and eggs scenario for structred writing: From: Stuart Burnfield

Previous by Author: Re: Cheicken and eggs scenario for structred writing
Next by Author: Re: Cheicken and eggs scenario for structred writing
Previous by Thread: Re: Cheicken and eggs scenario for structred writing
Next by Thread: Re: Cheicken and eggs scenario for structred writing


What this post helpful? Share it with friends and colleagues:


Sponsored Ads