TechWhirl (TECHWR-L) is a resource for technical writing and technical communications professionals of all experience levels and in all industries to share their experiences and acquire information.
For two decades, technical communicators have turned to TechWhirl to ask and answer questions about the always-changing world of technical communications, such as tools, skills, career paths, methodologies, and emerging industries. The TechWhirl Archives and magazine, created for, by and about technical writers, offer a wealth of knowledge to everyone with an interest in any aspect of technical communications.
Sounds like you want source-language-to-source-language translation memory...
What do you author in? Something like SDL's Author Assistant would be
right up your alley.
On Thu, Jun 10, 2010 at 11:23 AM, Craig Haiss <craighaiss -at- yahoo -dot- com> wrote:
> Can anyone recommend tools that would flag similar paragraphs in a large documentation set? The goal is to re-use this content via XML and reduce the overall word count.
> The technical side of re-use isn't the issue; it's finding the redundant content at a granular level.
> Author-IT eXtend appears to show fuzzy content matches as you are typing new content, but I don't see any way to analyze a bunch of existing content for duplication.
> What I'm looking for is a tool that would analyze the whole doc set and produce a report of content chunks that are good candidates for re-use.
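As a rough illustration of the kind of report described above, here is a minimal sketch that compares every pair of paragraphs with Python's standard-library difflib and lists the pairs above a similarity cutoff. The function name similar_paragraphs, the 0.8 threshold, and the sample paragraphs are assumptions made for illustration; they do not come from any of the tools mentioned in this thread, and extracting paragraphs from the XML source is left out.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar_paragraphs(paragraphs, threshold=0.8):
    """Return (i, j, ratio) for each pair of paragraphs at or above threshold.

    The name and the default threshold are hypothetical choices for this sketch.
    """
    # Normalize case and whitespace so trivial differences do not hide a match.
    normalized = [" ".join(p.lower().split()) for p in paragraphs]
    matches = []
    for i, j in combinations(range(len(normalized)), 2):
        ratio = SequenceMatcher(None, normalized[i], normalized[j]).ratio()
        if ratio >= threshold:
            matches.append((i, j, ratio))
    # Most similar pairs first: these are the strongest re-use candidates.
    return sorted(matches, key=lambda m: m[2], reverse=True)


if __name__ == "__main__":
    # Made-up sample paragraphs standing in for a real documentation set.
    docs = [
        "Click Save to store your changes.",
        "Click Save to store all your changes.",
        "Restart the server after installation.",
    ]
    for i, j, ratio in similar_paragraphs(docs):
        print(f"paragraphs {i} and {j}: similarity {ratio:.2f}")
```

Comparing every pair grows quadratically with the number of paragraphs, so a very large doc set would probably need a cheaper first pass (for example, grouping paragraphs by shared words) before running the detailed comparison.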
Available for contract and full-time opportunities.