Re: How does one find repeated sentences within a number of documents

Subject: Re: How does one find repeated sentences within a number of documents
From: Phil Snow Leopard <philstokes03 -at- googlemail -dot- com>
To: David Harrison <dharrison -at- moldmasters -dot- com>, "techwr-l -at- lists -dot- techwr-l -dot- com (techwr-l -at- lists -dot- techwr-l -dot- com)" <techwr-l -at- lists -dot- techwr-l -dot- com>
Date: Thu, 12 Jan 2012 23:35:27 +0700

On Mac OS X, we have TextWrangler.app, whose Grep searches can do that.

I'm not sure if there's a non-Mac equivalent, however.
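
For anyone off the Mac, here is a rough sketch of the same idea in Python. It is only an illustration, not a tested tool: it assumes the Word and InDesign content has already been exported to plain text, the sentence splitter is deliberately naive (it breaks on ., ! or ? followed by whitespace), and the function names are made up for this example.

```python
import re
from collections import Counter, defaultdict


def split_sentences(text):
    """Naive splitter (an assumption): break after ., ! or ? plus whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text)
    # Collapse whitespace and lowercase so trivially different copies match.
    return [" ".join(p.split()).lower() for p in parts if p.strip()]


def repeated_sentences(docs):
    """docs maps a document name to its plain text.

    Returns {sentence: sorted document names} for every sentence
    that turns up in two or more documents.
    """
    seen = defaultdict(set)
    for name, text in docs.items():
        for sentence in split_sentences(text):
            seen[sentence].add(name)
    return {s: sorted(names) for s, names in seen.items() if len(names) > 1}


def repetition_ratio(docs):
    """Share of all sentence occurrences whose sentence appears more than once."""
    counts = Counter()
    for text in docs.values():
        counts.update(split_sentences(text))
    total = sum(counts.values())
    if total == 0:
        return 0.0
    repeated = sum(c for c in counts.values() if c > 1)
    return repeated / total


if __name__ == "__main__":
    # Hypothetical sample data standing in for exported documents.
    sample = {
        "manual_a.txt": "Turn off the power. Remove the cover.",
        "manual_b.txt": "Turn off the power. Check the fuse.",
    }
    for sentence, names in repeated_sentences(sample).items():
        print(sentence, "->", ", ".join(names))
    print("repetition ratio:", repetition_ratio(sample))
```

A low ratio across a representative batch of documents would suggest the text is mostly unique; a high one would suggest reuse is worth a closer look. Real documents would need a better sentence splitter than this one.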


On 12 Jan 2012, at 23:30, David Harrison wrote:

> We are looking at a large project where we need to identify whether translation memory or conversion to a write-once-use-many application (such as Flare, Author-it, etc.) could pay dividends.
> My initial step is working out how to analyse a lot of various docs to see if there is much text repetition. (I think the granularity should be at sentence level.) After all, if the vast majority of the documents contain fairly unique text, then TM or w-o-u-m would not really be worth considering.
> Does anyone know of any application or methodology that would enable us to do this?
> Right now we are at the very first steps and have little raw data, although we do know that the main tools for document preparation appear to be MS Word and Adobe InDesign.
> Eventually, we need to find out which side of the fence we are on.
>
> Thanks all.
> David


Tech Writer:
http://applehelpwriter.com

Critical Thinking & Philosophy:
http://essentialthinking.wordpress.com/


