TechWhirl (TECHWR-L) is a resource for technical writing and technical communications professionals of all experience levels and in all industries to share their experiences and acquire information.
For two decades, technical communicators have turned to TechWhirl to ask and answer questions about the always-changing world of technical communications, such as tools, skills, career paths, methodologies, and emerging industries. The TechWhirl Archives and magazine, created for, by and about technical writers, offer a wealth of knowledge to everyone with an interest in any aspect of technical communications.
Subject:Re: Indexing and searching a webpage of PDFs? From:Emoto <emoto1 -at- gmail -dot- com> To:salt -dot- morton -at- gmail -dot- com Date:Mon, 22 Jan 2018 14:49:07 -0500
On Mon, Jan 22, 2018 at 2:40 PM, Chris Morton <salt -dot- morton -at- gmail -dot- com> wrote:
> I belong to an association that publishes a quarterly newsletter. Present
> and past issues are available as PDFs on a certain members-only page.
>
> There is no search function on that page, nor is an index provided.
>
> 1) Is it possible to add a search function that could comb through the
> PDFs, as it they were regular HTML pages?
>
> 2) Or might there be a nifty tool that I could aim at that page and build
> an index? Or maybe download the PDFs and index them locally using some
> nifty tool?
Chris,
Adobe Pro (used to be Acrobat Pro, not sure what it is called today)
can generate a searchable index from a folder full of PDFs. Easy as
pie. The search result presents the search string in context and will
take you to the doc if clicked. I have not tried it across the web,
but the index itself is such a nice thing that it would be worth a
try. I used this functionality to index thousands of pages at a time
and it works well.
Bob
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Visit TechWhirl for the latest on content technology, content strategy and content development | http://techwhirl.com