Image and Index PDFs
A How-To Recipe
October 16, 2006
Office productivity tools such as word processors, spreadsheets and presentations generate extensive and valuable material in every organization. PDF is a common distribution format for these productivity documents, and documents not already in PDF format can easily be converted and published as PDFs.
In many cases, the content of the PDF is frequently accessed information. Viewing these PDF documents directly from the browser, without having to download them, is a much more effcient method of accessing their information. In order to do this, it is necessary to move the documents to the Knowledge Center, have the contents displayed directly to the browser and have them indexed so that they can be searched using KnowledgeDex.
The SQI Collaborative Knowledge Base provides PDF publishing capabilities. The PDF document is linked for download. It is imaged in HTML for quick web browsing and is indexed by KnowledgeDex.
PDF Imaging Steps:
Each step is presented below.
1. Create New Page
The first step is to create a new page that will contain the PDF document. To do this, go to the page that will be the parent of the new page. Edit this page to create the link for the new page. For example:
[:NewPage: Title of New Page for PDF Document]
Save the edit and click on the new link (displayed in red becuase the page does not yet exist). Select Create blank page option from the new page dialog box. Then copy the following and paste into the new page.
= Title = [[DisplayPDF(src=attachmentName, dpi=72, pages=5, title=TitleGoesHere)]]
2. Upload PDF file
Click on Attachments at bottom of topic page. Click on Browse and then select the PDF to upload. Click on the Upload button`.
Figure 1: Highlighted attachment name
3. Edit the DisplayPDF Pluggin
Copy the attached file name (highlighted in the above figure), click on Edit(text) and paste the attachment name into the DisplayPDF pluggin right after src=, also put the PDF title right after title= (note this is not in quotes). See example below.
= Chapter 1 = [[DisplayPDF(src=Chaptter-1-v6.pdf, dpi=72, pages=99, title=Chapter 1)]]
The default is for the DisplayPDF to image 5 pages. In the example we want all the pages imaged so 99 replaces the 5 right after pages=.
Save the page.
On the next scheduled KnowledgeDex indexing cycle all the words withhin the PDF will automatically be added to the index.
More detailed on Options
The standard form and default values of the DisplayPDF pluggin are shown below.
[[DisplayPDF(src=attachmentNane.pdf, dpi=72, quality=png, pages=5, title=pdfTitle)]]
The options (parameters) for the DisplayPDF plugin are:
Two common changes to the default values are: