2 messages in com.googlegroups.google-enterprise-developerGSA search of PDFs, Word Docs, etc
FromSent OnAttachments
hfairfield24 Oct 2006 16:18 
Jeff Ragusa29 Oct 2006 18:21 
Subject:GSA search of PDFs, Word Docs, etc
From:hfairfield (hcfa@gmail.com)
Date:10/24/2006 04:18:13 PM
List:com.googlegroups.google-enterprise-developer

Hello,

We have a web front end into a content management system that manages html, word docs, excel documents, etc. From what I can see, these have been catalogued by the appliance, but when you try and search them by the content in the pdfs they don't show up in the search results.

currently our web fromtend doesn't specify the file type of the document. The url just has a guid that specifies the document like this:

https://rimtest/RimWebApp/google/googlecontent.aspx?contentid=0a028ace-0477-491b-80c5-37f43d2d9631

where the aspx just gets the pdf file from a secure location.

Does google need to know what type of file it is getting somehow?

thx!

Hugh