Search code examples
google-search-appliance

Report of all indexed PDFs with URL and Title


I have a GSA indexing around 15,000 documents. After using the GSA on our main website for sometime, we have realized a large amount of our PDFs are named incorrectly.

In order to correct the error we would like to obtain a list from the GSA of all PDFs, with their URL, and their title in search results.

Is such a report possible to pull from the GSA?


Solution

  • You can export all URLs from the GSA and then use a text editor (or spreadsheet application) to view them. If you have a large # of URLs then you might need to open first in a plain text editor and pull out only the lines with PDF in them.