Search code examples
pdfunicodeutf-8character-encoding

Searching for greek characters within a PDF document


I don't know how to search for greek letters in a pdf. How can I do this? Ideally I could do this within a pdf viewer software package.

From forum postings I've seen, the greek letters fall outside the standard unicode characters which are included in the pdf specification.


Solution

  • I have worked out what I was doing wrong, and it was pretty basic.

    I can search for a Greek letter just by using the character map or switching to the Greek alphabet keyboard e.g αβγ.

    This generally finds the Greek characters. Many PDFs do not recognise Greek characters correctly though and instead have a gobbledygook approximation to the character with other shapes. This presumably depends on how the PDFs were made.