What's a good method for extracting text from a PDF using C# or classic ASP (VBScript)?

Is there a good library for extracting text from a PDF? I'm willing to pay for it if I have to.

Something that works with C# or classic ASP (VBScript) would be ideal and I also need to be able to separate the pages from the PDF.

This question had some interesting stuff, especially pdftotext but I'd like to avoid calling to an external command-line app if I can.

Solution

You can use the IFilter interface built into Windows to extract text and properties (author, title, etc.) from any supported file type. It's a COM interface so you would have use the .NET interop facilities.

You'd also have to download the free PDF IFilter driver from Adobe.

How to generate thumb pdf file only first page in node js
How can I determine if a file is a PDF file?
How to embed a local pdf file in mkdocs generated website on github-pages?
Unable to merge pdf watermark at the correct position
Is there library to convert text, image to pdf In react-native?
How to load and present a PDF file from the web in Flutter
Recommended way to embed PDF in HTML?
Full page width image in PDF made in quarto
pandoc doesn't text-wrap code blocks when converting to pdf
Problem with rendering double/float in R Markdown for PDF
Reading data from PDF files into R
How to create "Next" and "Previous" buttons for navigating PDF pages with Autodesk Viewer?
Elixir error when trying to upload pdf from pdf generator
PDFlib use spot color as text background
How to get rid of the red box around hyperlink in PDF?
How to rotate only Table, using borb (PDF library) python?
How to insert a page break in HTML so wkhtmltopdf parses it?
How do I determine the size of a pdf with pdf.js so I can scale to the screen size?
Why Puppeteer PDF generation not working on Windows?
How to interpret signatureCoversWholeDocument() == false?
How to skip choosing folder in microsoft pdf printer?
Efficient multipage PDF creation using matplotlib subplots in Python
! LaTeX Error: Missing \begin{document}. Error: LaTeX failed to compile paper_template.tex
Understanding PDF operators - for iOS app
send DDE command to pdf document in SUMATRA PDF on subform
How to add page number for every page in laravel dompdf?
Where can I a mapping of Identity-H encoded characters to ASCII or Unicode characters?
Reading ascii data with iText fails
How do I center the entire DataTable in the pdf using JQuery pdfHtml5
Signing a PDF with an external signature using a smartcard using iTextSharp 5 gives formatting errors C#