Search code examples
ng2-pdfjs-viewer

ng2-pdfjs-viewer for extracting text


Is there any possibility of using ng2-pdfjs-viewer for extracting solely the text of a pdf file, e.g. as a string? I want to extract text and regex it for cutting out certain paragraphs depending on the text inside of them.

Thanks in advance


Solution

  • ng2-pdfjs-viewer is an angular wrapper over pdfjs and viewerjs. The capabilities of pdfjs is exposed using an attribute PDFViewerApplication and PDFViewerApplicationOptions. If extracting text is feasible using pdfjs, these attributes can be used to call internal functions whic does that.

    PS: I authored ng2-pdfjs-viewer