Search code examples
marklogic

Do we have stand alone application for pdf to epub3?


There is JavaScript function in MarkLogic xdmp.pdfConvert. I'm trying to convert pdf to epub3.

Basically I want to convert pdf into epub format (epub consist of XHTML page include there css and images). So the function in MarkLogic xdmp.pdfConvert which takes pdf as input convert into XHTML as output.

Can you suggest the steps and requirements for how to execute that function? I want to see the quality of the converted output of XHTML as comparing with input pdf.


Solution

  • There is no built-in conversion from PDF to epub3 in MarkLogic. The pdfConvert function's primary purpose is to make the content of a PDF searchable.

    I'm not familiar enough with the epub3 format to make a suggestion about generating it inside of MarkLogic, but my starting point would be to look for an external library that can already do it and use that.


    Updating based on additional information:

    You asked a similar question in the comments on the pdfCovert page (about how to use the function) and I pointed you to the Understanding and Using the Default Conversion Option section of the CPF guide. Earlier in that same chapter, there is a section called Installing the Conversion Pipelines and Framework. Please try those steps and let us know if you have a specific follow-up question.