Search code examples
htmlpdfdompdfbox

Getting java.lang.UnsupportedOperationException at org.apache.pdfbox.pdmodel.graphics.color.PDPattern.toRGB


I am using pdfbox-2.0.9 in my java application to convert a PDF file to html. But I am getting

java.lang.UnsupportedOperationException
    at org.apache.pdfbox.pdmodel.graphics.color.PDPattern.toRGB(PDPattern.java:95)
    at org.fit.pdfdom.PathDrawer.pdfColorToColor(PathDrawer.java:133)
    at org.fit.pdfdom.PathDrawer.clearPathGraphics(PathDrawer.java:79)
    at org.fit.pdfdom.PathDrawer.drawPath(PathDrawer.java:59)
    at org.fit.pdfdom.PDFDomTree.createPathImage(PDFDomTree.java:403)
    at org.fit.pdfdom.PDFDomTree.renderPath(PDFDomTree.java:251)
    at org.fit.pdfdom.PDFBoxTree.processOperator(PDFBoxTree.java:499)
    at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:503)
    at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:477)
    at org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(PDFStreamEngine.java:181)
    at org.apache.pdfbox.contentstream.operator.DrawObject.process(DrawObject.java:65)
    at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:848)
    at org.fit.pdfdom.PDFBoxTree.processOperator(PDFBoxTree.java:542)
    at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:503)
    at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:477)
    at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
    at org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:139)
    at org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
    at org.fit.pdfdom.PDFBoxTree.processPage(PDFBoxTree.java:208)
    at org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319)
    at org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266)
    at org.fit.pdfdom.PDFDomTree.createDOM(PDFDomTree.java:218)
    at com.demo.pdf.converter.PdfProcessor.convertToHtml(PdfProcessor.java:87)

The pdf I am trying to convert can be accessed from here.


Solution

  • This issue is resolved in PDF2Dom v1.9. I tried the pdf you provided with this version and it's getting converted appropriately.
    No exceptions thrown. Please confirm by updating PDF2Dom to v1.9
    You can find latest dependency here .