linux pdf image-processing imagemagick djvu

efficient image compression for pdf embedding with linux

I would like to compress scanned text (monochrome or few colours) and store it in pdf (maybe djvu) files. I remember that I got very good results with Windows/Acrobat and "ZRLE" compressed monochrome tiff embedded into pdf. The algorithm was loossless as far as I remember. Now I search a way to obtain good results on linux. It should be storage saving and avoid loss (I do not mind loosing colours, but I do not want e.g. jpeg compression which would create noisy results for text scans). I need it for batch conversion, so I was thinking of the ImageMagick convert command. But which output format should I use so I get good results and to be able to embed it into pdf files (for example using pdflatex)? Or is it generally better to use djvu files?

Solution

jbig2enc encoder for images using jbig2 compression, was originally written for GoogleBooks by Adam Langley

https://github.com/agl/jbig2enc

I forked to include latest improvements By Rubypdf and others

https://github.com/DingoDog/jbig2enc

I also built several binaries of jbig2enc for puppy linux (it can be working also on other distributions)

http://dokupuppylinux.info/programs:encoders