I have a large database of XHTML that I wish to render as PDFs and/or RTF using CSS. Is there an off-the-shelf/cheap solution that could do this at scale? Large meaning terabytes. Need something that is robust and good at handling large volumes of files.
This is a difficult problem on the scale you're talking. I suggest looking at http://code.google.com/p/wkhtmltopdf/ for ideas on how you'd do the individual run. However, exec'ing a shell script each time you want to convert a document is probably inadequate for your needs, and so splicing this into some sort of daemon or mass-conversion utility is my suggested approach.