Search code examples
databasearchitecturefull-text-searchmimefile-storage

What architecture would you use to store 10 billion MIME messages and make it deletable and full text searchable incl. attachments


I would like to use components that are free for commercial use.

I looked at a Lucene and MongoDB combo but wonder if there are better approaches, ideally a single system.


Solution

  • Sphinx can also handle billions of documents http://sphinxsearch.com/info/powered/

    (although I also use Lucene and cannot tell whether Sphinx is better)