I am going through the white paper of Google Dremel. I came to know it converts complex data into columnar data layout.
At what location is this data stored?
As Drill has no central metadata repository, I assume it must be in-memory.
Therefore how does Drill handle this data when I have billions of rows?
To get complete, consistent query results from billions of rows, you'll use a distributed file system connected to multiple Drillbits, simulate a distributed file system by copying files to each node, or use an NFS volume, such as Amazon Elastic File System. Drill performs performant querying of big data using a number of techniques, including these:
For more information, see http://drill.apache.org/docs/performance/.