Search code examples
google-cloud-platformgoogle-cloud-dataflowgoogle-cloud-bigtablebigtable

While migrating huge database in GCP, BigTable created from sequence file created from another BigTable having different size


I have a huge database stored in Bigtable in GCP. I am migrating the bigtable data from one account to another GCP Account using DataFlow so, creating a sequence file from a bigtable and accessing it from another account using dataflow. but the new table created is having different size than the original table. Already Storing bigtable's sequence file and temporary files at different bucket locations so that it(dataflow) does not upload any un-necessary data into new bigtable. Original bigtable size is 321.7 GB but the new table is showing 340.7 GB


Solution

  • Your data needs to be compacted/compressed which will happen automatically. If you just did the migration, over the next few days the backend should handle that for you and you should see a more similar table size.