Search code examples
postgresqlamazon-web-servicesreplicationamazon-rds

What's the most efficient way to replicate data from Postgres (RDS) to AWS Redshift while keeping data integrity


I'm looking for an efficient way to replicate data from AWS RDS to Redshift. Based on my research, copying to s3 and loading it from there seems to be the recommended approach. However, this doesn't guarantee data integrity as it produces duplicate rows sometimes which needs to be deduped. Is there a better way of doing this, without using a third party vendor?


Solution

  • You can use the AWS Database Migration Service to continuously replicate from RDS Postgres to RedShift. You can even alter the schema with the Schema Conversion Tool if you need to.