In monthly increment loaded DWH I have task to create process to be able reload random month in the DWH.
Lets say reload data for February 2021 in existing DWH.
If I reload data for February 2021 on 2021/08/15, my SCD2 dimension Customer will end up like this:
I could have wrong dimension attributes until next load. And Dates in DateFrom/DateTo will be messed.
Questions:
Working on sql server using SSIS ETL tool.
Thanks
If you are just running your existing process then in order to reload data you would need to rollback your DWH to the point prior to the incorrect data, apply the updated dataset again and then reapply all the subsequent datasets. Obviously, this is a significant piece of work so not a good idea unless you have no other choice and definitely not something you’d want to run regularly.
If you do want to be able to re-apply a single dataset from the past then you’d need to write a process to do this e.g.