Have a number of referential integrity issues.
There are 5 dimensions which join to multiple fact tables (Galaxy schema rather than just Star)
I have been to
Phil Seamark page on Referential integrity
This show me how to identify the issues.
What is considered best practice to resolve?
My first item is Sales and Products
Sales has 6,000,000 Rows
Product has 265,000
I am missing 1244 Rows in the produuct table that are in the Sales table
Assuming that I can t fix this in the data source is there a best practice for this?
I could Merge on a left join from the sales to the product and if there is none then use a default of -99?
As I am having issues with refresh times any complexity needs to be efficient.
Many thanks again