For PIT tables, do we have to repeat the hashkey for each Satellite table along with the load_dts, if the DV is implemented in Snowflake database? In other DBs to help the query plan we might want to do that but is it necessary for Snowflake. I see examples in Dan’s book and also Patrick Cuba’s article, they repeat the hashkey for each satellite. Please advise. Thanks in advance!
Or - which I assume - just about the hahskey of the satellite.
In that case, I would say yes (if you use hashes at all), because a specific satellite might not contain any data about that business object (hub-record). So, you might have to store the hash of the dummy-record.
the extra hkeys that seems equal/extras, is mainly for the single INSERT of the ghost row (hkey=00000000000000000000000000000000 (32 zeros)) SAT record that all new created SATs shall insert upon SAT ddl initial execution, so for example PITs can have either entitiy hkey ornthe specific SAT hkey.
By definition it is a snapshot of the state of your data at a point in time (PIT). Yes the value is repeated but what does it matter wrt columnar storage?