Zetafall

Diffing datasets

February 26th, 2022

Adam Marcus blogged about his open source implementation of DIFF, a pure-SQL algorithm for explaining the reasons behind the differences of two data sets.

The post uses an example of sensor data, because it wants to replicate the findings of a different non-SQL algorithm that does the same thing. It’s very interesting to try to use this for monitoring data or financial portfolios though. I have to find time to try some of that out.