Posting this on the off-chance it's useful to someone else...
On my current project, we have a tonne of Spark logic which needs to aggregate and (generally) sum data which is grouped by different keys, in order to then apply business rules at the record level (i.e. if the total of amount x, when...
Continue reading...