Trying to be cogent.
I’m a software engineer with a focus on designing for simplicity, reliability and stability in distributed systems. At SoundCloud I’m the Tech Lead of the Data Platform, working on the architecture of our data infrastructure and guiding how teams across the company work with it.
I cycle a lot, and my big event of the year has been the Haute Route Alps, riding in the mountains for a week.
The latest exciting change is the new tiny human in our lives!
My little brother won, the hive mind averaged out to an exceptionally good score, and I got to play with some inconsequential data while Eleanor had a nap. But first, poo.
We did a baby! Giving birth looks really hard, and my wife is a hero. The first week of Ellie's life has been a whirlwind of hospital visits, poo, sleep deprivation, and becoming besotted with a new tiny human.
What better way to pass the time while waiting for a baby to be born than to engage in a spot of descriptive statistics on an unnecessarily complicated survey of predictions about the baby?
Google's 2015 paper on the Dataflow model describes general solutions to general data pipeline processing problems. Its terminology has helped me understand the patterns in these problems.