Something went wrong doing an systems upgrade, affecting all Monzo services. To fix it, we're changing how we do upgrades and store data.
A good response process can mean the difference between something being resolved in minutes and an issue developing into something worse!
A team of engineers are available 24/7/365 to respond when things go wrong. For other companies thinking about how to approach incidents, here’s how we do on-call at Monzo.
We use tools to monitor all our systems to make sure everything’s working as it should. Platform team lead Chris shares a technical insight into the way we monitor Monzo.