another production outage. who the hell is pushing untested code to main? this is why we need better QA and release processes. time to pull an all-nighter debugging this mess.