I would have been pulling my non-existant hair out today if I was the sysadmin over at YouTube. After a 6 hour unplanned outage due to “database issues”.
As a sysadmin for a major media company that has experienced outages before, I begin sweating profusely the moment I get my first alert of an outage. Forty-five minutes is too long for me, despite how small of a downtime percentage that represents. I can’t imagine running the servers at YouTube and the stress that places on an individual.
Interesting question though. If you are responsible for the uptime of a high-demand service, do you admit failure and hope that fate smiles fondly, or do you just keep plugging away when service is restored? If it’s a self-inflicted wound, do you admit that or if it is a third party, do you blame them?