Linking pages
- How one line of code caused a $60 million loss https://engineercodex.substack.com/p/how-one-line-of-code-caused-a-60 239 comments
- How one line of code caused a $60 million loss https://read.engineerscodex.com/p/how-one-line-of-code-caused-a-60 110 comments
- GitHub - jnv/lists: The definitive list of lists (of lists) curated on GitHub and elsewhere https://github.com/jnv/lists 24 comments
Linked pages
- Google - Site Reliability Engineering https://landing.google.com/sre/book/index.html 280 comments
- GitHub - hjacobs/kubernetes-failure-stories: Compilation of public failure/horror stories related to Kubernetes https://github.com/hjacobs/kubernetes-failure-stories 222 comments
- Why you should pick strong consistency, whenever possible | Google Cloud Blog https://cloudplatform.googleblog.com/2018/01/why-you-should-pick-strong-consistency-whenever-possible.html 188 comments
- GitHub - open-guides/og-aws: 📙 Amazon Web Services — a practical guide https://github.com/open-guides/og-aws 122 comments
- Things I Learned Managing Site Reliability for Some of the World’s Busiest Gambling Sites – zwischenzugs https://zwischenzugs.wordpress.com/2017/04/04/things-i-learned-managing-site-reliability-for-some-of-the-worlds-busiest-gambling-sites/ 114 comments
- Who's On Call? — Susan Fowler http://www.susanjfowler.com/blog/2016/9/6/whos-on-call 112 comments
- The Ops Identity Crisis — Susan Fowler http://www.susanjfowler.com/blog/2016/10/13/the-ops-identity-crisis 107 comments
- Google SRE book https://danluu.com/google-sre-book/ 99 comments
- Etsy Engineering | Blameless PostMortems and a Just Culture https://codeascraft.com/2012/05/22/blameless-postmortems/ 89 comments
- Google - Site Reliability Engineering https://landing.google.com/sre/interview/ben-treynor.html 86 comments
- Google Cloud Platform Blog: SRE fundamentals: SLIs, SLAs and SLOs https://cloudplatform.googleblog.com/2018/07/sre-fundamentals-slis-slas-and-slos.html 83 comments
- DevOps Topologies http://web.devopstopologies.com/ 78 comments
- The Infrastructure Behind Twitter: Scale https://blog.twitter.com/2017/the-infrastructure-behind-twitter-scale 78 comments
- Why Percentiles Don’t Work the Way You Think - Orange Matter https://www.vividcortex.com/blog/why-percentiles-dont-work-the-way-you-think 77 comments
- PRINCIPLES OF CHAOS ENGINEERING - Principles of chaos engineering http://principlesofchaos.org/ 75 comments
- GitHub - sindresorhus/awesome: 😎 Awesome lists about all kinds of interesting topics https://github.com/sindresorhus/awesome 68 comments
- GitHub - danluu/post-mortems: A collection of postmortems. Sorry for the delay in merging PRs! https://github.com/danluu/post-mortems 60 comments
- SRE at Google: How release canaries can save your bacon | Google Cloud Blog https://cloudplatform.googleblog.com/2017/03/how-release-canaries-can-save-your-bacon-cre-life-lessons.html 60 comments
- PagerDuty Incident Response Documentation https://response.pagerduty.com/ 60 comments
- How incident management is done at Google | Google Cloud Blog https://cloudplatform.googleblog.com/2017/02/Incident-management-at-Google-adventures-in-SRE-land.html 43 comments
Related searches:
Search whole site: site:sre.xyz
Search title: Site Reliability Engineering Resources
See how to search.