References
Michael Stonebraker and UΔur Γetintemel: β'One Size Fits All': An Idea Whose Time Has Come and Gone,β at 21st International Conference on Data Engineering (ICDE), April 2005.
Walter L. Heimerdinger and Charles B. Weinstock: βA Conceptual Framework for System Fault Tolerance,β Technical Report CMU/SEI-92-TR-033, Software Engineering Institute, Carnegie Mellon University, October 1992.
Ding Yuan, Yu Luo, Xin Zhuang, et al.: βSimple Testing Can Prevent Most Critical Failures: An Analysis of Production Failures in Distributed Data-Intensive Systems,β at 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI), October 2014.
Yury Izrailevsky and Ariel Tseitlin: βThe Netflix Simian Army,β netflixtechblog.com, July 19, 2011.
Daniel Ford, FranΓ§ois Labelle, Florentina I. Popovici, et al.: βAvailability in Globally Distributed Storage Systems,β at 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI), October 2010.
Brian Beach: βHard Drive Reliability Update β Sep 2014,β backblaze.com, September 23, 2014.
Laurie Voss: βAWS: The Good, the Bad and the Ugly,β blog.awe.sm, December 18, 2012.
Haryadi S. Gunawi, Mingzhe Hao, Tanakorn Leesatapornwongsa, et al.: βWhat Bugs Live in the Cloud?,β at 5th ACM Symposium on Cloud Computing (SoCC), November 2014. doi:10.1145/2670979.2670986
Nelson Minar: βLeap Second Crashes Half the Internet,β somebits.com, July 3, 2012.
Amazon Web Services: βSummary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region,β aws.amazon.com, April 29, 2011.
Richard I. Cook: βHow Complex Systems Fail,β Cognitive Technologies Laboratory, April 2000.
Jay Kreps: βGetting Real About Distributed System Reliability,β blog.empathybox.com, March 19, 2012.
David Oppenheimer, Archana Ganapathi, and David A. Patterson: βWhy Do Internet Services Fail, and What Can Be Done About It?,β at 4th USENIX Symposium on Internet Technologies and Systems (USITS), March 2003.
Nathan Marz: βPrinciples of Software Engineering, Part 1,β nathanmarz.com, April 2, 2013.
Michael Jurewitz: βThe Human Impact of Bugs,β jury.me, March 15, 2013.
Raffi Krikorian: βTimelines at Scale,β at QCon San Francisco, November 2012.
Martin Fowler: Patterns of Enterprise Application Architecture. Addison Wesley, 2002. ISBN: 978-0-321-12742-6
Kelly Sommers: βAfter all that run around, what caused 500ms disk latency even when we replaced physical server?β twitter.com, November 13, 2014.
Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, et al.: βDynamo: Amazon's Highly Available Key-Value Store,β at 21st ACM Symposium on Operating Systems Principles (SOSP), October 2007.
Greg Linden: βMake Data Useful,β slides from the presentation at Stanford University Data Mining class (CS345), December 2006.
Tammy Everts: βThe Real Cost of Slow Time vs Downtime,β slideshare.net, November 5, 2014.
Jake Brutlag: βSpeed Matters,β ai.googleblog.com, June 23, 2009.
Tyler Treat: βEverything You Know About Latency Is Wrong,β bravenewgeek.com, December 12, 2015.
Jeffrey Dean and Luiz AndrΓ© Barroso: βThe Tail at Scale,β Communications of the ACM, volume 56, number 2, pages 74β80, February 2013. doi:10.1145/2408776.2408794
Graham Cormode, Vladislav Shkapenyuk, Divesh Srivastava, and Bojian Xu: βForward Decay: A Practical Time Decay Model for Streaming Systems,β at 25th IEEE International Conference on Data Engineering (ICDE), March 2009.
Ted Dunning and Otmar Ertl: βComputing Extremely Accurate Quantiles Using t-Digests,β github.com, March 2014.
Gil Tene: βHdrHistogram,β hdrhistogram.org.
Baron Schwartz: βWhy Percentiles Donβt Work the Way You Think,β solarwinds.com, November 18, 2016.
James Hamilton: βOn Designing and Deploying Internet-Scale Services,β at 21st Large Installation System Administration Conference (LISA), November 2007.
Brian Foote and Joseph Yoder: βBig Ball of Mud,β at 4th Conference on Pattern Languages of Programs (PLoP), September 1997.
Frederick P Brooks: βNo Silver Bullet β Essence and Accident in Software Engineering,β in The Mythical Man-Month, Anniversary Edition, Addison-Wesley, 1995. ISBN: 978-0-201-83595-3
Ben Moseley and Peter Marks: βOut of the Tar Pit,β at BCS Software Practice Advancement (SPA), 2006.
Rich Hickey: βSimple Made Easy,β at Strange Loop, September 2011.
Hongyu Pei Breivold, Ivica Crnkovic, and Peter J. Eriksson: βAnalyzing Software Evolvability,β at 32nd Annual IEEE International Computer Software and Applications Conference (COMPSAC), July 2008. doi:10.1109/COMPSAC.2008.50
Last updated