<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=utf-8">
<META NAME="Generator" CONTENT="MS Exchange Server version 6.5.7654.12">
<TITLE>Re: [Outages-discussion] [outages] Fwd: RE: FYI Netflix is down</TITLE>
</HEAD>
<BODY>
<!-- Converted from text/plain format -->
<P><FONT SIZE=2>Does anyone know what actually caused the failure? I know there was a "power outage" due to weather, but presumably the physical site would have also had generator backup so that must have also failed?<BR>
<BR>
[Sent using Blackberry Messaging]<BR>
<BR>
----- Original Message -----<BR>
From: outages-discussion-bounces@outages.org <outages-discussion-bounces@outages.org><BR>
To: Paul Ferguson <fergdawgster@gmail.com><BR>
Cc: outages-discussion@outages.org <outages-discussion@outages.org><BR>
Sent: Sat Jun 30 01:46:05 2012<BR>
Subject: Re: [Outages-discussion] [outages] Fwd: RE: FYI Netflix is down<BR>
<BR>
That is a good point. Why didn't they survive in an adjacent AZ in<BR>
Ashburn? Reports say only a single AZ lost power.<BR>
<BR>
On Fri, Jun 29, 2012 at 10:15 PM, Paul Ferguson <fergdawgster@gmail.com> wrote:<BR>
> I'm looking forward to additional details on this outage -- Netflix<BR>
> has previously avoided EC2 outages due to their creative use of Chaos<BR>
> Monkey:<BR>
><BR>
> <A HREF="http://techblog.netflix.com/2011/07/netflix-simian-army.html">http://techblog.netflix.com/2011/07/netflix-simian-army.html</A><BR>
><BR>
> FYI,<BR>
><BR>
> - ferg<BR>
><BR>
> On Fri, Jun 29, 2012 at 10:07 PM, Kevin Blackham <blackham@gmail.com> wrote:<BR>
><BR>
>> On Fri, Jun 29, 2012 at 9:40 PM, Paul Ferguson <fergdawgster@gmail.com> wrote:<BR>
>>> <A HREF="http://venturebeat.com/2012/06/29/amazon-outage-netflix-instagram-pinterest/">http://venturebeat.com/2012/06/29/amazon-outage-netflix-instagram-pinterest/</A><BR>
>><BR>
>> "The outage underscores the vulnerabilities of depending on the public<BR>
>> cloud versus using your own data centers."<BR>
>><BR>
>> Let me fix that: "The outage underscores the lack of attention to<BR>
>> failure tolerance by many affected services." Building my own sites<BR>
>> isn't going to keep them from failing. Every datacenter will fail<BR>
>> eventually.<BR>
>><BR>
>> Granted, when an Amazon AZ fails, history has shown the thrash induced<BR>
>> by recovery mechanisms can also take out adjacent AZs (e.g. network<BR>
>> overloads vs. EBS), but that doesn't stop an architect from<BR>
>> considering regional redundancy.<BR>
>> _______________________________________________<BR>
>> Outages-discussion mailing list<BR>
>> Outages-discussion@outages.org<BR>
>> <A HREF="https://puck.nether.net/mailman/listinfo/outages-discussion">https://puck.nether.net/mailman/listinfo/outages-discussion</A><BR>
><BR>
><BR>
><BR>
> --<BR>
> "Fergie", a.k.a. Paul Ferguson<BR>
> fergdawgster(at)gmail.com<BR>
_______________________________________________<BR>
Outages-discussion mailing list<BR>
Outages-discussion@outages.org<BR>
<A HREF="https://puck.nether.net/mailman/listinfo/outages-discussion">https://puck.nether.net/mailman/listinfo/outages-discussion</A><BR>
</FONT>
</P>
</BODY>
</HTML>