advertisement Crashes Again On Monday

Written by Evan Schuman
June 10th, 2008

For the second consecutive workday, suffered a major crash on Monday (June 9), with the increasingly unlikely scenarios explaining why the historically robust site is failing.

The cause of the crash, which apparently took the weekend off after bringing down Amazon completely on Friday for almost three hours before seriously (but less severely) slowing down Amazon for several hours on Monday, was unclear.

For itself, the question of what caused the crash was one of those good news/bad news situations. Good news: Amazon knows what caused the crash. Bad news: They won’t say. (This appears to be the closest Amazon plans on getting to issuing a statement explaining the outage.)

"We’re not commenting as to the cause of Friday’s or Monday’s site issues. As a policy, we don’t discuss such matters," said Patty Smith,’s director of corporate communications. "We are aware of the cause. We’re just not disclosing the issue."

Asked about some of the reports of various scenarios for the outage, Smith said they weren’t going to go there. "I’ve seen several comments from Keynote and other third parties speculating as to the cause, but I’m afraid I’m not going to confirm or deny any speculation," Smith said. "Suffice to say, that on those rare occasions when our site experiences problems, we work to resolve the issue as quickly as possible."

I quoted that last line only because it’s such a classic political line, where the candidate throws in a mom-and-apple pie line as though someone has challenged him on it, even though he was the one who brought it up. "I can’t really comment about those undercover FBI tapes of me taking bribes, but I will stress that no one loves my constituents as much as I do. And I vigorously refute anyone who says that the 13th Congressional District is made up of anyone other than kind-hearted, wonderful people."

In short, no one has suggested that Amazon was being slow to fix this issue. The concern is with not saying what caused it. That’s just the kind of confidence-building message Amazon wants to send out to its suppliers and to signal to its customers. Why spoil that wonderful "this could happen again tomorrow" surprise that is at the heart of E-Commerce enjoyment?

Web site performance tracking firm Keynote said that Friday’s crash was most likely due to an overly sophisticated site operation at Amazon. In a sea of unlikely scenarios, that is probably the least unlikely. Amazon has historically deployed the most advanced capabilities of any major E-Commerce site, which could set in motion a domino effect, with a small error anywhere in one app potentially cascading into a major crash.

The only problem with that theory is that Amazon’s systems have had such a level of sophistication for years, maintaining an almost spotless record of high availability. Why would it suddenly crack now? The most likely answer is that they have recently added some new piece of software that brought with it the instability.

Another possible explanation is excessive load—some new game programs were released on Friday, and Monday saw Apple updating its popular iPhone—but that’s the kind of thing that Amazon has handled quite well historically.

Some reports suggested some kind of attack, especially a Denial of Service assault. But Shawn White, Keynote’s director of external operations, said the data he’s been monitoring simply doesn’t fit the DOS pattern.

"When that (DOS) kind of situation happens, the site slowly gets slower and slower. That’s not what happened here," White said. "DOS doesn’t look likely at all."

But that still theoretically leaves open the possibility of some other form of attack, possibly some sort of planted malware. There’s no hard data available thus far, however, to support that theory. The only information that White saw that fueled the malware rumors were reported slowdowns at eBay and the Internet Movie Database, but those could easily have been unrelated.

White is sticking—for now—with his complexity theory. "As a site becomes more and more complex, each moving part becomes more and more dependent upon every other part," he said. "If one piece has a mistake in it, a typo in it," that could potentially trigger a collapse.

Monday’s Amazon crash differed from Friday’s in a few respects. First, it was a major slowdown on Monday, as opposed to the total crash on Friday.

The Monday incident started at 1:03 PM (EDT) when, according to Keynote, Amazon’s homepage plunged to 30 percent availability before recovering completely 20 minutes later. At 1:56 PM, its availability again dropped, this time to 68 percent, before recovering at 2:09 PM. It dropped again to 68 percent from 3:43 PM until 4:01 PM.

Another key difference: Friday’s crash impacted only the main site, with no apparent impact on any of its non-U.S. sites or its cloud computing Amazon Web Services site. Monday’s crash also hit Amazon’s United Kingdom site, Keynote’s White said.

"The problems experienced on this site lasted longer and were, in some cases, more dramatic. Beginning at 1:06 PM (EDT), the same ‘Service Unavailable’ error was being seen by European online shoppers," White said. "Availability dropped to 30 percent and then slowly, over the next couple hours, returned to normal. By 3:02 PM (EDT), the UK site returned to normal."

The UK site also acted a little different as shoppers dug more deeply into the site.

"During each outage period, visitors who were able to make it past the homepage and browse for items would have experienced much slower performance and download times. Both sites regularly download completely in less than seven seconds and in many cases, faster than four seconds," White said. "Download times during the periods mentioned slowed by as much as 200 percent. In some cases, Web browsers would have completely timed out, causing visitors to have to reload their browsers, unsure if the site would return."


Comments are closed.


StorefrontBacktalk delivers the latest retail technology news & analysis. Join more than 60,000 retail IT leaders who subscribe to our free weekly email. Sign up today!

Most Recent Comments

Why Did Gonzales Hackers Like European Cards So Much Better?

I am still unclear about the core point here-- why higher value of European cards. Supply and demand, yes, makes sense. But the fact that the cards were chip and pin (EMV) should make them less valuable because that demonstrably reduces the ability to use them fraudulently. Did the author mean that the chip and pin cards could be used in a country where EMV is not implemented--the US--and this mis-match make it easier to us them since the issuing banks may not have as robust anti-fraud controls as non-EMV banks because they assumed EMV would do the fraud prevention for them Read more...
Two possible reasons that I can think of and have seen in the past - 1) Cards issued by European banks when used online cross border don't usually support AVS checks. So, when a European card is used with a billing address that's in the US, an ecom merchant wouldn't necessarily know that the shipping zip code doesn't match the billing code. 2) Also, in offline chip countries the card determines whether or not a transaction is approved, not the issuer. In my experience, European issuers haven't developed the same checks on authorization requests as US issuers. So, these cards might be more valuable because they are more likely to get approved. Read more...
A smart card slot in terminals doesn't mean there is a reader or that the reader is activated. Then, activated reader or not, the U.S. processors don't have apps certified or ready to load into those terminals to accept and process smart card transactions just yet. Don't get your card(t) before the terminal (horse). Read more...
The marketplace does speak. More fraud capacity translates to higher value for the stolen data. Because nearly 100% of all US transactions are authorized online in real time, we have less fraud regardless of whether the card is Magstripe only or chip and PIn. Hence, $10 prices for US cards vs $25 for the European counterparts. Read more...
@David True. The European cards have both an EMV chip AND a mag stripe. Europeans may generally use the chip for their transactions, but the insecure stripe remains vulnerable to skimming, whether it be from a false front on an ATM or a dishonest waiter with a handheld skimmer. If their stripe is skimmed, the track data can still be cloned and used fraudulently in the United States. If European banks only detect fraud from 9-5 GMT, that might explain why American criminals prefer them over American bank issued cards, who have fraud detection in place 24x7. Read more...

Our apologies. Due to legal and security copyright issues, we can't facilitate the printing of Premium Content. If you absolutely need a hard copy, please contact customer service.