Wikimedia's Servers Declare Independence!
From LizardWiki, FastLizard4's wiki and website
The following logs from #wikimedia-tech show Wikimedia's servers failing epically. They were recorded around 00:20, 5 July 2010 (UTC), which was still the 4th of July in the United States.
Logs
[17:10:31] <nagios-wm> PROBLEM - Disk free on lily is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[17:10:52] PROBLEM - SSH on lily is CRITICAL: CRITICAL - Socket timeout after 10 seconds
[17:11:51] PROBLEM - Disk free on mchenry is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[17:12:21] PROBLEM - Host srv167 is DOWN: PING CRITICAL - Packet loss = 100%
[17:12:22] PROBLEM - Host srv163 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv155 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv154 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv166 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv164 is DOWN: PING CRITICAL - Packet loss = 100%
[17:12:31] PROBLEM - Host srv175 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv176 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv179 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv181 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv178 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv182 is DOWN: PING CRITICAL - Packet loss = 100%
[17:12:39] |<-- darkoneko has left irc.freenode.net:7000 (Quit: it reads 'assume good faith', not 'be stupid')
[17:12:41] <nagios-wm> PROBLEM - Host srv183 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv186 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv184 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv185 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:11] PROBLEM - Host storage3 is DOWN: CRITICAL - Host Unreachable (208.80.152.169)
PROBLEM - Host sanger is DOWN: CRITICAL - Host Unreachable (208.80.152.187)
PROBLEM - Host mchenry is DOWN: CRITICAL - Host Unreachable (208.80.152.186)
PROBLEM - Host sq76 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host sq77 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:21] PROBLEM - Host srv152 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv151 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host db5 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host db7 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:22] PROBLEM - Host db8 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:31] PROBLEM - Host tridge is DOWN: CRITICAL - Host Unreachable (208.80.152.170)
PROBLEM - Host hume is DOWN: CRITICAL - Host Unreachable (208.80.152.190)
PROBLEM - Host lvs4 is DOWN: CRITICAL - Host Unreachable (208.80.152.123)
PROBLEM - Host db9 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host rr.pmtpa is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host sq75 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:33] PROBLEM - Host sq72 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host sq73 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:42] PROBLEM - Host srv168 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv153 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv165 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv156 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv177 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host srv180 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:51] PROBLEM - Host locke is DOWN: CRITICAL - Host Unreachable (208.80.152.138)
PROBLEM - Host sq74 is DOWN: PING CRITICAL - Packet loss = 100%
PROBLEM - Host sq71 is DOWN: PING CRITICAL - Packet loss = 100%
[17:13:55] <dungodung> hmm, sites seem to be down for me
[17:14:01] <FastLizard4> Hmm.
<nagios-wm> PROBLEM - Host lvs2 is DOWN: PING CRITICAL - Packet loss = 100%
[17:14:05] <FastLizard4> I wonder why that would be... 9_9
[17:14:10] <dungodung> :P
[17:14:11] <nagios-wm> PROBLEM - check_all_memcacheds on spence is CRITICAL: MEMCACHED CRITICAL - Can not connect to 10.0.2.183:11000 (Connection timed out)
PROBLEM - Host upload.pmtpa is DOWN: PING CRITICAL - Packet loss = 100%
-->| Avery_Mason (~lucid@wikipedia/Fetchcomms) has joined #wikimedia-tech
[17:14:17] <FastLizard4> Yay, servers go boom :D
[17:14:34] Big expensive fireworks. :P