{"id":108,"date":"2015-05-17T03:16:28","date_gmt":"2015-05-17T10:16:28","guid":{"rendered":"http:\/\/fastlizard4.org\/blog\/?p=108"},"modified":"2015-05-17T03:16:28","modified_gmt":"2015-05-17T10:16:28","slug":"unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015","status":"publish","type":"post","link":"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/","title":{"rendered":"Unexpected downtime post-mortem: ridley.fastlizard4.org, 17 May 2015"},"content":{"rendered":"<p>Earlier today, at around 08:00 Sun 17 May 2015 UTC, ridley.fastlizard4.org suffered an unexpected downtime. \u00a0At this time, the problem seems to have been caused by a hardware issue of some kind, or some other problem with the hypervisor host that ran ridley. \u00a0CPU usage increased to 800% (all 8 cores under 100% load), seemingly due to iowait, while disk activity was reduced to near-zero. \u00a0This seems to point to a disk I\/O failure of some kind, which eventually caused the server to bog down so much that the CPUs &#8220;stalled&#8221; and the entire system became totally unresponsive. \u00a0The server also did not respond to a hypervisor shutdown command; eventually, the only way to bring down the system was to issue a &#8220;destroy&#8221; command that effectively &#8220;pulled the plug&#8221; on the system.<\/p>\n<p>In line with the I\/O problems ridley has been experiencing for a while now, I took advantage of this unscheduled downtime to also perform the waiting free Linode upgrade on ridley. \u00a0In addition to moving the system to a new hypervisor host, ridley&#8217;s RAM has now doubled and it has more bandwidth available; however, the number of vCPUs has decreased from 8 to 4.<\/p>\n<p>Ridley is now back up and running, and users may now log in to restart any services or programs they may have had running. \u00a0I have checked the system, and everything appears to now be functioning normally, with all daemons and services up and running. \u00a0The LizardIRC server daemon has also been brought back up, so ridley.lizardirc.org has been relinked to the network and services are also back up and running.<\/p>\n<p>Apologies for the\u00a0inconvenience, and thanks for bearing with me!<\/p>\n<div class=\"sharedaddy sd-sharing-enabled\"><div class=\"robots-nocontent sd-block sd-social sd-social-icon-text sd-sharing\"><h3 class=\"sd-title\">Share this:<\/h3><div class=\"sd-content\"><ul><li class=\"share-facebook\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-facebook-108\" class=\"share-facebook sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=facebook\" target=\"_blank\" title=\"Click to share on Facebook\" ><span>Facebook<\/span><\/a><\/li><li class=\"share-twitter\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-twitter-108\" class=\"share-twitter sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=twitter\" target=\"_blank\" title=\"Click to share on Twitter\" ><span>Twitter<\/span><\/a><\/li><li><a href=\"#\" class=\"sharing-anchor sd-button share-more\"><span>More<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><div class=\"sharing-hidden\"><div class=\"inner\" style=\"display: none;\"><ul><li class=\"share-email\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-email sd-button share-icon\" href=\"mailto:?subject=%5BShared%20Post%5D%20Unexpected%20downtime%20post-mortem%3A%20ridley.fastlizard4.org%2C%2017%20May%202015&body=http%3A%2F%2Ffastlizard4.org%2Fblog%2F2015%2F05%2F17%2Funexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015%2F&share=email\" target=\"_blank\" title=\"Click to email a link to a friend\" data-email-share-error-title=\"Do you have email set up?\" data-email-share-error-text=\"If you&#039;re having problems sharing via email, you might not have email set up for your browser. You may need to create a new email yourself.\" data-email-share-nonce=\"ecb5a4a43f\" data-email-share-track-url=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=email\"><span>Email<\/span><\/a><\/li><li class=\"share-print\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-print sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/\" target=\"_blank\" title=\"Click to print\" ><span>Print<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-reddit\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-reddit sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=reddit\" target=\"_blank\" title=\"Click to share on Reddit\" ><span>Reddit<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><\/div><\/div><\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>Earlier today, at around 08:00 Sun 17 May 2015 UTC, ridley.fastlizard4.org suffered an unexpected downtime. \u00a0At this time, the problem seems to have been caused by a hardware issue of some kind, or some other problem with the hypervisor host that ran ridley. \u00a0CPU usage increased to 800% (all 8 cores under 100% load), seemingly <a href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/\"><b>&#8230;Read the Rest<\/b><\/a><\/p>\n<div class=\"sharedaddy sd-sharing-enabled\"><div class=\"robots-nocontent sd-block sd-social sd-social-icon-text sd-sharing\"><h3 class=\"sd-title\">Share this:<\/h3><div class=\"sd-content\"><ul><li class=\"share-facebook\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-facebook-108\" class=\"share-facebook sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=facebook\" target=\"_blank\" title=\"Click to share on Facebook\" ><span>Facebook<\/span><\/a><\/li><li class=\"share-twitter\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-twitter-108\" class=\"share-twitter sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=twitter\" target=\"_blank\" title=\"Click to share on Twitter\" ><span>Twitter<\/span><\/a><\/li><li><a href=\"#\" class=\"sharing-anchor sd-button share-more\"><span>More<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><div class=\"sharing-hidden\"><div class=\"inner\" style=\"display: none;\"><ul><li class=\"share-email\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-email sd-button share-icon\" href=\"mailto:?subject=%5BShared%20Post%5D%20Unexpected%20downtime%20post-mortem%3A%20ridley.fastlizard4.org%2C%2017%20May%202015&body=http%3A%2F%2Ffastlizard4.org%2Fblog%2F2015%2F05%2F17%2Funexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015%2F&share=email\" target=\"_blank\" title=\"Click to email a link to a friend\" data-email-share-error-title=\"Do you have email set up?\" data-email-share-error-text=\"If you&#039;re having problems sharing via email, you might not have email set up for your browser. You may need to create a new email yourself.\" data-email-share-nonce=\"ecb5a4a43f\" data-email-share-track-url=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=email\"><span>Email<\/span><\/a><\/li><li class=\"share-print\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-print sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/\" target=\"_blank\" title=\"Click to print\" ><span>Print<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-reddit\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-reddit sd-button share-icon\" href=\"http:\/\/fastlizard4.org\/blog\/2015\/05\/17\/unexpected-downtime-post-mortem-ridley-fastlizard4-org-17-may-2015\/?share=reddit\" target=\"_blank\" title=\"Click to share on Reddit\" ><span>Reddit<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><\/div><\/div><\/div><\/div><\/div>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":[]},"categories":[16],"tags":[19,17],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p1rJy3-1K","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/posts\/108"}],"collection":[{"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/comments?post=108"}],"version-history":[{"count":1,"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/posts\/108\/revisions"}],"predecessor-version":[{"id":109,"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/posts\/108\/revisions\/109"}],"wp:attachment":[{"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/media?parent=108"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/categories?post=108"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/fastlizard4.org\/blog\/wp-json\/wp\/v2\/tags?post=108"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}