<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><atom:link rel="hub" href="http://tumblr.superfeedr.com/" xmlns:atom="http://www.w3.org/2005/Atom"/><description></description><title>Stackable Status</title><generator>Tumblr (3.0; @stackablestatus)</generator><link>http://status.stackable.com/</link><item><title>Hardware Node Outage</title><description>&lt;p&gt;&lt;span&gt;For as of yet unknown reasons, at approximately 3:02 AM MST one of the hardware nodes servicing our cluster started behaving erratically and negativity impacting the performance of customer containers on that hardware node. Systems administrators were immediately notified and the decision to reboot the affected hardware node was made at 4:43 AM. The hardware node full completed a reboot at 5:13 AM MST, at which point customer containers recovered. In total, 44 customers were affected.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;Of course, we are very concerned about this unexpected outage. We plan to begin working with customers with containers on the affected hardware node immediately to begin to move them to other nodes. We are deeply sorry for this outage.&lt;/p&gt;</description><link>http://status.stackable.com/post/16004437192</link><guid>http://status.stackable.com/post/16004437192</guid><pubDate>Tue, 17 Jan 2012 05:41:53 -0700</pubDate></item><item><title>Hardware Node Outage</title><description>&lt;p&gt;At approximately 2:46 PM MST one of the hardware nodes servicing our cluster unexpectedly experienced a kernel panic following a configuration change. Systems administrators on site were immediately notified and the node was brought back online at 2:59 PM MST. In total, 27 customers were affected.&lt;/p&gt;
&lt;p&gt;Customer containers on that hardware node proceeded to recover over the course of the next few minutes. Some customer&amp;#8217;s MySQL containers experienced some minor issues in restarting their MySQL daemons which required manual intervention; however, our experienced team was quickly able to resolve those remaining issues.&lt;/p&gt;
&lt;p&gt;We certainly strive to avoid issues like this from happening, but are lucky to have experienced talent and emergency response on hand to restore service as quickly as possible.&lt;/p&gt;</description><link>http://status.stackable.com/post/15641200033</link><guid>http://status.stackable.com/post/15641200033</guid><pubDate>Tue, 10 Jan 2012 16:45:20 -0700</pubDate></item><item><title>Upstream core router outage</title><description>&lt;p&gt;One of our upstream provider&amp;#8217;s core routers is currently experiencing significant routing issues, preventing us from contacting our DNS servers and degrading access to, from, and within our cluster. They are working on the problem now and we will update this status as more information becomes available. Thank you for your patience.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Update 1:55 PM MST:&lt;/strong&gt; Network service has been restored. We deeply apologize for this outage and are working with our upstream network provider to find solutions to prevent this occurring in the future.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Update 4:03 PM MST: &lt;/strong&gt;One of our upstream provider&amp;#8217;s redundant core routers had serious routing problems from approximately 1:37 to 1:53 PM MST on Wednesday, November 16th.&lt;/p&gt;
&lt;p&gt;While they are still investigating the cause, BGP routing sessions on one of their core Cisco routers (a Cisco 6509-E) started to flap badly at approximately 1:37 PM. As a result, CPU utilization spiked, and the router became unresponsive to management commands and began dropping and re-establishing additional BGP sessions. The flapping of these sessions going up and down and the resulting heavy load of processing routing information meant the router could not recover enough to fully establish all the sessions and keep them up.  &lt;br/&gt;&lt;br/&gt;To help reduce the load caused by this cycle, a network engineer decided to disconnect an Internet transit connection that went directly to this router. This kept the router from constantly processing and dropping the full feed of Internet routes from this provider and the router was able to stabilize very quickly afterward.&lt;br/&gt;&lt;br/&gt;Although our upstream provider&amp;#8217;s network is fully redundant, both at the border and core of their network, a partial outage like this is possible when a router is still partially operable because full failover to the other router might not get triggered. Among other services affected was one of their redundant recursive name servers, causing some DNS queries to time out.&lt;/p&gt;
&lt;p&gt;Again, we are very sorry for any problems this may have caused and we are working to find solutions that will mitigate any future outages.&lt;/p&gt;</description><link>http://status.stackable.com/post/12892413789</link><guid>http://status.stackable.com/post/12892413789</guid><pubDate>Wed, 16 Nov 2011 13:55:00 -0700</pubDate></item><item><title>DNS resolution failures</title><description>&lt;p&gt;During the early-morning hours of November 11th, 2011, DNS resolution for the stackablehost.com domain ceased to work correctly.&lt;/p&gt;
&lt;p&gt;The most frequent issues where arose as a result is that customers depending on resolution of the stackablehost.com domain for inter-container requests such as MySQL or Postgres saw failures. &lt;/p&gt;
&lt;p&gt;Due to a series of mistakes, the domain was not renewed by Stackable as it normally would be and briefly lapsed. &lt;/p&gt;
&lt;p&gt;The issue was corrected at 7:40 AM MST.&lt;/p&gt;
&lt;p&gt;Unquestionably, we&amp;#8217;re deeply concerned about an outage like this which could have been prevented. We&amp;#8217;ve renewed all domains for an additional ten years and we&amp;#8217;re working on putting additional monitoring in place to notify us of potential failures of this type in advance. We deeply regret this outage and we&amp;#8217;re very sorry.&lt;/p&gt;</description><link>http://status.stackable.com/post/12642256469</link><guid>http://status.stackable.com/post/12642256469</guid><pubDate>Fri, 11 Nov 2011 08:17:09 -0700</pubDate></item><item><title>Exim regression causing mail delivery to fail on some PHP containers</title><description>&lt;p&gt;A normal security update to the Exim mail delivery package on PHP containers has been rolled out, however it has been discovered that in some cases mail delivery will fail when mail is sent via a PHP script which utilizes the system binaries to send mail.&lt;/p&gt;
&lt;p&gt;A temporary fix for this problem is being rolled out but may not be available to all customer containers immediately. If you continue experience problems with sending email from your container, please contact &lt;a href="mailto:help@stackable.com"&gt;help@stackable.com&lt;/a&gt; for further assistance. We expect that this problem will be fully resolved by tomorrow, Thursday October 26th 2011. We apologize for the inconvenience. &lt;/p&gt;</description><link>http://status.stackable.com/post/11966434333</link><guid>http://status.stackable.com/post/11966434333</guid><pubDate>Wed, 26 Oct 2011 16:54:41 -0600</pubDate></item><item><title>DNS server degradation</title><description>&lt;p&gt;At approximately 12:15 PM MST on Sunday, October 23, 2011 the recursive name servers at our upstream provider experienced degraded performance as of a result of a Denial of Service attack lasting until about 6:33 PM MST. Systems engineers at Stackable were notified quickly after the problem began and, in concert with our upstream provider, were able to identify and resolve the issue.&lt;br/&gt;&lt;br/&gt;Our provider has now taken proactive steps to assure that these kinds of outages do not occur in the future by implementing systems to prevent a future event of this nature.&lt;br/&gt;&lt;br/&gt;We are, of course, continuing to investigate additional steps we can take to avoid any potential service interruption. To assure that this does not happen again, and as an added precaution, we will be setting up our own internal recursive name servers independent of those currently used by our upstream provider to provide increased reliability and redundancy. It has been, and remains to be, our top priority to provide you with the best service imaginable and we deeply apologize for any inconvenience.&lt;/p&gt;</description><link>http://status.stackable.com/post/11883364500</link><guid>http://status.stackable.com/post/11883364500</guid><pubDate>Mon, 24 Oct 2011 17:25:28 -0600</pubDate></item><item><title>Scheduled service outage Wednesday morning</title><description>&lt;p&gt;Stackable will be performing maintenance in its data center. If you are  one of the customers expected to be affected, you have already been  notified by email.&lt;/p&gt;
&lt;p&gt;We plan to begin this work at Wednesday, Oct 19, 7:00 AM MDT and  expect that all work will be concluded by 7:30 AM (30 minutes). We  expect that these container(s) will be down for only a few minutes during  this time frame.&lt;/p&gt;
&lt;p&gt;Stackable does offer high-availability options to ensure that even  scheduled downtimes like these won&amp;#8217;t affect your site. Contact us by  emailing &lt;a href="mailto:sales@stackable.com"&gt;sales@stackable.com&lt;/a&gt; or calling (877) 977-2253 to learn more.&lt;/p&gt;
&lt;p&gt;If you have any questions at all, please don&amp;#8217;t hesitate to let us  know. We&amp;#8217;re available during regular business hours (9 AM - 5 PM MST,  Monday - Friday) via the Live Chat link on our website  &lt;a href="http://www.stackable.com"&gt;&lt;a href="http://www.stackable.com"&gt;http://www.stackable.com&lt;/a&gt;&lt;/a&gt; or by emailing &lt;a href="mailto:help@stackable.com"&gt;help@stackable.com&lt;/a&gt;.&lt;/p&gt;</description><link>http://status.stackable.com/post/11613329599</link><guid>http://status.stackable.com/post/11613329599</guid><pubDate>Tue, 18 Oct 2011 09:07:22 -0600</pubDate></item><item><title>Control panel issues</title><description>&lt;p&gt;The Stackable Control panel is currently inaccessible. Engineers are working to correct the problem.&lt;/p&gt;
&lt;p&gt;If you have urgent requests, please use our Live Chat feature to connect with a technical support agent who can assist you or email help@stackable.com.&lt;/p&gt;
&lt;p&gt;All containers and sites are operating normally. &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Update:&lt;/strong&gt; As of 12:45 AM MST, the Stackable Control Panel is now functioning normally. We are sorry for any inconvenience this may have caused. Thank you for your patience.&lt;/p&gt;</description><link>http://status.stackable.com/post/9869891393</link><guid>http://status.stackable.com/post/9869891393</guid><pubDate>Tue, 06 Sep 2011 00:09:00 -0600</pubDate></item><item><title>Managed switch replacement</title><description>&lt;p&gt;On Friday, August 26, at 5:00 PM MST, we will be replacing a faulty managed switch with a hot spare we have available for such situations. The faulty switch has not resulted in any problems or outages to date, but we have decided to replace it as a precaution. There will be no downtime as a result of this replacement. As always, our goal is to provide our customers with high-availability, reliable service.&lt;/p&gt;
&lt;p&gt;If you have any questions or concerns, please don’t hesitate to let us know. We’re available during regular business hours (9 AM - 5 PM MST, Monday - Friday) via the Live Chat link on our website &lt;a href="http://www.stackable.com/"&gt;&lt;a href="http://www.stackable.com"&gt;http://www.stackable.com&lt;/a&gt;&lt;/a&gt;, by emailing &lt;a href="mailto:help@stackable.com"&gt;help@stackable.com&lt;/a&gt;, or by calling (801) 983-7394.&lt;/p&gt;</description><link>http://status.stackable.com/post/9382793151</link><guid>http://status.stackable.com/post/9382793151</guid><pubDate>Thu, 25 Aug 2011 13:06:02 -0600</pubDate></item><item><title>Scheduled service outage Friday morning</title><description>&lt;p&gt;Stackable will be performing maintenance in its data center. If you are one of the customers expected to be affected, you have already been notified by email.&lt;br/&gt;&lt;br/&gt;We are beginning work at 12:01 AM MDT on Friday, July 8th, 2011, and don&amp;#8217;t anticipate it taking longer than 10 minutes. During this time period, we will be shutting down the load balancer which directs traffic for some websites to their container(s). Visitors will not be able to reach these website during this time. We are making every effort to minimize the impact this will have and, in the future, will work to find a way to reduce or eliminate the need for service windows like this. Normally we&amp;#8217;d wait until the weekend to perform such service but this situation is somewhat more pressing.&lt;br/&gt;&lt;br/&gt;If you have any questions at all, please don&amp;#8217;t hesitate to let us know. We&amp;#8217;re available during regular business hours (9 AM - 5 PM MST, Monday - Friday) via the Live Chat link on our website &lt;a target="_blank" href="http://www.stackable.com"&gt;&lt;a href="http://www.stackable.com"&gt;http://www.stackable.com&lt;/a&gt;&lt;/a&gt; or by emailing &lt;a href="mailto:help@stackable.com"&gt;help@stackable.com&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Update:&lt;/strong&gt; Beginning at 12:01 AM MST on Friday July 8th 2011, engineers replaced a failing load-balancer with a replacement server. The outage was expected to take only ten minutes to bring down the old server and bring the new one online. A notice was sent to affected customers to inform them of the anticipated brief outage.&lt;/p&gt;
&lt;p&gt;The switch to the new server went as anticipated until it was discovered that networking was not working as designed.&lt;/p&gt;
&lt;p&gt;Nearly all IP addresses bound to customer sites were not accessible from beyond Stackable&amp;#8217;s network. Engineers immediately began to investigate and quickly discovered that the ARP (Address Resolution Protocol) caches were not properly expiring on our upstream Cisco network devices.&lt;/p&gt;
&lt;p&gt;To make matters worse, the Cisco routers refused to clear the cache despite repeated attempts using a variety of methods. Ultimately, the only available fix was to manually create and enter ARP entries for several hundred addresses bound to the load-balancer in order to restore upstream connectivity. It took some time for this fix to be written and implemented.&lt;/p&gt;
&lt;p&gt;This is obviously a very serious flaw in Cisco IOS and our engineers are working with Cisco to determine the cause and implement a permanent resolution.&lt;/p&gt;
&lt;p&gt;Though some sites came online earlier, it took until 2:04 AM before engineers determined that operations were fully restored.&lt;/p&gt;
&lt;p&gt;We certainly don&amp;#8217;t consider this sort of extended outage acceptable and to our customers we offer a sincere apology. We&amp;#8217;re sorry.&lt;/p&gt;
&lt;p&gt;This morning, we&amp;#8217;re working on a number of options to increase the availability of our front-end load-balancers far beyond what we have in place right now. As we work toward increasing reliability, we&amp;#8217;ll continue to keep our customers informed. &lt;/p&gt;</description><link>http://status.stackable.com/post/7350523062</link><guid>http://status.stackable.com/post/7350523062</guid><pubDate>Thu, 07 Jul 2011 12:55:00 -0600</pubDate></item><item><title>Scheduled service outage Sunday morning</title><description>&lt;p&gt;On Sunday morning, Stackable will be performing maintenance in its data center which will affect 13 customers. If you are one of the customers expected to be affected, you have already been notified by email.&lt;/p&gt;
&lt;p&gt;We plan to begin this work at Sunday, Jul 3, 9:00 AM MST and expect that all work will be concluded by 9:30 AM (30 minutes).&lt;/p&gt;
&lt;p&gt;Stackable does offer high-availability options to ensure that even  scheduled downtimes like these won&amp;#8217;t affect your site. Contact us by  emailing sales@stackable.com or calling (877) 977-2253 to learn more.&lt;/p&gt;
&lt;p&gt;If you have any questions at all, please don&amp;#8217;t hesitate to let us know.  We&amp;#8217;re available during regular business hours (9 AM - 5 PM MST, Monday -  Friday) via the Live Chat link on our website &lt;a href="http://www.stackable.com"&gt;http://www.stackable.com&lt;/a&gt;  or by emailing help@stackable.com.&lt;/p&gt;</description><link>http://status.stackable.com/post/7086289808</link><guid>http://status.stackable.com/post/7086289808</guid><pubDate>Thu, 30 Jun 2011 11:16:56 -0600</pubDate></item><item><title>Phones and Live Chat offline</title><description>&lt;p&gt;As of 9:00 AM MST, we are unable to take support requests via phone or live chat. We are working to correct the issue. &lt;/p&gt;
&lt;p&gt;In the interim, please direct all support requests to help@stackable.com and they will be handled promptly. &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;UPDATE:&lt;/strong&gt; This issue was resolved at 9:41 AM MST.&lt;/p&gt;</description><link>http://status.stackable.com/post/6791614967</link><guid>http://status.stackable.com/post/6791614967</guid><pubDate>Wed, 22 Jun 2011 09:24:00 -0600</pubDate></item><item><title>Increased file system lag</title><description>&lt;p&gt;Between 6:15 AM and 1:30 PM MST our back end storage file system experienced an usually high amount of lag causing some customers to experience slow page load times. Our systems administrators quickly identified the problem and worked as quickly as possible to resolve the issue. We are currently evaluating a variety of options to prevent this issue for occurring in the future.&lt;br/&gt;&lt;br/&gt;We are deeply sorry for any inconvenience this may have caused and thank you for your patience.&lt;/p&gt;</description><link>http://status.stackable.com/post/6733298561</link><guid>http://status.stackable.com/post/6733298561</guid><pubDate>Mon, 20 Jun 2011 14:55:54 -0600</pubDate></item><item><title>Brief Network Outage</title><description>&lt;p&gt;Today between 1:35 PM and 1:45 PM MST, the core routers which handle all XMission and Stackable traffic to and from the Internet experienced an overload which caused sporadic traffic outages during this time period.&lt;br/&gt; &lt;br/&gt;XMission has redundant routers with connectivity to several Internet backbone providers. Currently only one of these is IPv6 capable. During an attempt to bring the second router online with IPv6, it caused high CPU use and overload which caused processing all traffic to slow. As soon as the overload was noticed, the changes were undone.&lt;br/&gt;&lt;br/&gt;This was a minor network change that was not expected to cause any problems. XMission will hold off conversion of this second router until they find a resolution as to why the routers overloaded. This change will be executed during off hours just to be certain. Both XMission and Stackable apologize for any inconvenience caused due to this outage.&lt;/p&gt;</description><link>http://status.stackable.com/post/6295890226</link><guid>http://status.stackable.com/post/6295890226</guid><pubDate>Tue, 07 Jun 2011 15:26:33 -0600</pubDate></item><item><title>VPS Outages</title><description>&lt;p&gt;A small number of VPS containers in our SLC1 datacenter were offline on April 30 from 8:30 PM MST - 9:50 PM MST and again on May 1 from 10:44 AM MST - 11:24 AM MST.&lt;/p&gt;&#13;
&lt;p&gt;These issues have been identified and stem from problems we&amp;#8217;ve had with our back end ZFS file system. A decision has been made to replace the Nexenta platform currently in use with a NetApp FAS2020 to serve as the storage facility for VPS containers. We&amp;#8217;ve had perfect reliability with NetApps in the past and expect that trend to continue.&lt;/p&gt;&#13;
&lt;p&gt;Despite the VPS products being in beta stage, we again apologize for any inconvenience this may have caused for the small number of affected customers. We will report further updates following the implementation of the NetApp solution. Thank you again for your patience.&lt;/p&gt;</description><link>http://status.stackable.com/post/5163310478</link><guid>http://status.stackable.com/post/5163310478</guid><pubDate>Tue, 03 May 2011 10:41:00 -0600</pubDate></item><item><title>VPS Outage</title><description>&lt;p&gt;11 VPS (beta) containers in our SLC1 datacenter were offline from 9:04PM MST to 10:45PM MST, affecting a total of 8 customers.&lt;/p&gt;
&lt;p&gt;This outage was caused by the same issue which affected the same VPS containers on April 26, 2011.&lt;/p&gt;
&lt;p&gt;We are deeply sorry for any inconvenience this may have caused. Our engineers continue to investigate the root cause of this problem and are working to develop a solution&lt;/p&gt;</description><link>http://status.stackable.com/post/4960535489</link><guid>http://status.stackable.com/post/4960535489</guid><pubDate>Tue, 26 Apr 2011 11:57:31 -0600</pubDate></item><item><title>VPS outage</title><description>&lt;p&gt;A small number of VPS containers in our SLC1 datacenter were offline from 11:03 AM MST to 11:25 MST. &lt;/p&gt;
&lt;p&gt;Without warning, a storage server which provides disk access for VPS containers suddenly flipped one of its filesystems into read-only mode. Not only did this prevent containers from being able to write files, it also prevented containers from being started, stopped or restarted. &lt;/p&gt;
&lt;p&gt;The hardware which hosts those containers was quickly brought down in order to attempt to re-mount the filesystem. After the filesystem was remounted RW, containers were restarted and all services were restored by 11:25 AM MST.&lt;/p&gt;
&lt;p&gt;We apologize sincerely for this unexpected outage. Engineers continue to investigate the root cause and will move containers storage away from the affected storage server if a systemic issue is discovered.&lt;/p&gt;</description><link>http://status.stackable.com/post/4780544439</link><guid>http://status.stackable.com/post/4780544439</guid><pubDate>Wed, 20 Apr 2011 11:36:08 -0600</pubDate></item><item><title>Control Panel billing issues</title><description>&lt;p&gt;For a short period of time over the weekend, customers were not able to access their billing page through the Stackable Control panel. This issue was first brought to our attention on Sunday, April 17, at 1:02 PM MST and was resolved by 2:38 PM MST.&lt;/p&gt;
&lt;p&gt;We believe this issue was caused by an upgrade to our third party billing system.&lt;/p&gt;
&lt;p&gt;We apologize for any inconvenience this may have cause.&lt;/p&gt;</description><link>http://status.stackable.com/post/4719959090</link><guid>http://status.stackable.com/post/4719959090</guid><pubDate>Mon, 18 Apr 2011 09:32:44 -0600</pubDate></item><item><title>Control Panel issues</title><description>&lt;p&gt;The Stackable Control Panel is not functioning correctly as of 9:00 AM MST, Monday April 11th.&lt;/p&gt;
&lt;p&gt;Users may experience high wait times and may be unable to upgrade/downgrade or purchase new containers during this period.&lt;/p&gt;
&lt;p&gt;We believe this issue to have been caused by an upgrade to our 3rd party billing software which was undertaken by our vendor a day earlier than was scheduled. We are working with the vendor to resolve the issue.&lt;/p&gt;</description><link>http://status.stackable.com/post/4527569437</link><guid>http://status.stackable.com/post/4527569437</guid><pubDate>Mon, 11 Apr 2011 09:43:16 -0600</pubDate></item><item><title>Disc failure in slc1</title><description>&lt;p&gt;At approximately 5:20 AM MST  disc in a server in our SLC1 datacenter crashed, bringing a critical machine offline. A number of customer containers were affected and all provisioning capabilities to existing containers were brought offline.&lt;/p&gt;
&lt;p&gt;Systems engineers were immediately alerted to the problem and were on-site shortly after. The machine was brought fully back online by 7:11 AM MST and as of 7:24 AM MST, most customer containers are back online and provisioning services for all customer containers are being restored. &lt;/p&gt;
&lt;p&gt;Obviously, it&amp;#8217;s not normal for one of our servers to be brought down by a single disc failure. The server in question was configured with a RAID10 array and should have stayed up despite its condition. We are working with the vendor to determine why the disc array did not act as it was designed to.&lt;/p&gt;
&lt;p&gt;As always, we apologize for any interruption in service. We take these events very seriously and will post additional updates as they become available. &lt;/p&gt;</description><link>http://status.stackable.com/post/3784487492</link><guid>http://status.stackable.com/post/3784487492</guid><pubDate>Fri, 11 Mar 2011 07:26:54 -0700</pubDate></item></channel></rss>

