I received this email in response to my ticket about what happened last night… Short answer = Cisco issues. Been there myself, so I understand it. My biggest issue however was no notice since this apparently has been scheduled beforehand. Virtbiz says they sent an email, and I have no reason to doubt it. But I never got it. I don’t look to twitter as a support mechanism so never even thought to go there. So we still have a couple things to work out communication-wise.
But also notice that they need to do more work tonight. According to the virtbiz site, it will start at 8pm Central and go through midnight. If I hear anything different, I’ll post.
I understand and apologize for your frustration. Last night we had a
maintenance window to upgrade software in the row distribution switches
and upgrade the Supervisor modules and MSFC’s in the gateway switches.
We and announced and anticipated interruptions in service for about 30
seconds or so but due to a conflict in software versions it went on
longer than that as we were forced to roll-back.We’ve documented what went happened and identified where we had the
issue, which ultimately resulted in broken STP between the gateway and
distribution switches, to be a conflict in software versioning. We’ve
obtained alternate versions from Cisco as of last night and have the new
software loaded and running. However, once we got everything back
online, we were not comfortable sending things down for another reset.
The network (or more accurately, the users) had been through enough
trauma for one night.Another window will be opened overnight tonight to address the failure
and complete the work that we set out to do last night. We are starting
early to construct a lab environment using equipment that we sourced
overnight and today to test with first, then apply to the network.I understand that a good part of your concern stems from the
communication before and during the event. I’d like to bullet-point a
few things for you to help address that.- We opened last night’s window to address an issue that came about in a
failure from last week that we had made temporary work-arounds for.- Last night’s window was announced on our website, Twitter feed and
email. I know the website was unavailable for a while during the event.- With regard to the phone system, we recognize that as an area that
needs improvement. A few minutes into the event we disabled the
answering system and had staff in to live-answer the phones to help
explain to customers what was going on. We felt that to be important
given the fact that the website was unavailable. Unfortunately, we
missed you.You should know that as a company that takes its service very seriously,
we’re not satisfied with how last night’s window played out. We do feel
that there are lessons we can take that will help us continue to be
better. I certainly appreciate your comments for us, as well as your
understanding, as we continue working to be the guys you can always rely on.Thanks,
Chris Gebhardt
Comments Off