[BlueOnyx:06724] Re: [bluequartz] Re: cced gone wild

Jeffrey Pellin jeffrey at px2co.net
Thu Mar 17 08:41:03 -05 2011


Hi Chris - you're right I did ask!

BTW a solution that means both we and our clients are completely unaware of
any issues or service problems sounds pretty damn good to me and far from
imperfect.

I'm sorry Rashid - but maybe the techniques that Chris used to keep our
server up might help.

Regards

Jeffrey


On Thu, 17 Mar 2011 08:11:38 -0500, Chris Gebhardt - VIRTBIZ Internet
<cobaltfacts at virtbiz.com> wrote:
> Jeffrey Pellin wrote:
>> This might not be the same thing but we had a similar problem with lots
>> of
>> processes bringing down one of our  BO v-servers. 
>> 
>> Our Aventurine box is with Chris over at Virtbiz and it appears that one
>> of
>> his excellent team nailed the problem, as we haven't had an issue for
>> some
>> time. I never thought to ask Chris what it was.
>> 
>> If it's the same thing maybe he'll post.
> 
> Hi Jeffrey,
> Ah yes, this would be our old Albatros, right?  :)  I just had a quick 
> look back at the history on this one and it looks like it wasn't exactly 
> the same issue.  On your box it was Apache instead of cced.init.  What 
> would happen is that Apache would get hammered and then get very out of 
> sorts to the point that it begins to block network traffic for the 
> entire host (not just the VPS).  Restarting Apache on the affected VPS 
> seems to cure the symptom.
> 
> We tightened up some web security and that seemed to bring the number 
> incidents down significantly but not entirely.  We also loaded in a 
> quick script to watch for network to fail and if it does, issue a 
> restart for Apache.  It looks like that restart is happening about once 
> every 1.5 days or so.   An imperfect solution, to be sure, but the 
> system continues to run.
> 
> As a failsafe, we've also got the physical system monitored and if 
> network goes AWOL on it for more than 5 minutes the box gets 
> power-cycled by the PDU.  The 5 minute window leaves time for things 
> like intentional reboots.
> 
> I don't think the above will help Rashid with his issue, but since you 
> asked... I told.  ;)



More information about the Blueonyx mailing list