combinat is not responding...
Date: Wed, 3 Oct 2012 21:33:15 +0200
From: Florent Hivert <Florent.Hiv...@lri.fr>
To: sage-devel@googlegroups.com
Subject: Re: [sage-devel] combinat is not responding...
Hi,
> combinat.math.washington.edu is now fixed. For some mysterious
> reasons the ufw firewall was active, with evidently *no* rules, and it
> was blocking most everything. I don't know 100% for certain why this
> happened; however, I've just done:
>
> apt-get remove unattended-upgrades ufw
>
> so it is unlikely to happen again.
Thanks!
> Note that there was no downtime or interruption of anybody's jobs, and
> this was not caused by over-use. (This is the only problem we have
> ever had so far with combinat, by the way!)
A few weeks ago, a huge memory leak in code running in parallel on 32 cores
got my computation killed after a failed memory allocation. combinat was not
very responsive for a couple of minutes, but my impression was that nothing
except my own computations suffered from it, so I didn't mention it. Does
anyone know whether there is a way to check if there were other
consequences?
Cheers,
Florent