Re: [Hampshire] Repeated server crash overnight |
This message is part of the following thread: | |
---|---|
the complete thread tree sorted by date | |
James Dutton via Hampshire at | |
rmluglist2--- via Hampshire at |
Author: Brad Macpherson via Hampshire Date: To: hampshire CC: Brad Macpherson Subject: Re: [Hampshire] Repeated server crash overnight |
> On Mon, 13 Mar 2023 at 08:03, rmluglist2--- via Hampshire > <hampshire@??? <mailto:hampshire@mailman.lug.org.uk>> wrote: > > Hi all____ > > __ __ > > I have an Ubuntu box which is on 24/7/365. It has ufw running > allowing nothing from outside my lan.____ > > __ __ > > A couple of times recently, I’ve come in to find the machine locked > up with a lot of disk access (it can be ping’d but I can’t ssh into > it and it doesn’t respond to mouse or keyboard on the console – only > power cycling brings it back). As I say, this has now happened > twice in the last 3-4 nights.____ > > __ __ > > > I have seen this behaviour sometimes. > By default Linux can block all interactive conversations when using high > disk access > High disk access can be caused by a number of things: > 1) some app actually needs the disk > 2) Faults on the disk, causing many retries. > 3) Swap file access > > After a reboot, you can look for faults on the disk with "smartctl -a > /dev/sda" and see if there are any log messages there about failed > sectors, or sector reallocation counts increasing etc. > > If an app needs the disk, it is probably something kicked off by cron. > You can force these apps to use a lower priority for io with "ionice" > Google ionice for suitable ways to run it. > But, I think a good diagnosis is probably to disable cron altogether for > say a week, and see if the problem disappears. > Then at least you will then know that cron and the apps it runs are the > problem. >
This message was posted to the following mailing lists: | ||||
---|---|---|---|---|
Hampshire LUG Mailing List Info | Nearby Messages | Re: [Hampshire] Repeated server crash overnight | Re: [Hampshire] Repeated server crash overnight |
Hampshire LUG Mailing List Archive administrated by Webmaster | Lurker (version 2.3) |