reboot summary

From: 0000-Admin0000 (root@aspensys.com)
Date: Wed Jun 21 1995 - 10:51:58 CDT


Here's the list of responses regarding reboots per several requests for a
summary. I will say that Sun HAS told me once that reboots are good because
they refresh the system and clear the process table. The variety of comments
is helpful, although differences of opinion seem to exist...

Thanks again to those that helped out...

Lyle Miller [lmiller@aspensys.com]

*********************************************************************************B

>From bminer@lnd.state.az.us Wed Jun 21 11:39:45 1995
Date: Mon, 19 Jun 95 07:43:19 MST
From: Bob Miner OPS <bminer@lnd.state.az.us>
To: root@aspensys.com
Cc: bminer@lnd.state.az.us
Subject: Re: reboots

This is a question that I have had since we installed our first Sun in 1987.
We have found that a WEEKLY reboot of our entire network of Suns (now at 38
machines of various flavors, all running SunOS4.1.3) keeps our network fairly
clear of hiccups. I have noted an increased frequency of "{halts and hangs"
when, for some reason, we fail to reboot on our regular Friday morning
schedule. All experimentation with this frequency has been subjective, and
may be slightly paranoid, but I prefer to err on the side of conservatism
in this case. I hope that this reply is helpful.

******************************************************************************

>From sunman@criterion.com Wed Jun 21 11:39:55 1995
Date: Mon, 19 Jun 95 08:08:06 CDT
From: Aditya Talwar <sunman@criterion.com>
To: root@aspensys.com
Subject: Re: reboots

I have worked at large site's. Generally, reboots should be done
once a week. The weekly reboots help since everything is reset
at the beggining of the week. The time of the reboot should well
communicated so that everyone knows about them and shut's down
important process's via cron jobs. Also, reboots help if you have
installed new products/scripts. Reboots help in catching any problem
early on.

******************************************************************************

>From jhall@sqi.com Wed Jun 21 11:40:03 1995
Date: Mon, 19 Jun 1995 10:49:52 -0700
From: John Hall <jhall@sqi.com>
To: root@aspensys.com
Subject: Re: reboots

We don't have a reboot schedule, the power company does it for us (about
every three months). :-(

Some of our servers (on UPS) have been running for more than a year without
trouble. One danger sign is hung processes and hung sockets (lsof).

When your Sparc Classic hangs, can you use L1-A to get to a boot prompt?

If so, then your problem is more likely to be software related (soft hang).
If you cannot L1-A it, then your problem is most likely hardware or power
related (hard hang) and I would suggest replacing the whole unit. You
should also have it on a "clean" UPS. I have seen several of the "consumer"
UPS products actually produce worse power than the utilities. Another
source of hard hangs is grounding problems. Make sure all the peripherals
connected to your machine are plugged into the same power source as the
Sparc and have good grounds. Many of the "cheap" powerstrips either do
not have a ground connection internally, or they are broken on one or more
plugs.

Good luck.

*******************************************************************************

>From Birger.Wathne@vest.sdata.no Wed Jun 21 11:40:23 1995
Date: Tue, 20 Jun 95 08:56:37 +0200
From: "Birger A. Wathne" <Birger.Wathne@vest.sdata.no>
To: root@aspensys.com
Subject: Re: reboots

An Oracle server in Sweden (Sun SS10) just passed an uptime of 600 days,
if my memory is correct. So rebooting each day shouldn't be nessesary.
Our admin will reboot the user's CPU servers every 3-4 months.

*******************************************************************************

>From mike@trdlnk.com Wed Jun 21 11:40:38 1995
Date: Tue, 20 Jun 95 12:33 CDT
From: Michael Sullivan <mike@trdlnk.com>
To: root@aspensys.com
Subject: Re: reboots

The best thing is to get a crash dump. See the instructions on using
savecore. If the machine is just freezing, rather than panicing on its
own, you may be able to force a panic to dump memory for savecore after
the lock-up by hitting STOP-A, and then issuing the sync command to the
monitor. Perhaps with a crash dump Sun will be better able to find the
problem.

>Regarding PANIC!, the book explaining how to find out the cause of
>crashes and hangs now on the market from SunSoft Press:

Yes, this book sounds interesting -- we are ordering a copy too.

>But, in the meantime, can anyone discuss their reboot schedules (if any)
>with me? How often should a scheduled reboot take place?

Once or twice a year, when you install a new SunOS version is about right.
We have lots of Suns under heavy use that go 6 months without a reboot.

> What are the
>indicators? What _good_ things does a reboot do for a Sun box,
>specifically?

Nothing, unless you are encountering an OS bug that is causing a
resource leak, or corrupting kernel memory structures, etc.
In this case, you should get the bug fixed, not depend on reboots.
Many (most?) OS bugs that cause crashes and lock-ups are not dependent on
the time since the last reboot and could happen just as easily 5 minutes
after a reboot as at any other time.

> Why do reboots help with overall system performance--or do
>they?

They don't (in a properly functioning OS).

        Root Supervisor Account
       Aspen Systems Corporation
        1600 Research Boulevard
       Rockville, Maryland 20850



This archive was generated by hypermail 2.1.2 : Fri Sep 28 2001 - 23:10:27 CDT