Summary : Gigabit Interface problems and BSA_RC_ABORT_SYSTEM_ERROR

From: Rahul Parasnis <rahul.parasnis_at_creditlyonnais.fr>
Date: Tue Jun 25 2002 - 03:54:56 EDT
I am posting my summary to both the list , since both list helped me and
problem was related linked to Legato and SUN .

Some people may not like this , I sincerely feel that this summary will be
helpful to both the list.

Thanks to everybody who reponded . I have attached their reponses below.
It turns out that , slot 0 is not good for gigabit card in E250 .
Regarding the setup. I did not change anything on networker configuration .
Network Hardware
Netgear HUBS GS504 , Media convertor ( RJ45 to Fiber Optic )  , 1000mbps
Full duplex mode (fix)  on each machine's gigabit ethernet card.

Following is the response from SUN's Sakata-san ,

The GigaBit Card which is inserted in E250 was Slot0.The GBE card x1141A
isn't reccomended to insert in
Solt0. The reccomended slot is Slot3.That is because Slot 0-2 are  64/32bit
33MHz, in a
single  PCI bus.

Please see below

PCI Slot Numbering
SLOT BUS DEVICE TREE
3 A pci@1f,2000/*@1,*
2 B pci@1f,4000/*@2,*
1 B pci@1f,4000/*@4,*
0 B ppci@1f,4000/*@5,*

PCI Bus B shares the address/data bus with onboard Audio,Ethernet,Flash,
Floppy, Keyboard, Mouse, NVRAM, Parallel,
SCSI, and Serial devices.

X1141A Card Gigabit Ethernet 2.0/3.0 (GBE/P) is a Multi-Mode Fiber 3.3/5V
64Bit 33/66MHz card and only 1 card
is supported within this E250 system.

Here are some responses I received .
==================================================================================
Irwan Hadi <irwanhadi@phxby.engr.usu.edu>
If you are getting lots of packet errors, then this means that you are
having full duplex problem. The problem becomes bigger since you are also
using Netgear products, which *always* have full duplex problems with Sun.
My suggestion is either try to change your Netgear to different brand, or
force your Netgear equipment to talk *only* half duplex, or if you don't
have any other choice, try to set your GBIC (Gigabit Interface Card) on
your Sun system to run just half duplex.Let me know if this solve the
problem or not, by cc-ing to
irwanhadi@phxby.com
Thanks
===================================================================================
Stuart Whitby <swhitby@legato.com>
RPC errors aren't necessarily network hardware.  I can't help you with the
problems that you note on the network
side, but if you really want to check NetWorker, run rpcinfo -a (the rest
of the options are dependent on your
environment) against programs 390103 and 390107 on the server, and log the
results at one minute intervals
throughout the night.  Run this directly on the server itself, which means
that you go through the loopback
interface rather than across the network.  If you see a problem here, then
the problem is with the process not
responding to the rpcinfo request in time.  One or two "program not
responding" errors are expected during busy
periods since a) the rpcinfo timeout is much shorter than the NetWorker
client software timeout, and b) the client
software will retry.

If this is a problem for you, upgrade to 6.1.2 (if it's available - I know
it's on limited release) or 6.1.1
with the RPC patch included in LGTpa33799.  You probably don't need the
whole thing, but if you're running two
jukeboxes, then it may not be a bad idea (depends on the size of your
nsrjb.res file).

Hmm, quick further note - I think I cancelled sending this as it was being
delivered.  If 6.1.2 is out, it's
*very* new.  Personally, I'd give it a month then ask people if they've got
any problems with it before
installing, as I would with any new software.  Besides, I'm troubleshooting
it right now and seeing problems :)
For what it's worth, they're probably site specific - I figure that virtual
devices mean we're hitting a
problem with the server figuring that it's got 450 devices...

Cheers,

Stuart.
=====================================================================================



                                                                                                                                              
                      Rahul Parasnis                                                                                                          
                      <rahul.parasnis@creditl         To:      sunmanagers@sunmanagers.org                                                    
                      yonnais.fr>                     cc:      (bcc: Rahul Parasnis/JPN/International/CLY)                                    
                      Sent by:                        Subject: Gigabit Interface problems                                                     
                      sunmanagers-admin@sunma                                                                                                 
                      nagers.org                                                                                                              
                                                                                                                                              
                                                                                                                                              
                      06/24/2002 04:58 PM                                                                                                     
                                                                                                                                              
                                                                                                                                              




Hello Gurus,

I have E250 Machine( Solaris 8 )  with two Autochangers connected ( One L20
and one L40 ) having Gigabit Ethernet card ( X1141A) .
This Machine is used as Storage Node( term used in  Legato Networker) . Lot
of people on this list must be using Legato Networker .
(I am using Networker 6.1 build 136 for those who are interested in the
version )

Problem :
When I start to take backup( 80 GB windows 2000 Machine with Lotus Notes
Option )  through networker using gigabit interface ,
backup failes after 12 GB /15GB or sometimes 30 GB  giving rpc errors
timeouts.
Initially I thought that this could be the configuration problem of
networker . I posted my problem on networker mailing list , and lot people
responded that the error cause is because of network hardware .  One person
having simmilar setup experienced same problem and he pointed out that this
could be because of Compaq NIC . However he did not have evidence since he
can test with another card .

I started to monitor netstat statistics on gigabit ethernet card , and I
found following . I just ftpd 100mb file from this machine to another
machine .
netstat -i -I ge0 3

and I am getting following
    input   ge0       output           input  (Total)    output
packets errs  packets errs  colls  packets errs  packets errs  colls

26738   66    13634   0     0      26822   66    13714   0     0
36142   178   19294   0     0      36210   178   19359   0     0
33140   124   17255   0     0      33205   124   17316   0     0
46008   183   24783   0     0      46055   183   24815   0     0
28399   71    14482   0     0      28488   71    14563   0     0
32110   56    16488   0     0      32183   56    16554   0     0
42485   131   22328   0     0      42572   131   22403   0     0
42686   190   22627   0     0      42781   190   22710   0     0
39271   132   20418   0     0      39350   132   20483   0     0
33431   83    17207   0     0      33557   83    17312   0     0
48936   169   26030   0     0      49018   169   26088   0     0
36564   106   18992   0     0      36714   106   19105   0     0
43169   155   23085   0     0      43297   155   23189   0     0
50800   183   27213   0     0      50908   183   27298   0     0
37223   129   19451   0     0      37361   129   19569   0     0
10735   35    5698    0     0      10877   35    5832    2     4
22383   22    11124   0     0      22555   22    11263   0     0
35079   81    18111   0     0      35183   81    18207   0     0
32951   103   17097   0     0      33050   103   17180   0     0

Now you can see there are lot of input errors , I have escalated this
problem to SUN through vendor CTC Technology corporation.
I feel the Ierrs are not normal , and they are coming only on this machines
gigabit ethernet card , Hence CTC support replaced this card , reinstalled
driver applied patch but invain. Nothing changed .

Does anybody have such experiences with gigabit interface card on E250
Machine running Solaris 8 ?

I have applied 108813-09 patch on E250 Machine .
I am using GS504 Netgear HUBS ( fiber optic ) .

best regards,
- Rahul


Following Mail was posted to Networker Mailing list .


Thanks to james.white@pncbank.com , Scott Bingham <sbingham@legato.com>

who told me that concentrate on Network hardware side rather than on Legato
side.

                                                                                                                                              
                      Rahul Parasnis                                                                                                          
                      <rahul.parasnis@creditl         To:      NETWORKER@LISTMAIL.TEMPLE.EDU                                                  
                      yonnais.fr>                     cc:                                                                                     
                      Sent by: Legato                 Subject: BSA_RC_ABORT_SYSTEM_ERROR                                                      
                      NetWorker discussion                                                                                                    
                      <NETWORKER@LISTMAIL.TEM                                                                                                 
                      PLE.EDU>                                                                                                                
                                                                                                                                              
                                                                                                                                              
                      06/20/2002 12:38 PM                                                                                                     
                      Please respond to                                                                                                       
                      Legato NetWorker                                                                                                        
                      discussion; Please                                                                                                      
                      respond to Rahul                                                                                                        
                      Parasnis                                                                                                                
                                                                                                                                              
                                                                                                                                              




I am trying to backup Windows 2000 Machine , with Lotus Notes Module
option.
I am getting following error after sometime .
Could someone will help me in sorting out this error ?

Windows 2000 Machine is having  2 GB  Memory . Installed 2.1 Notus notes
Module , and server is on Solaris 8 Networker Version 6.1 Build 186.

* tk2kcl4:NOTES nsrnotesv: RPC error: Unable to send
* tk2kcl4:NOTES nsrnotesv: RPC error: Unable to send
* tk2kcl4:NOTES nsrnotesv: RPC error: Unable to send
* tk2kcl4:NOTES nsrnotesv: RPC error: Unable to send
* tk2kcl4:NOTES nsrnotesv: XBSA error: 3 (System error: XBSA-1.0.1
rt_2000_3Q.Build.157 1944 Thu Jun 20 11:55:43 2002
_nwbsa_close_create_object_session: BSA_RC_ABORT_SYSTEM_ERROR System
detected error due to savefile_fini() failure. Operation aborted
* tk2kcl4:NOTES )
* tk2kcl4:NOTES nsrnotesv: XBSA error: 3 (System error: XBSA-1.0.1
rt_2000_3Q.Build.157 2556 Thu Jun 20 11:55:43 2002
_nwbsa_close_create_object_session: BSA_RC_ABORT_SYSTEM_ERROR System
detected error due to savefile_fini() failure. Operation aborted
* tk2kcl4:NOTES )
* tk2kcl4:NOTES nsrnotesv: Backup failed for D:\old_notes\Copy of
JPNQ01\htaguchi.nsf
* tk2kcl4:NOTES nsrnotesv: Backup failed for D:\old_notes\Copy of
JPNQ01\htauro.nsf
* tk2kcl4:NOTES nsrnotesv: XBSA error: 3 (System error: XBSA-1.0.1
rt_2000_3Q.Build.157 1776 Thu Jun 20 11:55:43 2002
_nwbsa_close_create_object_session: BSA_RC_ABORT_SYSTEM_ERROR System
detected error due to savefile_fini() failure. Operation aborted
* tk2kcl4:NOTES )
* tk2kcl4:NOTES nsrnotesv: Backup failed for D:\old_notes\Copy of
JPNQ01\hfukushi.nsf
* tk2kcl4:NOTES nsrnotesv: XBSA error: 3 (System error: XBSA-1.0.1
rt_2000_3Q.Build.157 2460 Thu Jun 20 11:55:43 2002
_nwbsa_close_create_object_session: BSA_RC_ABORT_SYSTEM_ERROR System
detected error due to savefile_fini() failure. Operation aborted


Another concern for me is following message , but I searched in the archive
for this , and it seems I need client for NT workstation License .
But if anybody want to share his/her experiences then I will be delighted
to know them .


06/20/02 12:23:15 PM nsrmmdbd: media db is cross checking the save sets
06/20/02 12:23:17 PM nsrmmdbd: error, client name must be specified
06/20/02 12:23:20 PM nsrmmdbd: media db is open for business
06/20/02 12:23:27 PM nsrindexd: Running nsrck to check client file indices
06/20/02 12:23:27 PM nsrd: RAP error: there are more clients connected than
number of licenses.

best regards,
- Rahul
Ce message contient des informations confidentielles ou appartenant au
Cridit Lyonnais et est itabli ` l'intention exclusive de ses
destinataires. Toute divulgation, utilisation, diffusion ou reproduction
(totale ou partielle) de ce message, ou des informations qu'il contient,
doit jtre prialablement autorisie. Tout message ilectronique est
susceptible d'altiration et son intigriti ne peut jtre assurie.
Le Cridit Lyonnais dicline toute responsabiliti au titre de ce
message s'il a iti modifii ou falsifii. Si vous n'jtes pas
destinataire de ce message, merci de le ditruire immidiatement et
d'avertir l'expiditeur de l'erreur de distribution et de la destruction
du message.

This e-mail contains confidential information or information belonging
to Cridit Lyonnais and is intended solely for the addressees.
The unauthorised disclosure, use, dissemination or copying (either whole
or partial) of this e-mail, or any information it contains, is prohibited.
E-mails are susceptible to alteration and their integrity cannot be
guaranteed.
Cridit Lyonnais shall not be liable for this e-mail if modified or falsified.
If you are not the intended recipient of this e-mail, please delete it
immediately from your system and notify the sender of the wrong delivery
and the mail deletion.
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers
Received on Tue Jun 25 05:10:30 2002

This archive was generated by hypermail 2.1.8 : Thu Mar 03 2016 - 06:42:47 EST