[UPDATE] SUMMARY: E420R unexplaned panic after UE error

From: Tony van Lingen <tony.vanlingen_at_epa.qld.gov.au>
Date: Wed Mar 10 2004 - 18:47:11 EST
G'day All.

A little update on my earlier summary, which I ended with:

>As to the qla error messages in the log, Kevin reinforced my opinion 
>that forceloading the drivers is not necessary. None of these devices 
>contain boot partitions. In the mean time we have been able to trace at 
>least some of those to  a faulty UPS that the storage array is plugged 
>into (the panicky server is not plugged in there, though).
>
>  
>
QLogic have in the mean time done a little investigation in this and 
provided me with an explanation for the qla2300 messages. To refresh 
your memories, these are expamles of the messages I'm talking about:

>Feb  2 12:11:35 Slarty qla2300: [ID 175527 kern.info] qla2300(1): configure_loop, 2 gigabit data rate connection
>Feb  2 12:11:35 Slarty qla2300: [ID 467028 kern.info] qla2300(1): configure_loop, F-PORT connection
>Feb  2 12:11:35 Slarty qla2300: [ID 465925 kern.info] qla2300(1): status_entry, check condition sense data t1d0
>Feb  2 12:11:35 Slarty 70h  0h  6h  0h  0h  0h  0h  6h  0h  0h  0h  0h  29h  0h  0h  0h  0h 20h
>

Lyle Merdan of QLogic provided me with the following explanation of the last two lines (thanks Lyle) :

The t##d## is indicative of the disk that is reporting the check condition. Then at the beginning of the entry is the HBA instance. The example you gave tells me it's HBA instance 6.

> qla2300: [ID 465925 kern.info] qla2300(6): status_entry

  Q) What are these check conditions that appear when extended logging is enabled?
     qla2300: [ID 465925 kern.info] qla2300(6): status_entry, check condition sense data t94d0
     70h  0h  6h 42h 55h 5ah 5ah  ah  0h  0h  0h  0h 29h 0h  1h  0h  0h  
  A) These are errors returned from the storage to the HBA. There are two parts to a check
     condition. The ASC and ASCQ. The ASC is byte 12 and the ASCQ is byte 13. Start counting
     at 0. So in the above example the ASC is 29 and ASCQ is 0. These values can be looked up
     on this website: http://www.t10.org/lists/asc-num.htm 

As to what exactly the reported errors mean, you'll have to contact the storage vendor.

Now the reason you're getting the check conditions is you have extended logging enabled in the driver.
To disable extended logging you have to edit the /kernel/drv/qla2300.conf file and either add a line that explicitly
disables extended logging for HBA driver instance 6 OR use a GUI to turn extended logging off.

You could just add this line:
hba6-extended-logging=0;
---

The website that Lyle mentiones has full explenation of all SCSI ASC/ASCQ combinations possible. It transpires then that all messages are caused by faults on the CLARIION. We'll persue this further with Dell.

Cheers,

-- 
Tony van Lingen
Technical Consultant

Technology One Limited,
67 High Street Toowong Qld 4066

Mobile:    0413 701 284
Phone:    +61 7 3377 7300(TechOne), +61 7 3234 1972 (EPA)
Fax:      +61 7 3377 7301(TechOne), +61 7 3227 6534 (EPA)

E-mail:   tvlingen@acslink.net.au
Visit our home page at:  http://www.TechnologyOneCorp.com
Technology One's entire liability will be limited to resupplying the material enclosed. No other warranties are provided

Technology One designs, develops, implements and supports intelligent enterprise wide software applications using Internet, eBusiness and Client Server technologies for both corporate and government organisations

*********************************** Confidentiality Statement ****************************************
The information transmitted in this email is only for the recipient referred in this email and may contain confidential and/or privileged material.

If you are not the intended recipient (or responsible for delivery of the message to such person), you may not copy or deliver this message to anyone. In such case any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from the computer.


Opinions, conclusions and other information in this message that do not relate to the official business of the company shall be understood as neither given nor endorsed by it.

Technology One's entire liability will be limited to resupplying the material enclosed. No other warranties are provided

We use virus scanning software but exclude all liability for viruses or similar in any attachment.




___________________________
Disclaimer

This e-mail, including attachments if any, has originated from a Queensland government agency and may contain information that is confidential, or covered by legal professional privilege, and is intended for the named recipient(s) only.  If you have received this message in error, you are asked to inform the sender as quickly as possible and delete this message and any copies of this message from your computer system network.

Any form of disclosure, modification, distribution and/or publication of this e-mail, including attachments is prohibited.  Unless otherwise stated, this e-mail, including attachments represents the views of the sender and not the views of the Environmental Protection Agency.

Although this e-mail has been checked for the presence of computer viruses, the Environmental Protection Agency provides no warranty that all possible viruses have been detected and cleaned.  Any use of this e-mail could harm your computer system.
___________________________
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers
Received on Wed Mar 10 18:51:46 2004

This archive was generated by hypermail 2.1.8 : Thu Mar 03 2016 - 06:43:29 EST