T.R | Title | User | Personal Name | Date | Lines |
---|
308.1 | analysis of errors would help. | SSDEVO::MARTENS | Qualification Program Manager | Wed Apr 30 1997 12:27 | 13 |
| This could have many causes. The error log should help determine whether
all errors are for a single block, file, or area, or are random. Is this
error a drive memory error, or a failure during the MSCP compare
function? If the I/O does not have a compare qualifier, then it
is a drive issue. If the error is a failure of an MSCP data compare
between disk data and host data, it is possible that the host data
is changing before the compare completes. This has been seen with
third-party defrag tools and other third-party code.
Reviewing the error logs is a good place to start.
Bert
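As a rough aid for the "single block, file, area, or random" question, the sketch below groups the entries of an ANALYZE/ERROR listing by event code and controller-dependent longwords. It is only an illustration in Python: the function name `summarize` and the sample text are hypothetical, the regular expressions assume the listing layout shown in this topic, and what the controller-dependent longwords actually encode is controller-specific and not assumed here.

```python
import re
from collections import Counter

# Patterns assume the listing layout shown in this topic.
ENTRY_RE = re.compile(r"\*+ ENTRY (\d+)\. \*+")
EVENT_RE = re.compile(r"MSLG\$W_EVENT\s+([0-9A-F]{4})")
LONGWORD_RE = re.compile(r"LONGWORD \d+\.\s+([0-9A-F]{8})")


def summarize(listing: str) -> Counter:
    """Count entries per (event code, controller-dependent longwords)."""
    counts = Counter()
    # re.split with one capture group yields:
    # [preamble, entry_no, body, entry_no, body, ...]
    parts = ENTRY_RE.split(listing)
    for _entry_no, body in zip(parts[1::2], parts[2::2]):
        event = EVENT_RE.search(body)
        longwords = tuple(LONGWORD_RE.findall(body))
        counts[(event.group(1) if event else None, longwords)] += 1
    return counts


# Hypothetical, shortened sample in the same format as a real listing.
sample = """
******************************* ENTRY 73257. *******************************
MSLG$W_EVENT 0007
COMPARE ERROR
LONGWORD 1. 08421114
LONGWORD 2. 0022AD7F
******************************* ENTRY 73259. *******************************
MSLG$W_EVENT 0007
COMPARE ERROR
LONGWORD 1. 08421114
LONGWORD 2. 0022AD7F
"""

result = summarize(sample)
for (event, longwords), n in result.items():
    print(event, longwords, n)
```

If nearly all entries share one signature, the errors are probably not random. Confirming that they hit the same block would still need the controller's own documentation for those longwords.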
|
308.2 | errorlog of compare error | DEKVC::YONGHOKIM | | Tue May 06 1997 02:11 | 363 |
| Hi.
This is the error log file from my customer's system (%compare error).
Please help me.
sh dev d
Device               Device  Error  Volume         Free Trans Mnt
Name                 Status  Count  Label        Blocks Count Cnt
$1$DIA0:  (SYSDSK0)  Mounted     0  SV7600SYS    829160  1424   2
$1$DIA1:  (SYSDSK1)  Mounted     0  SV4300SYS    449740    19   2
$1$DIA2:  (OADISK1)  Mounted     0  USERPACK01   787456    39   2
$1$DIA3:  (OADISK2)  Mounted     0  USERPACK02   336220    14   2
$1$DIA4:  (OADISK3)  Mounted     2  USERPACK03    65527  1312   2
$1$DIA5:  (OADISK4)  Mounted  1282  USERPACK04  1142802    49   2
$1$DIA6:  (OADISK5)  Mounted   241  USERPACK05   497448   369   2
$1$DIA7:  (OADISK6)  Mounted  1735  USERPACK06   917512   416   2
$1$DIA8:  (OADISK7)  Mounted     0  USERPACK07  1699167   178   2
$1$DIA9:  (OADISK8)  Mounted     0  USERPACK08  1562553   247   2
$1$DIA10: (OADISK9)  Mounted     0  USERPACK09  2345625   171   2
$1$DIA11: (OADISK10) Mounted    40  USERPACK10   441064    35   2
$1$DIA12: (SCSIDISK) Mounted     0  USERPACK11  1234676   164   2
$1$DIA13: (SCSIDISK) Mounted     0  USERPACK12  1787788     1   2
$1$DIA14: (SCSIDISK) Mounted     2  USERPACK13   946056     2   2
$1$DIA15: (SCSIDISK) Mounted     0  USERPACK14  1442384     5   2
$1$DIA16: (SCSIDISK) Mounted     0  USERPACK15  1396076    28   2
$1$DIA17: (HSD2)     Mounted     0  USERPACK16   612936    88   2
$1$DIA18: (HSD2)     Mounted     1  USERPACK17  1538541    39   2
$1$DIA19: (HSD2)     Mounted     0  USERPACK18  1729413     3   2
$1$DIA20: (HSD2)     Mounted     0  USERPACK19  4659327     4   2
$1$DIA21: (HSD2)     Online      0
$2$DIA1:  (SV4301)   Mounted     0  SV4301SYS    428216     1   2
$ anal/err/since=yesterday/inc=dia5/out=disk.lis
Error Log Report Generator Version V5.5
$ TY DISK.LIS
******************************* ENTRY 73571. *******************************
ERROR SEQUENCE 26446. LOGGED ON: SID 17000202
DATE/TIME 6-MAY-1997 12:41:17.99 SYS_TYPE 01030001
SYSTEM UPTIME: 7 DAYS 15:16:38
SCS NODE: SV7600 VAX/VMS V5.5-2
"UNKNOWN DEVICE" ENTRY KA7AA-AA CPU FW REV# 2. CONSOLE FW REV# 0.3
ERROR LOG RECORD
ERF$L_SID 17000202
SYSTEM ID REGISTER
ERL$W_ENTRY 0064
ERROR ENTRY TYPE
EXE$GQ_SYSTIME 419CA460
009B3DA3 64 BIT TIME WHEN ERROR LOGGED
ERL$GL_SEQUENCE 674E
UNIQUE ERROR SEQUENCE = 26446.
BYTE <3:0> 00003402 /.4../
BYTE <7:4> 38465409 /.TF8/
BYTE <11:8> 4D243736 /67$M/
BYTE <15:12> 54094149 /IA.T/
BYTE <19:16> 37363846 /F867/
BYTE <23:20> 41494D24 /$MIA/
BYTE <27:24> 7B010000 /...{/
BYTE <31:28> 37565308 /.SV7/
BYTE <35:32> 20303036 /600 /
BYTE <39:36> 00000002 /..../
BYTE <43:40> 00000000 /..../
BYTE <47:44> 800A0012 /..../
BYTE <51:48> B80C0157 /W.../
BYTE <55:52> 42000022 /"..B/
BYTE <59:56> 0004016E /n.../
BYTE <63:60> FF440001 /..D./
BYTE <67:64> 5A47017E /~.GZ/
BYTE <71:68> 00430315 /..C./
BYTE <75:72> 00000000 /..../
BYTE <79:76> 00000000 /..../
BYTE <83:80> 001A0205 /..../
BYTE <87:84> C19A0000 /..../
BYTE <91:88> 000100C6 /..../
BYTE <95:92> 01010000 /..../
BYTE <99:96> 00010001 /..../
BYTE <103:100> 80808B89 /..../
BYTE <107:104> 90808080 /..../
BYTE <111:108> 14B50481 /..../
BYTE <115:112> 00000000 /..../
BYTE <119:116> 00000000 /..../
BYTE <123:120> 00000000 /..../
BYTE <125:124> 0000 /../
B E G I N I N G O F I N T E R V E N I N G E N T R I E S
V A X / V M S SYSTEM ERROR REPORT COMPILED 6-MAY-1997 15:57:09
PAGE 1.
******************************* ENTRY 73257. *******************************
ERROR SEQUENCE 26001. LOGGED ON: SID 17000202
DATE/TIME 5-MAY-1997 11:33:47.87 SYS_TYPE 01030001
SYSTEM UPTIME: 6 DAYS 14:09:07
SCS NODE: SV7600 VAX/VMS V5.5-2
ERL$LOGMESSAGE ENTRY KA7AA-AA CPU FW REV# 2. CONSOLE FW REV# 0.3
I/O SUB-SYSTEM, UNIT _OADISK4$DIA5:
MESSAGE TYPE 0001
DISK MSCP MESSAGE
MSLG$L_CMD_REF 1C0B002E
MSLG$W_UNIT 0005
UNIT #5.
MSLG$W_SEQ_NUM 044A
SEQUENCE #1098.
MSLG$B_FORMAT 04
SMALL DISK LOG
MSLG$B_FLAGS 00
UNRECOVERABLE ERROR
MSLG$W_EVENT 0007
COMPARE ERROR
MSLG$Q_CNT_ID 32707567
016D4183
UNIQUE IDENTIFIER, 418332707567(X)
MASS STORAGE CONTROLLER
MODEL = 109.
MSLG$B_CNT_SVR 1B
CONTROLLER SOFTWARE VERSION #27.
MSLG$B_CNT_HVR 01
CONTROLLER HARDWARE REVISION #1.
MSLG$W_MULT_UNT 0000
MSLG$Q_UNIT_ID 32707567
02364183
UNIQUE IDENTIFIER, 418332707567(X)
DISK CLASS DEVICE (166)
MODEL = 54.
MSLG$B_UNIT_SVR 1B
UNIT SOFTWARE VERSION #27.
MSLG$B_UNIT_HVR 01
UNIT HARDWARE REVISION #1.
MSLG$W_SDE_CYL 039F
MSLG$L_VOL_SER 74555431
VOLUME SERIAL #1951749169.
CONTROLLER DEPENDENT INFORMATION
LONGWORD 1. 08421114
/..B./
LONGWORD 2. 0022AD7F
/.."./
LONGWORD 3. 00080056
/V.../
LONGWORD 4. 0000C098
/..../
LONGWORD 5. 00000000
/..../
******************************* ENTRY 73258. *******************************
ERROR SEQUENCE 26002. LOGGED ON: SID 17000202
DATE/TIME 5-MAY-1997 11:34:54.92 SYS_TYPE 01030001
SYSTEM UPTIME: 6 DAYS 14:10:15
SCS NODE: SV7600 VAX/VMS V5.5-2
ERL$LOGMESSAGE ENTRY KA7AA-AA CPU FW REV# 2. CONSOLE FW REV# 0.3
I/O SUB-SYSTEM, UNIT _OADISK4$DIA5:
MESSAGE TYPE 00
MSLG$W_MULT_UNT 0000
MSLG$Q_UNIT_ID 32707567
02364183
UNIQUE IDENTIFIER, 418332707567(X)
DISK CLASS DEVICE (166)
MODEL = 54.
MSLG$B_UNIT_SVR 1B
UNIT SOFTWARE VERSION #27.
MSLG$B_UNIT_HVR 01
UNIT HARDWARE REVISION #1.
MSLG$W_SDE_CYL 039F
MSLG$L_VOL_SER 74555431
VOLUME SERIAL #1951749169.
CONTROLLER DEPENDENT INFORMATION
LONGWORD 1. 08421114
/..B./
LONGWORD 2. 0022AD7F
/.."./
LONGWORD 3. 00080056
/V.../
LONGWORD 4. 0000C098
/..../
LONGWORD 5. 00000000
/..../
******************************* ENTRY 73259. *******************************
ERROR SEQUENCE 26003. LOGGED ON: SID 17000202
DATE/TIME 5-MAY-1997 11:35:10.71 SYS_TYPE 01030001
SYSTEM UPTIME: 6 DAYS 14:10:30
SCS NODE: SV7600 VAX/VMS V5.5-2
ERL$LOGMESSAGE ENTRY KA7AA-AA CPU FW REV# 2. CONSOLE FW REV# 0.3
I/O SUB-SYSTEM, UNIT _OADISK4$DIA5:
MESSAGE TYPE 0001
DISK MSCP MESSAGE
MSLG$L_CMD_REF EEBE003E
MSLG$W_UNIT 0005
UNIT #5.
MSLG$W_SEQ_NUM 044C
SEQUENCE #1100.
MSLG$B_FORMAT 04
SMALL DISK LOG
MSLG$B_FLAGS 00
UNRECOVERABLE ERROR
MSLG$W_EVENT 0007
COMPARE ERROR
MSLG$Q_CNT_ID 32707567
016D4183
UNIQUE IDENTIFIER, 418332707567(X)
MASS STORAGE CONTROLLER
MODEL = 109.
MSLG$B_CNT_SVR 1B
CONTROLLER SOFTWARE VERSION #27.
MSLG$B_CNT_HVR 01
CONTROLLER HARDWARE REVISION #1.
MSLG$W_MULT_UNT 0000
MSLG$Q_UNIT_ID 32707567
02364183
UNIQUE IDENTIFIER, 418332707567(X)
DISK CLASS DEVICE (166)
MODEL = 54.
MSLG$B_UNIT_SVR 1B
UNIT SOFTWARE VERSION #27.
MSLG$B_UNIT_HVR 01
UNIT HARDWARE REVISION #1.
MSLG$W_SDE_CYL 039F
MSLG$L_VOL_SER 74555431
VOLUME SERIAL #1951749169.
CONTROLLER DEPENDENT INFORMATION
LONGWORD 1. 08421114
/..B./
LONGWORD 2. 0022AD7F
/.."./
LONGWORD 3. 00080056
/V.../
LONGWORD 4. 0000C098
/..../
LONGWORD 5. 00000000
/..../
******************************* ENTRY 73260. *******************************
ERROR SEQUENCE 26004. LOGGED ON: SID 17000202
DATE/TIME 5-MAY-1997 11:35:32.49 SYS_TYPE 01030001
SYSTEM UPTIME: 6 DAYS 14:10:52
SCS NODE: SV7600 VAX/VMS V5.5-2
ERL$LOGMESSAGE ENTRY KA7AA-AA CPU FW REV# 2. CONSOLE FW REV# 0.3
I/O SUB-SYSTEM, UNIT _OADISK4$DIA5:
MESSAGE TYPE 0001
DISK MSCP MESSAGE
MSLG$L_CMD_REF 9B6E0054
MSLG$W_UNIT 0005
UNIT #5.
MSLG$W_SEQ_NUM 044D
SEQUENCE #1101.
MSLG$B_FORMAT 04
SMALL DISK LOG
MSLG$B_FLAGS 00
UNRECOVERABLE ERROR
MSLG$W_EVENT 0007
COMPARE ERROR
MSLG$Q_CNT_ID 32707567
016D4183
UNIQUE IDENTIFIER, 418332707567(X)
MASS STORAGE CONTROLLER
MODEL = 109.
MSLG$B_CNT_SVR 1B
CONTROLLER SOFTWARE VERSION #27.
MSLG$B_CNT_HVR 01
CONTROLLER HARDWARE REVISION #1.
MSLG$W_MULT_UNT 0000
MSLG$Q_UNIT_ID 32707567
02364183
UNIQUE IDENTIFIER, 418332707567(X)
DISK CLASS DEVICE (166)
MODEL = 54.
MSLG$B_UNIT_SVR 1B
UNIT SOFTWARE VERSION #27.
MSLG$B_UNIT_HVR 01
UNIT HARDWARE REVISION #1.
MSLG$W_SDE_CYL 039F
MSLG$L_VOL_SER 74555431
VOLUME SERIAL #1951749169.
CONTROLLER DEPENDENT INFORMATION
LONGWORD 1. 08421114
/..B./
LONGWORD 2. 0022AD7F
/.."./
LONGWORD 3. 00080056
/V.../
LONGWORD 4. 0000C098
/..../
LONGWORD 5. 00000000
/..../
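The EXE$GQ_SYSTIME quadword in entry 73571 can be cross-checked against the DATE/TIME line printed for the same entry. The Python sketch below assumes the listing prints the low-order longword first (419CA460) and the high-order longword second (009B3DA3), and that the value counts 100-nanosecond units from 17-NOV-1858, the VMS base date. The helper name `vms_time` is made up for this example.

```python
from datetime import datetime, timedelta

# Base date of the VMS 64-bit system time.
VMS_EPOCH = datetime(1858, 11, 17)


def vms_time(high: int, low: int) -> datetime:
    """Convert a VMS time quadword (two longwords) to a datetime."""
    ticks = (high << 32) | low  # 100-nanosecond units since the base date
    return VMS_EPOCH + timedelta(microseconds=ticks // 10)


# Longwords as printed for entry 73571 (low-first order is an assumption).
stamp = vms_time(0x009B3DA3, 0x419CA460)
print(stamp)
```

With these inputs the result is 6-MAY-1997 12:41:17.99, which matches the DATE/TIME field of that entry and supports the assumed longword order.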
|
308.3 | what were you doing at the time? | SUBSYS::VIDIOT::PATENAUDE | Ask your boss for ARRAY's... | Tue May 13 1997 09:20 | 33 |
| If you mounted the drive with DATA_CHECK READ enabled, or you are running
applications that issue QIOs with the DATA_CHECK modifier, you will
notice that compare errors are logged. It is important to note that no user
data is lost or corrupted. What you describe is NORMAL behavior on any
DSSI device.
This is because MSCP does not require a READ and its COMPARE
to be executed as an atomic operation!
This is aggravated by disk defragmenting programs and by heavy disk I/O
on small files. The problem can also be demonstrated with programs
that exercise a few blocks without any type of locking (e.g., QIOs vs.
QIOWs). Replacing hardware will NOT FIX this.
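The non-atomic transfer and compare can be pictured with a toy model. The Python sketch below is not MSCP or controller code: `write_with_compare`, the dictionary standing in for the disk, and the `mutate` callback are all hypothetical. It only shows that if the host buffer changes between the transfer and the separate compare pass, the compare fails even though the block on disk holds exactly what was written.

```python
def write_with_compare(disk, lbn, host_buffer, mutate=None):
    """Write host_buffer to disk[lbn], then compare in a second pass.

    Returns True when the compare passes. `mutate`, if given, models
    another program touching the buffer between the two steps.
    """
    disk[lbn] = bytes(host_buffer)          # step 1: data transfer
    if mutate is not None:
        mutate(host_buffer)                 # buffer changes before the compare
    return disk[lbn] == bytes(host_buffer)  # step 2: separate compare


def scribble(buf):
    """Hypothetical unsynchronized writer: flips the first byte."""
    buf[0] ^= 0xFF


disk = {}
quiet_ok = write_with_compare(disk, 10, bytearray(b"payload"))
racy_ok = write_with_compare(disk, 11, bytearray(b"payload"), mutate=scribble)

print(quiet_ok)  # compare passes when nothing touches the buffer
print(racy_ok)   # compare fails once the buffer changes mid-operation
print(disk[11])  # the block still holds the data that was written
```

This is consistent with the point above that a compare error of this kind is logged without any user data on the disk being lost.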
Does the customer run any defrag programs while using the device for work?
Is the drive mounted or initialized with DATA=WRITE or DATA=READ (or both)?
If the answer to either of the above questions is YES, then what you are
seeing is NORMAL, expected behavior.
If the answer is NO, then:
What did you do to recreate this?
Was any hardware replaced?
Do you have any better error log analysis tools (DECevent, SWEAT, or FSTerr)?
Can you move the errorlog.sys file to a system with a better analysis tool
than VMS 5.5?
Roger.
|