[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference kernel::csguk_systems

Title:CSGUK_SYSTEMS
Notice:No restrictions on keyword creation
Moderator:KERNEL::ADAMS
Created:Wed Mar 01 1989
Last Modified:Thu Nov 28 1996
Last Successful Update:Fri Jun 06 1997
Number of topics:242
Total number of notes:1855

82.0. "VMS bugs and problems" by KERNEL::ANTHONY () Mon Jan 08 1990 12:32

	Hello All,

	The following info was supplied by Rob Hasking...
	Bottom line is advise upgrade to 5.2, or set mpw_iolimit to 1

	Cheers, Brian
			posted in csg_systems
---------------------------------------------------------------------------


This article details a problem in memory management on VMS V 5 systems prior
to VMS 5.2.



                         Resolution Statement.

  As it has become evident, a possible page corruption under certain 
  conditions in any of VMS Version 5 releases PRIOR to V5.2 can cause a 
  system fatal bugcheck.  Usually, the corrupted page is discovered 
  during an image or process run down, at which point the system
  bugchecks.  SOME of the discovered bugcheck reasons include WSLXVANMAT, 
  BADPAGFILD, PGFIPLHI, SSRVEXCEPT, INVEXCEPTN, with SOME of the addresses
  on the Kernel stack pointing to MMG facility (typically MMG$DELPAG)

  Following are some of the currently known characteristics that may
  yield such system behavior, particularly in the environments with LARGE
  amount of memory on some of the faster CPU's (uni OR multiprocessors): 

      o  Large amount of paging activity can aggravate this condition;

      o  When a process needs to re-use a Working Set List Entry, a 
 	 dead page scan will get initiated.  As a result, the most 
 	 likely candidates to be selected will be the WSLE's that 
 	 describe a page that is not valid (as indicated by its PTE) 
 	 but is still in the working set.  This can cause increased 
 	 I/O activity as modified pages are forced to be written to 
 	 their backing store;

      o  The corrupted page is brought back in from the Page file, as 
 	 indicated in the system Crash Dump by the recently read IRP;

      o  Page file I/O activities are high and the number of outstan-
 	 ding I/O's (controlled by MPW_IOLIMIT SYSGEN Parameter) is 
 	 set to higher than 1. (NOTE: the DEFAULT setting is 4)

  An extensive cooperative effort was recently undertaken by the VMS 
  Engineering, CSSE and Field Customer Support organizations to better
  understand the nature and specific causes of this system behavior.  

  A number of tests were executed on a range of various VMS software and
  hardware configurations.  As a result, this system behavior was 
  simulated on variety of high-speed CPU's and versions of VMS.  While
  the symptoms could be consistently reproduced on releases prior to
  Version 5.2, the problem did not occur in an environment running VMS
  V5.2.  Furthermore, it was determined that none of the VMS V5 releases
  experienced this behavior with MPW_IOLIMIT SYSGEN Parameter set to 1
  (thus simulating pre-V5.0 behavior of the Modified Page Writer (MPW)
  component of VMS Memory Management, MMG).

  In addition, the problem was successfully simulated on a VMS V5.2
  system ONLY when VMS code was modified NOT to include any of the
  functional and design enhancements introduced in VMS MMG at V5.2 time
  frame.  Again, when the SYSGEN Parameter MPW_IOLIMIT was set to 1 on
  the MODIFIED VMS V5.2 system, the problem would NO longer appear.

  Therefore, based on the analysis of the above study, it is believed
  that the following enhancements in VMS V5.2 MMG have significant
  positive impact on the occurrence of this condition:

	o  Addition of selective page writing mechanism based on MPW's
	   internal states;

	o  Addition of termination logic to MMG$MPW_END routine such 
  	   that the Pending I/O, MPW state and Error conditions are 
  	   handled;

	o  Addition of threshold to check whether pending I/O requests 
	   have been disabled.

  The additional information that was obtained as a result of this 
  in-depth analysis was sufficient to better define the nature of this 
  system behavior and to identify a set of recommendations to systems 
  configurations experiencing this failure.  Specifically, based on the 
  testing performed on various configurations, the analysis of the 
  results and of the V5.2 changes to VMS MMG, it was concluded that VMS 
  V5.2 significantly reduces the probability of experiencing this
  problem, if not eliminates it all together.

  Therefore, based on the results of this study, it is recommended that
  systems experiencing this behavior are upgraded to VMS V5.2.
         However,  FOR THE INTERIM, those systems that   continue to 
  run Pre-5.2 version of VMS  AND are experiencing this problem 
  are strongly encouraged to reduce paging  activities, increase
  processes' working set sizes, and DEFINITELY set MPW_IOLIMIT SYSGEN
  Parameter to 1.  It should be noted that these actions are believed
  NOT to have any significant impact on the overall system's
  performance.


  In addition to setting this parameter ,all VMS 5.1-n systems should 
  have the MMG patches installed to address other issues.


T.RTitleUserPersonal
Name
DateLines
82.1KERNEL::MOUNTFORDMon Feb 12 1990 03:2964
From:	GOONS::EY8786U4     12-APR-1988 13:59:35.48
To:	GOSSIP::WRIGHTON
CC:	
Subj:	from norm

From:	GOONS::EY8786U3     12-APR-1988 12:08:19.11
To:	EY8786U14,EY8786U4
CC:	
Subj:	sda help

From:	KERNEL::PETTET "MNEMONIC NORM  11-Dec-1987 1224" 11-DEC-1987 13:31
To:	@POST:RDCENG,QUINTANA
Subj:	More SDA goodies.....

Gents,

	Thought I would get in on the SDA mail coach,

	typing :-


SDA> set proc/index=???
SDA> exam ctl$ag_clidata+2AF;200

	Will exam the Command Recall Buffer. It will contain the last set of
commands that were typed on the terminal, it will not contain any commands that 
were invoked by a DCL command procedure. The buffer is circular so if you want 
to see all the commands typed then use:-


SDA> exam ctl$ag_clidata+2AF;405


	The format of the displayed data is:-

	1) Pointer (VA) to next command.
	2) Null.
	3) Size.
	4) Command String.
	5) Size
	6) pointer......etc


	Rat Holes......
	===============

	If you get the following the buffer is empty or you are pointing to the 
wrong process!!!


SDA> exam ctl$ag_clidata+2AF;200

00000000 00000000 00000000 7FFE350F  .5�.............     7FFE350B
Zeros suppressed from 7FFE351B through 7FFE370A
SDA> 

	You can try the exam ctl$ag...etc on the COMICS cluster (using your new
account) to test the procedure it does work and will work for VMS Versions 4.4
upwards. 


		Cheers...Norm

	My thanks to Pete Dordoy TSC
82.2KERNEL::MOUNTFORDMon Feb 12 1990 03:2921
From:	KERNEL::TRAVELL "John (I'd rather be windsurfing) Travell  09-Apr-1989 0826"  9-APR-1989 09:03:11.88
To:	@POST:SYSTEMS
CC:	
Subj:	V5 crash command recall buffer

Hi all,

	A snippet you have been waiting for, one of those things that could not 
possibly do any sensible work without,... what's that!, you knew it?, could I 
be the last to find out ??

	The DCL command recall buffer MOVED in V5....

	It is now at ... CTL$AG_CLIDATA+2B4;3F0

In symbolic terms, this has moved an enormous amount, all of 5 bytes...

However, the 24 bytes between CTL$AG_CLIDATA+29C & +2B4 appear to contain the
logical terminal name, eg.    ._LOOKIN$VTA894......... 

				John Travell. 
82.3Another way...COMICS::EDMUNDSSun Feb 25 1990 13:4515
    
    
    	And further....
    
    	Another good way,sometimes possibly better,kindly passed on to
    	me by Ken Robb....
    
    	Pick up the process index
    	Find the term name if poss(sho proc/chan....LTAXXX etc)
    	sho dev LTAXXXX and look for the UCB address (<ADDRESS>)
    	read sys$system:sysdef
    	for <ADDRESS> /type=UCB (look for UCB$L_TT_TYPAHD (<ADDRESS2>)
    	ex <ADDRESS2> ;200(or so...)
    
    	
82.4KERNEL::MOUNTFORDTue Jul 03 1990 12:37151
    From Geoff Judd.
    
        A new BACKUP patch has been released which fixes a number of problems.
    The problems are listed in the .README file which is included below.
    This is the patch which is referred to on STARS as CSCPAT_0101 and
    says that its VOID name is BACKUP$PATCH01_531. 

    The patch is called BCK$IMG01_531 and is note #142 in the VMS_PATCHES
    notes conference. 
    
    Geoff Judd.

Patch Identification:

	BCK$IMG01_531


Patch Policy Statement:

	This image is to be used ONLY as part of a solution to a specific 
	problem and NOT as a troubleshooting method, i. e. for 'quick fix 
	tries' at a customer site.


Image Description:

	The Backup image provided in this kit is built based on VMS V5.3-1.
	The Backup cumulative image can be applied to any VMS system running 
	VMS V5.2 to V5.3-1, inclusive.  The following is a list of bug fixes 
	in this image.  A brief description of the problem or symptom is 
	included along with which version(s) these problems have been 
	reported on.


Problem Descriptions:

	- Backup performance degrading on non-CRC emulated machines
	  V5.2 - V5.3-1

	As of v5.2, any machine that does not do CRC emulation is probably 
	seeing a performance degredation if they are using the /CRC qualifier 
	( default, which is recommended by Digital ).  Machines that are 
	likely to see this problem are 11/750, 11/780 and 11/785.

	- ODS-1 data integrity issue
	  V5.2 - V5.3-1

	The problem's symptom was when performing a /VERIFY on the save 
	operation, Backup VBNMISSING errors were reported.  The problem 
	described is more apparent on fragmented disk structures.  Block 
	count for each header extent was being calculated improperly.  This 
	in turn caused the file's size to be truncated and compromised data 
	integrity.

	- ODS-1 Backup looping problem
	  V5.2 - V5.3-1

	When performing a Backup operation on an ODS-1 disk structure, 
	the Backup would hang.  This can be detected by the ^T function, 
	and the amount of time the Backup operation takes.  The problem 
	described is more apparent on fragmented disk structures.  This 
	is usually seen on large files with multiple header extents.

	- BACKUP/DELETE may erroneously delete files
	  V5.3 - V5.3-1

	A input file was erroneously deleted when an error was encountered
	creating the output file during a disk to disk image backup or backup
	copy operation.  

	- Reserved operand when mounting second tape volume
	  V5.3 - V5.3-1

	When Backup issued the command to mount the second volume, a 
	reserved operand error was likely to occur.  This was seen when 
	specifying /NOASSIST on the Backup command.

	- ACCVIO at PC F9AE
	  V5.2 - V5.3-1

	An access violation occurred at the PC F9AE in Backup.

	- Backup hanging when using /COMPARE
	  V5.2 - V5.3-1

	When doing a Backup compare operation, the Backup would hang if 
	process quotas are below the recommended values.  See section 8.2 
	in the "VMS System Manager's Manual" for Digital recommended values.

	- Restoring of multi-volume disk savesets
	  V5.2 - V5.3-1

	During the restore of a multi-volume disk saveset and specifying 
	/ASSIST ( default ), Backup would get into an endless loop, 
	requesting the user to mount the second volume and never returning
	control back to the user.


Installation Instructions:

	This update is designed to be installed with the VMSINSTAL utility
	as follows:

	1. Ensure that the input save set is located on disk as opposed to
	   magnetic tape. 

	2. If this kit is received on a magnetic tape, then MOUNT the tape
	   files-11, and COPY its contents to disk prior to issuing the 
	   following command -

	@SYS$UPDATE:VMSINSTAL BCK$IMG01_531 [ location of .A file ]


Limiting Parameters:

	None.


Special Notes:

	Note that this is an image update.  Any patches applied to BACKUP
	prior to this update will be lost.

	DCLTABLES is modified with this kit to include qualifiers added
	in V5.3-1.  Users will have to log out and log back in to incorporate
	this new version of DCLTABLES.

Listing of save set(s)

Save set:          BAC531.A
Written by:        HELLWIG     
UIC:               [000011,002020]
Date:              11-MAY-1990 10:41:15.62
Command:           BACKUP/NOASSIST/COMMENT=VAX/VMS SPKITBLD Procedure/INTER/LOG/VERIFY *.COM,*.CLD,BACKUP.EXE;6 SODAK$DKA0:[0,0]BAC531.A/LABEL=(BAC)/SAVE/BLOCK=9000/GROUP=25
Operating system:  VAX/VMS version V5.3
BACKUP version:    V5.3
CPU ID register:   0A000004
Node name:         _SODAK::
Written on:        _SODAK$DKA0:
Block size:        9216
Group size:        25
Buffer count:      14

[HELLWIG.BUILD_531]KITINSTAL.COM;8                          7  11-MAY-1990 10:40
[HELLWIG.BUILD_531]BACKUP.CLD;1                             8   8-NOV-1989 16:02
[HELLWIG.BUILD_531]BACKUP.EXE;6                           221  24-APR-1990 16:04

Total of 3 files, 236 blocks
End of save set