[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference spezko::cluster

Title:+ OpenVMS Clusters - The best clusters in the world! +
Notice:This conference is COMPANY CONFIDENTIAL. See #1.3
Moderator:PROXY::MOORE
Created:Fri Aug 26 1988
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:5320
Total number of notes:23384

5247.0. "MC shows unavailable" by TMCUKA::ROWELL (Paul Rowell @BBP) Mon Mar 10 1997 09:11

    I'm trying to build a Memory Channel cluster using SCSI for the disks
    and MC for cluster comms, the problem I'm seeing is similar to note
    5240 but I'm seeing a device status of Unavailable.
    
    All the MC_ parameters are set to their defaults.
    
    I'm using a real hub.
    
    The output of SDA SHOW DEV MCA0/PDT is shown below
    I/O data structures
    -------------------
    MCA0                                           MC_SPUR             UCB:
    80DA55C0
    Device status:   00000010 online
    Characteristics: 00400000 elg
                     00000000
    Owner UIC [000001,000004]   Operation count          0   ORB address   
    80DA5700
          PID        00000000   Error count              0   DDB address   
    80DA5400
    Class/Type          80/31   Reference count          0   DDT address   
    80C975C0
    Def. buf. size          0   BOFF              00000000   CRB address   
    80DA5480
    DEVDEPEND        00000000   Byte count        00000000   I/O wait queue
    80DA562C
    DEVDEPND2        00000000   SVAPTE            00000000
    DEVDEPND3        00000000   DEVSTS            00000001
    FLCK index             3A
    DLCK address     80DA5500
            *** I/O request queue is empty ***
    I/O data structures
    -------------------
              --- Memory Channel PDT Block (MC_PDT) FFFFFFFF.80DA5780 ---
    ADP Address: FFFFFFFF.80D50540    Hub Type:       00000001 (REAL HUB
    SLAVE)
    Adapter Type        0000000049    Adapter Ver           11
    Adapter TMO                  0    Adapter State   00000066
    (LINK_ONLINE)
    Node ID:                     5    Local Node State:              1
    Max Nodes                    0    Online Nodes            000000A0
    Error Count                  0    Init. Retries                  0
    HomeBlk      FFFFFFFF.8E02E000    Community                      0
    Xmit HomeBlk FFFFFFFF.860B0000    Community TMO                  0
    Tags size    00000000.00014000    Interrupt size 00000000.0002A000
    Locks size   00000000.00008000    Regions size   00000000.00008000
    Channel size          00A0A000
     
    Is there anything special I should do for memory channel clustering?
    
    -Paul
T.RTitleUserPersonal
Name
DateLines
5247.1EEMELI::MOSEROrienteers do it in the bush...Mon Mar 10 1997 11:2813
    the SDA output shows a state of 'online' and not 'unavailable' as you
    mentioned in your title. 'Unavailable' is the state after the MC spur
    has passed all the hardware init, but before there is a link ok to
    either a real MC hub or the other node in a virtual hub is also up and
    running.
    
    In your case, there are currently 2 nodes online in a real hub config
    with node id's 5 and 7. 
    
    This all looks ok, so where's the problem? or show us the same SDA
    output from the other node.
    
    /cmos
5247.2Confused?TMCUKA::ROWELLPaul Rowell @BBPMon Mar 10 1997 12:2710
    The unavailable came from:
    
    SHOW DEV MC
    Device                  Device           Error
     Name                   Status           Count
    MCA0:                   Unavailable          0
    
    May be I'm confused?
    
    -Paul
5247.3No Systems Programming Interface yet!EVMS::PERCIVALOpenVMS Cluster EngineeringMon Mar 10 1997 17:208
    This is the usual behaviour with Memory Channel devices - SHOW will
    always give a device status of Unavailable.  You can only see the real
    state through SDA. 
    
    I've asked the memory channel people and they say this is to prevent
    people using the SPI which is not released as yet...
    
    Hope this clears the mystery up a little!
5247.4Confused re Cluster_config also.TMCUKA::ROWELLPaul Rowell @BBPTue Mar 11 1997 03:4316
    The other thing that added to my confusion was that when I tried to use
    cluster_config to disable the LAN for comms it said:
    
        WARNING: This node has no CI, DSSI, or Memory Channel enabled for
    cluster
                 communications. Disabling the LAN will leave this node
    unable
                 to participate in a cluster.
    
        The configuration procedure has completed successfully.
    
    This made me think that the MC wasn't working, so I then did the SHOW
    DEV....
    
    -Paul
           
5247.5EEMELI::MOSEROrienteers do it in the bush...Tue Mar 11 1997 09:039
    was that the initial cluster_config run when PMDRIVER wasn't loaded?
    
    $ SHOW CLUSTER /CONT
    ADD LPORT
    
    Do you see a PMA0 besides the PEA0? If not, then PMdriver is not loaded
    and no SCS traffic going over MC. If there, then things are fine.
    
    /cmos
5247.6A bug?TMCUKA::ROWELLPaul Rowell @BBPTue Mar 11 1997 13:0227
    Currently
    
    $SHOW CLUSTER/CONT
    ADD LPORT 
    
    shows:
    
    View of Cluster from system ID 64716  node: TMC2K4         11-MAR-1997
    17:55:39
          SYSTEMS       MEMBERS CIRCUIT
      NODE   SOFTWARE   STATUS  LPORT 
     TMC2K4  VMS V7.1  MEMBER   PMA0  
                                PEA0  
     TMC2K1  VMS V7.1  MEMBER   PMA0  
                                PEA0  
       
    
    But if I try to disable LAN for cluster comms, from cluster_config then
    I get the message:
    
    WARNING: This node has no CI, DSSI, or Memory Channel enabled for
    cluster communications. Disabling the LAN will leave this node unable
    to participate in a cluster.
    
    I reckon that's a bug!
    
    -Paul
5247.7EEMELI::MOSEROrienteers do it in the bush...Tue Mar 11 1997 13:308
    ok, that might be true that in CLUSTER_CONFIG and/or CLUSTER_CONFIG_LAN
    is a bug.
    
    You could use the LAVC$STOP_BUS in SYS$EXAMPLES to stop the cluster
    traffic on the LAN adapter(s), and if your cluster stays up, you can
    bet that MC works fine for you.
    
    /cmos
5247.8EEMELI::MOSEROrienteers do it in the bush...Tue Mar 11 1997 13:3712
    you're right, both C_C and C_C_L use the following checks when checking
    the interconnects:
    
    lan_cluster		- see if PEA0 exists
    ci_cluster		- see if PNA0 exists
    dssi_cluster	- see if PIA0 or PAA0 exists
    mc_clust_this	- here they should check for PMA0 existance, but it
    			  looks like they do some strange things...
    
    so please QAR this against CLUSTER_CONFIG and CLUSTER_CONFIG_LAN
    
    /cmos
5247.9Here is the fix!TMCUKA::ROWELLPaul Rowell @BBPWed Mar 12 1997 06:2024
    Yes I will log QAR.                                           
    
    The fix is as follows:
    
    In the SUB CHECK_INTERCONNECT there is the following:
    
    $IF host_arch .EQS. "Alpha"
    $THEN
    $  ci_cluster = F$GETDVI("PNA0","EXISTS")
    $  dssi_cluster = F$GETDVI("PIA0","EXISTS") .OR. F$GETDVI("PAA0","EXISTS")
    $  IF ci_cluster
    $  THEN
    $    IF F$GETDVI("PNA0","DEVTYPE") .EQ. 41
    $    THEN
    $      ci_cluster = "FALSE"
    $      dssi_cluster = "TRUE"
    $    ENDIF                  <--- There should be an extra ENDIF here
    $    IF mc_config
    $    THEN
    $      GOSUB mc_check_if_enabled
    $    ENDIF
    $  ENDIF
    $ENDIF                      <---- This ENDIF should be removed
    
5247.10EVMS::MORONEYvi vi vi - Editor of the BeastFri May 09 1997 16:226
I put this fix in RAVEN, so you won't get that spurious message anymore once
it hits the streets.

Thanks for locating the problem.

-Mike