[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference gyro::internet_toolss

Title:Internet Tools
Notice:Report ALL NETSCAPE Problems directly to [email protected].rnet? Read note 448.L for beginner information.
Moderator:teco.mro.dec.com::tecotoo.mro.dec.com::mayer
Created:Fri Jun 25 1993
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:4714
Total number of notes:40609

4545.0. "How many web servers within Digital?" by TENNIS::KAM (AltaVista Software 714/261-4133 DTN 535.4133) Fri Mar 14 1997 16:15

I'm running AltaVista Search Intranet PX on a HiNote VP 500 133 Mhz.  I 
pointed this at *.dec.com and came up with the following statistics.  
Does anyone know if these statistics look correct?  I thought Digital 
had over 1200 Web servers internally?

	Regards,


    AltaVista Search is currently gathering web pages.

                     Page Gathering
Cumulative Statistics:             Recent Activity:
                                   [3/14/97 00:12:49]

Pages fetched:   1947            New URLs found:     490
  Web servers:    580              URLs fetched:    0.11/sec 
                                  Bytes fetched: 1009.72/sec 

                         Index
          Pages indexed:          701 
      (duplicates eliminated)  

    
T.RTitleUserPersonal
Name
DateLines
4545.1VAXCPU::michaudJeff Michaud - ObjectBrokerFri Mar 14 1997 17:2325
> I pointed this at *.dec.com and came up with the following statistics.  
> Does anyone know if these statistics look correct?  I thought Digital 
> had over 1200 Web servers internally?
> Pages fetched:   1947            New URLs found:     490
>   Web servers:    580              URLs fetched:    0.11/sec 

	What does this tool do and what does "pointed this at *.dec.com"
	mean?  Does it mean it enumerates all the registered names in
	the dec.com domain that have address record(s)?  And I assume
	it tries to connect to the default http port (80)?

	Are you sure pointing it at *.dec.com will cause it to enumerate
	subdomains as well (ie. such as site.dec.com), or are you only
	picking up hosts registered in the top level domain specified?

	Also it's only looking for Web (ie. http) servers on the default
	port.  Not all of us (like myself) run our servers on the default
	port for various reasons (such as on UNIX systems your server
	process has to be running as root to use ports < 1024).

	Also for security reasons not all DNS servers are setup to
	allow you to use wildcards.  So in some cases you need to
	know the name (or address) of the host (think of it like
	having execute permission on a directory, but not read
	permission, on a UNIX system that is).
4545.2NYOSS1::GOODMANI see you shiver with antici.........pation!Fri Mar 14 1997 17:324
    I suppose if you tell us what your machine's name is we could all grep
    our server logs and see if you found our pages...
    
    Roy
4545.3TENNIS::KAMAltaVista Software 714/261-4133 DTN 535.4133Sat Mar 15 1997 00:2018
    re .1
    I'm running the same product that Digital does does to Index both the 
       Internet e.g., altavista.digital.com or 
       Intranet e.g, altavista.pa.dec.com
    
    re .2
    I can't provide that because it won't do you much good.  I'm at home
    running a PPP line into digital.  The system has a local LAN IP address
    of 16.62.64.191 and a DHCP IP assigned address of 16.62.80.205.  The
    system name is 'kam'.
    
    I'm using this to index different sites to demonstrate AltaVista Search
    Intranet to customers.  Before going to a customer's site that
    Interested in Search I index their sites and show them what's there.
    
    
       Regards,
    
4545.4VAXCPU::michaudJeff Michaud - ObjectBrokerSun Mar 16 1997 23:576
>     re .1
>     I'm running the same product that Digital does does to Index both the 
>        Internet e.g., altavista.digital.com or 
>        Intranet e.g, altavista.pa.dec.com

	????  I didn't ask "which product".  Reread the question(s) in .1! :-)
4545.5Lots more than thatSTAR::COPEMon Mar 17 1997 14:2518
|
|    Pages fetched:   1947            New URLs found:     490
|      Web servers:    580              URLs fetched:    0.11/sec
|                                      Bytes fetched: 1009.72/sec
|
|                             Index
|              Pages indexed:          701
|          (duplicates eliminated)
    
    The statistics are certainly incorrect. I tried an index of just 
    our site (*.zko.dec.com) over the weekend, and got 24,000+ non-
    duplicate pages indexed, compared to your 701 for "all of Digital". 
    I'd guess you have a bandwidth problem, running from home over PPP. 
    
    If you just want to know the size of the intranet, I'm sure someone
    here has a decent idea - or you could check with the guys running the 
    Palo Alto indexer to see how many they've found.

4545.6TENNIS::KAMAltaVista Software 714/261-4133 DTN 535.4133Mon Mar 17 1997 16:0712
    I don't need to know the exact figure.  I was just curious and wanted
    to ensure that I was operating correclty.
    
    I was told by the AltaVista Developer's that the settings I had were
    correct.  I guess I'll send those figures to them to see what's
    happening.
    
    I just wanted ensure that I was working so when I'm at a customer site
    and I demo this some useful and realistics is happening.
    
    	Regards,