[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference vaxuum::online_bookbuilding

Title:Online Bookbuilding
Notice:This conference is write-locked: see note 1.3.
Moderator:VAXUUM::UTT
Created:Fri Aug 12 1988
Last Modified:Mon Jul 15 1991
Last Successful Update:Fri Jun 06 1997
Number of topics:440
Total number of notes:2134

84.0. "Navigation wishlist" by CURIE::HARTSHORN () Tue Feb 14 1989 13:32

    It might be nice to have an INDEX search tool.  Something that would
    return a list of index entries that contained a specified string.
     Like; SEARCH "file" would return all index entries that contain
    the word "file".
    
    It also would be nice to have some sort of navigation tool that
    would automatically bring you to a specific index entry letter level.
    Like; GOTO T would bring you to the T's in the index without having
    to scroll through A thru S.
    
    Ken
    
T.RTitleUserPersonal
Name
DateLines
84.1Why not a tool that searches the book, not its index?\REORG::SEARLEWed Feb 15 1989 08:064
    Forget searching the index; even better in my mind would be a tool that
    searched the document itself.  If that didn't eliminate the need
    for an index, it sure would lessen the heavy dependence on online
    indexes.
84.2CURIE::HARTSHORNWed Feb 15 1989 09:057
    A tool that could search the book would be a nice tool in its own
    right.  There would be many cases though where searching the book
    would be a bit awkward, for example, if I did a SEARCH BOOK "FILE"
    the tool might return 4 zillion instances where the word file was
    used.
    
    
84.3Let the user decideCRAYON::GENTThere is no poetry without madness -- DemocritusWed Feb 15 1989 09:1019
    Perhaps a generic SEARCH would solve both problems: let the user
    define the scope of the search. For example, possible subsets that
    could be searched are:
    
	Everything
    
    	All of the text

    	The Table of Contents
    
    	Individual Chapters
    
    	Individual topics/sections
    
    	The glossary
    
    	The index
    
    --Andrew
84.4MYVAX::ANDERSONDave A.Tue Mar 14 1989 01:1046
    A search-the-text function could be a very powerful tool, but I
    suspect that it needs at least the following features:
    
    1) the ability to find only those places where at least `m' of a list
       of `n' terms or phrases occur `sufficently near' to each other; without
       this it seems likely that the signal-to-noise ratio will frequently be
       too low for the search to be useful. 
    
    2) the ability to efficiently retry a search with modified criteria,
       typically for use when a search finds too many places (e.g.,
       requiring that more of the items on the list be found in one
       place, or adding additional items to the list).  This implies
       that the search function must announce how many places it has found
       rather than just starting to display them.  It would also seem
       desirable to retain the original (or all intermediate?) search 
       results, so that one can efficiently return to them and/or try
       additional variations.

    3) an automatic `thesaurus' function, so that instead of searching
       only for the exact terms that the user specified the search will
       find all `sufficiently closely related' terms (this relationship
       may need to be decided independently for each book).  This should
       include terms which don't even appear in the book, if they are ones
       that users are likely to try to search for; it should certainly
       include spelling variations.  Without this it may be excessively 
       difficult to determine the `correct' search specification to
       find any given bit of information.

    4) in order to do this sort of search efficiently enough to be useful
       for real-time navigation thru a book, it is probably necessary
       to maintain an index for each possible term to all of the sections
       of the book in which it occurs (`section' here meaning the minimum
       size chunk of the book that the search function will resolve --
       which is most likely also used to define `sufficiently near'
       in #1).  This probably requires setting up standard lists of
       `uninteresting' words (a, an, the, ...) and of related words
       (disk, disc, winchester, floppy, ...) which can be modified by
       directives in the source of each book, and manually setting up
       those directives (or modifying the standard lists) based on the
       actual usage in each book.  It may also be necessary to flag
       particular sections of a book as specifically matching or not 
       matching a given term when a literal check of the text of that
       section would give a misleading result.

    All of which looks like a real pain for the implementor and the
    writer to deal with, but the results could be well worth it.    
84.5MYVAX::ANDERSONDave A.Tue Mar 14 1989 01:145
    Addendum to .4:
    
    A really superb index should make all of this unnecessary -- but
    how likely is it that every book will have an index that anticipates
    _all_ of the ways users will think of to look for information?