Tefko Saracevic
School of Communication, Information and Library Studies, Rutgers, The State University of New Jersey,
4 Huntington St., New Brunswick, N.J. 08903
Paul Kantor
Tantalus Inc. and Department of Operations Research, Weatherhead School of Management,
Case Western Reserve University, Cleveland, Ohio 44706
The objectives of the study were to conduct a series of observations and experiments under as real-life a situation as possible related to: (i) user context of questions in information retrieval; (ii) the structure and classification of questions; (iii) cognitive traits and decision making of searchers; and (iv) different searches of the same question. The study is presented in three parts: Part I presents the background of the study and describes the models, measures, methods, procedures, and statistical analyses used. Part II is devoted to results related to users, questions, and effectiveness measures, and Part III to results related to searchers, searches, and overlap studies. A concluding summary of all results is presented in Part III.

*Work done under the NSF grant IST85-05411 and a DIALOG grant for search time.
Present address: Kent State University, Kent, Ohio
*Present address: Dyke College, Cleveland, Ohio
Received February 5, 1987; accepted March 19, 1987.
© 1988 by John Wiley & Sons, Inc.
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE. 39(3):161-176, 1988 CCC 0002-8231/88/030161-16$04.00

Introduction

Problem, Motivation, Significance

Users and their questions are fundamental to all kinds of information systems, and human decisions and human-system interactions are by far the most important variables in processes dealing with searching for and retrieval of information. These statements are true to the point of being trite. Nevertheless, it is nothing short of amazing how relatively little knowledge and understanding in a scientific sense we have about these factors. Information retrieval systems, expert systems, management and decision information systems, reference services and so on, are instituted to answer questions by users: this is their reason for existence and their basic objective, and this is (or at least should be) the overriding feature in their design. Yet, by and large and with very few exceptions (see ref. 1) the basis for their design is little more than assumptions based on common sense and interpretation of anecdotal evidence. A similar situation exists with online searching of databases. While the activity is growing annually by millions of searches it is still a professional art based on a rather loosely stated set of principles (see ref. 2) and experience. While there is nothing inherently wrong with common sense, professional art, and principles derived from experience or by reasoning, our knowledge and understanding and with them our practice would be on much more solid ground if they were confirmed or refuted, elaborated, cumulated, and taught on the basis of scientific evidence.

Since 1980 a number of comprehensive critical literature reviews have appeared on various topics of information seeking and retrieving, among them reviews of research on
• interactions in information systems, by Belkin and Vickery [3]
• information needs and uses, by Dervin and Niles [4]
• psychological research in human computer interaction, by Borgman [5]
• design of menu selection systems, by Shneiderman [6]
• online searching of databases, by Fenichel [7] and Bellardo [8].

It is most indicative that an identical conclusion appears in every one of these reviews despite different orientation of
the review and different backgrounds of the reviewers. They all conclude that research has been inadequate and that more research is needed. In the words of Belkin and Vickery: ". . . research has not yet provided a satisfactory solution to the problem of interfacing between end-user and large scale databases." Despite a relatively large amount of literature about the subject, the research in information seeking and retrieving is in its infancy. It is still in an exploratory stage.

Yet, the future success or failure of the evolving next generation of information systems (expert systems, intelligent front-ends, etc.) based on built-in intelligence in human-system interactions depends on greatly increasing our knowledge and understanding of what is really going on in human information seeking and retrieving. The key to the future of information systems and searching processes (and by extension, of information science and artificial intelligence from where the systems and processes are emerging) lies not in increased sophistication of technology, but in increased understanding of human involvement with information.

These conclusions form the motivation and rationale for the study reported here and describe our interpretation of the significance of research in this area in general.

Background

The work reported here is the second phase of a larger long-term effort whose collective aim is to contribute to the formal characterization of the elements involved in information seeking and retrieving, particularly in relation to the cognitive context and human decisions and interactions involved in these processes. The first phase, conducted from 1981 to 1983 (under NSF grant IST80-15335), was devoted to development of appropriate methodology; the study resulted in a number of articles discussing underlying concepts, describing models and measures, and reporting on the pilot tests [9-18]. These articles explain the methodological background for the second phase.

The second phase is reported here. It was a study conducted from 1985 to 1987 (under grants listed under title) devoted to making quantified observations on a number of variables as described below. To our knowledge this is the largest and most comprehensive study in this area conducted to date. Still and by necessity (due to the meager state of knowledge and observations on the variables involved) this is an exploratory study with all the ensuing limitations. The results are really reflective of the circumstances of the experiments alone. While at the end generalizations are made, they should be actually treated as hypotheses ready for verification, refutation, and/or elaboration.

The third phase (planned for 1987 to 1989) will have as its objective to study in depth the nature, relations and patterns of some of the critical variables observed here. In this way we are trying to proceed (or inch) along the classic steps of scientific inquiry.

Organization of Reporting

A comprehensive final report to NSF deposited with ERIC and NTIS [19] provides a detailed description of models, methods, procedures, and results; a large appendix to the report contains the raw data and forms and flowcharts for procedures used. Thus, for those interested there is a detailed record of the study, particularly in respect to procedures and data.

In this journal, the study is reported in three articles or parts. This first part is devoted to description of models, measures, variables and procedures involved and relates the study to other works. The second part, subtitled "Users and Questions," presents the results of classes of variables that are more closely associated with information seeking; included in Part II are also results related to effectiveness measures. The third part, subtitled "Searchers and Searches," is devoted to the classes of variables that are more closely associated with information retrieving. A summary of conclusions from the study as a whole is also presented in Part III.

Objectives and Approach

As mentioned, the aim of the study was to contribute to the formal, scientific characterization of the elements involved in information seeking and retrieving, particularly in relation to the cognitive context and human decisions and interactions involved. The objectives were to conduct the observations under as real-life conditions as possible related to: (1) user context of questions in information retrieval; (2) the structure and classification of questions; (3) cognitive traits and decision-making of searchers; and (4) different searches of the same question.

The following aspects of information seeking and retrieving were studied as grouped in five general classes of entities involved:

1. User: effects of the context of questions and constraints placed on questions.
2. Question: structure and classification as assigned by different judges and the effect of various classes on retrieval.
3. Searcher: effects of cognitive traits and frequency of online experience.
4. Search: effects of different types of searches; overlap between searches for the same question in selection of search terms and items retrieved; efficiency and effectiveness of searches.
5. Items retrieved: magnitude of retrieval of relevant and nonrelevant items; effects of other variables on the chances that retrieved items were relevant.

The approach was as real-life as possible (rather than laboratory) in the following sense:
• effects of various types of search questions and various user goals on searching, by Rouse and Rouse [41]
• observation of end user search behavior in an operational setting, by Sewell and Teitelbaum [42]
• conceptualization and test of the search process as an imperfect decision-making task, by Fischhoff and MacGregor [43]

The study reported here is closely related by type to the empirical studies reviewed above and as a result it builds on these studies.

General Model

Complex systems such as information retrieval systems (or information systems in general) can be modeled and studied in a number of ways. In the 1960s and 1970s the emphasis was on study and evaluation of input processes and components, such as various representations of documents (or texts) and subsequent retrieval effectiveness. In the 1980s a shift occurred toward study of output processes,

User (information seeker) has a problem which needs to be resolved
    * User characteristics
    * Problem statement
User seeks to resolve the problem by formulating a question and starting an interaction with an information system
    * Question statement
    * Question characteristics
Presearch interaction with a searcher, i.e. a human or computer intermediary
    * Searcher characteristics
    * Question analysis
Formulation of a search
    * Search strategy
    * Search characteristics
Searching activity and interactions
    * Searching
(Possible: initial evaluation of results and reiterative searching)
    (* Adjusted search)
Delivery of responses to user
    * Items retrieved
    * Formats delivered
Evaluation of responses by user
    * Relevance
    * Utility

FIG. 1. A general model of information seeking and retrieving.
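
For readers who want to work with the model programmatically, the events and variable classes of Fig. 1 can be written down as an ordered list. The following is only a minimal sketch in Python: the stage descriptions and variable names are taken from the figure, while the pairing of each stage with its variables follows one reading of the figure layout, and the names `GENERAL_MODEL` and `variables_for` are hypothetical and not part of the study.

```python
# Illustrative sketch only. Stage texts and variable names come from Fig. 1;
# the list-of-tuples layout and the helper name are assumptions of this sketch.
GENERAL_MODEL = [
    ("User (information seeker) has a problem which needs to be resolved",
     ["User characteristics", "Problem statement"]),
    ("User seeks to resolve the problem by formulating a question and "
     "starting an interaction with an information system",
     ["Question statement", "Question characteristics"]),
    ("Presearch interaction with a searcher, i.e. a human or computer intermediary",
     ["Searcher characteristics", "Question analysis"]),
    ("Formulation of a search",
     ["Search strategy", "Search characteristics"]),
    ("Searching activity and interactions",
     ["Searching"]),
    ("(Possible: initial evaluation of results and reiterative searching)",
     ["Adjusted search"]),
    ("Delivery of responses to user",
     ["Items retrieved", "Formats delivered"]),
    ("Evaluation of responses by user",
     ["Relevance", "Utility"]),
]


def variables_for(keyword):
    """Return the variable classes of every stage whose text contains keyword."""
    keyword = keyword.lower()
    return [
        variable
        for stage, variables in GENERAL_MODEL
        if keyword in stage.lower()
        for variable in variables
    ]
```

Calling `variables_for("delivery")`, for example, returns the variable classes listed beside the delivery stage. Keeping the stages in a list rather than a dictionary preserves the order of events shown in the figure.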
Do you want to restrict the years of publication of the article retrieved (last 5 years; no limits; specify years: 19__ to 19__)?

Structure of Questions

As a rule, questions in information retrieval consist of three parts: a lead-in, a subject, and a query.

The lead-in is not directly searchable. It may consist of phrases such as: "I want information about . . ." However, at times a lead-in may have implications for searching; for instance "what is . . ." implies a request for definition, "where is . . ." implies a request for location, and "when was . . ." implies involvement of a time element. In such cases, lead-ins are important for recognizing presuppositions in a question, i.e. implications not directly expressed (see discussion below).

2. Clarity: classifies a question on the basis of degree of clarity in respect to (a) semantics (meaning of terms) and (b) syntax (relation and logic between terms).
Measure: for a question the classifier indicates on a scale from 1 to 5 the degree of clarity, where 1 means unclear and 5 means clear. Two scales are used:
(a) For semantic clarity (meaning of terms).
(b) For syntactic clarity (relation and logic between terms).

3. Specificity: classifies a question on the basis of degree of specificity of (a) query terms and (b) subject terms. Specificity ranges from very general or broad (e.g. as found in a thesaurus under BT-Broader terms) to very specific or narrow (e.g. as found in a thesaurus under NT-Narrower terms).
Measure: for a question the classifier indicates on a scale from 1 to 5 the degree of specificity, where 1
A large number of environmental (or external) and cognitive (or internal) factors, e.g. organizational setting, economic constraints, affect searchers' decisions and thus retrieval effectiveness and efficiency. While recognizing the external factors, we have concentrated on a limited number of cognitive traits of searchers:

• Language ability or the ability to make inductive inferences through word association, as measured by a standard test called Remote Associates Test.
• Logical ability or the ability to make deductive inferences as measured by a standard test called Symbolic Reasoning Test.

LSI is based on a theory describing learning as an integrated, four stage process that involves the use of four different cognitive modes as described by Kolb [52]: (i) Concrete Experience (CE); (ii) Reflective Observation (RO); (iii) Abstract Conceptualization (AC); (iv) Active Experimentation (AE). LSI claims to measure individual preferences for each of the above four basic modes for learning and places an individual in a composite category indicative of his/her learning style. The test has been widely applied and tested for over ten years. The respondent is asked to rank order from 1 to 4 a series of four statements in response to a question on how he/she learns. There are twelve sets to rank within twenty minutes; here are two examples
The amount and type of experience a searcher had in online searching was obtained from a questionnaire. Since DIALOG was used for searching, the questions asked were specific to the DIALOG system:

1. How often do you search DIALOG?: daily, twice a week, once per week, twice per month, less.
2. Indicate seven DIALOG databases you search most often in order of decreasing use.
3. Indicate thesaurus most important to you when you search.

The answer to the first question has been used as a searcher variable. The answers to the last two questions have been used to match the searchers with databases they are to search in the study, so that their background and the databases match as closely as possible.

Type 1. From a statement about the problem at hand as tape recorded by the user, but without recourse to the written question.
Type 2. From the taped problem statement and the written question submitted by the user.
Type 3. From the written question using only the words in the question as search terms without any further elaboration.
Type 4. From the written question plus terms from an appropriate thesaurus for elaboration.

These four types of searches we labeled project searches.

Search Strategy

We have concentrated on three aspects of the search statement as a whole:
• prepared a preliminary search strategy
• conducted the search and recorded the results. (The whole search was recorded on a disc and a printout).

No restrictions, such as time limits, were placed on searchers in preparation for and conduct of the search.

In addition, three searchers from the full time staff of the project were engaged in searching so-called project searches, as described below. These became known as project searchers. Thus, a total of thirty-nine searchers participated in the study.

Searching

Searching was done on DIALOG, the largest vendor in the world as to the number of databases and frequency of use. More searchers have more professional experience with DIALOG than with any other system, which was the reason for selecting it.

Each question was searched on one database only. The database was selected by the project team on the basis of closeness of fit between the question and the subject of the database. The searchers were assigned questions which matched as closely as possible their own database experience. Searching was done on microcomputers with preprogrammed log-on and downloading protocols. Appropriate thesauri and manuals were assembled and made available to all searchers. All searching was done in the same room and environment, and under the same conditions.

Project Searches

As mentioned, each question was searched by five outside searchers. In addition, there were four searches done in-house by the project staff. These additional searches were labeled project searches, and the staff searchers as project searchers. Thus, nine searches were done for each question: five by outside searchers and four by project searchers.

2. To observe the effects of different question classes.

For the first objective a separate experiment was conducted. It consisted of testing the consistency or degree of agreement in assigning question structure and classification classes by a number of judges. On the basis of advertising, a group of twenty-one information professionals was assembled to assign the question structure and classification. These were a different group from the searchers and had nothing to do with searching. They are called classification judges. The experiment consisted of each judge:

• receiving twenty questions for judging; these were selected at random from the forty questions submitted by users
• assessing the question structure on an appropriate form
• classifying each question as to domain, clarity, specificity, complexity, and presuppositions.

The results of classification were calculated for agreement. The classes (and questions in these classes), with significant agreement, were then used to analyze chances of retrieval of relevant answers, i.e. to address the second objective of this part of the study.

Prior to conducting the experiment with twenty-one judges, a pilot experiment was carried out with two other judges to observe if the procedures worked. They did. In the Final Report [19] we report results only on the two judges in the pilot study and not on the twenty-one judges in the experiment proper (the report was written before this part of the study was completed). Here we are reporting the results on twenty-one judges only, i.e., we are disregarding the pilot study.

Evaluation by Users

Here is a summary of steps involved in evaluation:

• the end result of each search was a list of accession numbers of items retrieved