Paper-M-108
28th May 2023
Efficiency of an IR system depends on the indexing language used
Its capability to handle 2 fundamentally different but interdependent types of relationship between the terms used to represent the subject content of the document
Thesaurus has mainly been conceived in the context of post-coordinate indexing system but it can be used for pre-coordinated system as well
The word ‘Thesaurus’ is of Greek origin, literally meaning treasury or storehouse of knowledge
But in modern usage, it denotes a list of terms arranges according to their relationship of ideas
1. Unesco Thesaurus: A Structured List of Descriptors for Indexing and Retrieving Literature in the Fields of Education, Science, Social and Human Science, Culture, Communication and Information
2. Thesaurus of ERIC Descriptors
3. Thesaurus of Sociological Research Terminology
4. Thesaurus of Sociological Indexing Terms
5. Social Science and Business Microthesaurus: A Hierarchical List of Indexing Terms Used by NTIS
6. Political Science Thesaurus
7. SPINES Thesaurus: A Controlled and Structured Vocabulary of Science and Technology for Policy Making
8. Thesaurus of Psychological Index Terms Thesaurus of Engineering and Scientific Terms (TEST)
9. INSPEC Thesaurus
10. NASA Thesaurus
11. Thesaurus of Computing Terms
12. Thesaurus of Scientific, Technical and Engineering Terms
13. International Road Research Documentation (IRRD) Thesaurus
14. ASIS Thesaurus of Information Science and Librarianship
15. Thesaurus of Information Science Terminology
16. Food: Multilingual Thesaurus
17. Thesaurus of Agricultural Terms
18. The ISDD Thesaurus. Keywords Relating to Non-Medical Use of Drugs and Drug Dependence
A thesaurus in the field of information storage and retrieval is a list of terms and/or of other signs (or symbols) indicating relationships among these elements, provided that the following criteria hold:
the list contains a significant proportion of non-preferred terms and/or of preferred terms not used as descriptors;
terminological control is intended.
Peter Mark Roget first conceived the idea of such compilation and brought out this Thesaurus of English words and Phrases in 1852 for the benefit of writers for looking for appropriate words to express their ideas
Helen Brownson is said to be the first person who used the term ‘Thesaurus’ in the context of IR in a paper presented in 1957 at the Dorking Conference in Classification Research
Hans P. Luhn was possible the first person to think in terms of ‘Information Retrieval Thesaurus’
The first Thesaurus used in an IR system was developed by Du Pont in US around 1959
| Year | Event |
|---|---|
| 1959 | Engineering Information Center of E. I. Dupont de Nemours developed the first true thesaurus |
| 1960 | Armed Services Technical Information Agency (ASTIA) produced the Thesaurus of ASTIA Descriptors |
| 1961 | American Institute of Chemical Engineers (AIChE) published the Chemical Engineering Thesaurus |
| 1964 | Engineers Joint Council (EJC) published the Thesaurus of Engineering Terms |
| 1967 | Thesaurus of Engineering and Scientific Terms (TEST) |
| 1967 | Committee on Scientific and Technical Information (COSATI) published the first set of guidelines for thesaurus construction |
| 1970 | Unesco Guidelines for the Establishment and Development for Monolingual Scientific and Technical Thesaurus |
| 1974 | American National Standards Institute (ANSI) Z39.19 - a US national standard for thesaurus construction |
| 1974 | First international standard for thesaurus construction – ISO 2788 |
To provide a map for a given field of knowledge indicating how the concepts of ideas are related to each other, which helps the indexer and searcher to understand the structure of the field of knowledge
To provide a standard vocabulary for a given subject
It provides consistent representation of the subject matter avoiding subject dispersion in output & input by controlling synonyms, quasi-synonyms, & homographs
Bringing together the term which are semantically related
To limit the no. of term that assign to be a document
To serve as search aid in retrieval
Its purpose is to promote consistency in the indexing of documents predominantly for post-coordinated ISAR
To facilitate searching by linking entry terms with descriptors
authors, indexers, and users into a controlled vocabulary used for indexing and retrieval
Consistency: To promote consistency in the assignment of index terms
Indication of Relationships: To indicate semantic relationships among terms
Retrieval: To serve as a searching aid in retrieval of documents
| LSH | Thesaurus |
|---|---|
| It is a complete list of names of subjects usually arranged in alphabetical order | It is a list of terms arranged in a helpful order. In other words, it is a compilation of all isolate ideas that occur within a subject or group of subjects arranged in alphabetical order |
| LSH | Thesaurus |
|---|---|
| LSH were designed in view the needs of pre-coordinate indexes | Thesaurus was designed to meet the special needs of the post-coordinate indexes |
| In LSH, the emphasis is on references from broader to narrower subject or downward references | Thesauri have more elaborate network of referencing, the direction of each type of reference is clearly indicated |
It is the relationship between preferred & non-preferred terms where 2 or more terms are regarded for indexing purposes as referring to the same concept
It is denoted by USE and UF
It includes synonyms, lexical variants, quasi-synonyms, & upward posting


This relationship shows level of super-ordination and sub-ordination
It is used in locating broader and narrower concepts in a logically progressive sequence
The relationship is reciprocal and is set out in a thesaurus using following conventions:
It includes generic relationship, hierarchical whole-part relationship, instance relationship, & polyhierarchical relationship
It identifies the link between a class or category and its members or species
It is also known as Inclusion Relationship
EG.: VERTEBRATA

This relationship is between a general category of things and events, expressed by a common noun, and an individual instance of that category, the instance then forming a class of one which is represented by a proper name
EG.: SEAS
NT Baltic Sea
Caspian Sea
Mediterranean Sea
It is the relationship between the term & its 2 or more super-ordinate terms
Some terms may belong to more than one hierarchy and consequently maybe related to more than one broader term & more than one set of narrower terms
EG.: Mamalia —Whale—Marine Animals
This relationship is found between terms which are closely related conceptually but not hierarchically and are not members of an equivalence set
The relation is reciprocal, and is distinguished by the abbreviation “RT” (Related Terms)
EG. TEACHING
TEACHING AIDS
Terms belonging to the same category (e.g., motorcycle / bicycle)
Terms belonging to different categories
1. Whole-part (e.g., buildings / doors)
2. A discipline and the objects studied (e.g., ethnography / primitive societies)
3. An operation or process and the agent or instrument (e.g., motor racing / racing cars)
4. An occupation and the person in that occupation (e.g., accountancy / accountants)
5. An action and the product of the action (e.g., publishing / music scores)
6. An action and its patient (e.g., data analysis / data)
7. Concepts related to their properties (e.g., women / femininity)
8. Concepts linked by causal dependence (e.g., injury / accidents)
9. A thing or action and its counter-agent (e.g., pests / pesticides)
10. A raw material and its product
11. An action and a property associated with it (e.g., precision measurement / accuracy)
12. A concept and its opposite (e.g., single people / married people)
The concept was developed by Jean Aitchison in 1969
It is a multipurpose retrieval language tool
It consists of 2 sections:
Alphabetical - in which descriptors and cross references are arranged in alphabetical order
Systematic or Classified - in which descriptors are arranged in their hierarchical order with level of hierarchy represented by indentions, dashes, dots, etc.
Graphic – in which the hierarchy is shown by a tree or an arrowgraph
Discuss the usefulness of a thesaurus in ISAR
Discuss the need & purpose of a thesaurus in indexing & searching. Discuss with suitable examples, the various types of relationship among the terms in a thesaurus
Discuss, the thesaurus as a tool in information organizing & retrieval and their relationship among the terms found in a thesaurus