Fact-checked by Grok 2 weeks ago

Thesaurus

A thesaurus is a that organizes words into groups based on their meanings, typically listing synonyms, antonyms, and related terms to aid writers, speakers, and researchers in selecting precise and exploring linguistic connections. Unlike a , which is arranged alphabetically and focuses on definitions, a thesaurus employs an onomasiological approach, starting from concepts to find associated words, thereby functioning as a for semantic and stylistic variation. The word "thesaurus" originates from the Latin thesaurus, borrowed from the ancient Greek thēsauros (θησαυρός), meaning "treasure," "treasury," or "storehouse," reflecting its role as a repository of linguistic riches. Early precursors to modern thesauri appeared in ancient texts, such as Greek and Roman compilations of synonyms, but the contemporary form emerged in the 19th century with Peter Mark Roget (1779–1869), a British physician, natural theologian, and polymath who developed his systematic classification of English words to address what he saw as the limitations of alphabetical dictionaries in capturing relational meanings. Roget's Thesaurus of English Words and Phrases, Classified and Arranged So As to Facilitate the Expression of Ideas and Assist in Literary Composition, first published in 1852, organized entries into hierarchical categories of ideas rather than strict synonym lists, influencing countless subsequent editions and adaptations that remain in print today. Thesauri have evolved into diverse types, including general-purpose volumes for everyday writing, such as updated Roget editions maintained by Roget's descendants or competitors like the ; specialized thesauri for fields like or ; and digital resources like , a computational developed at in the 1980s that structures words in semantic networks for and applications. Historical thesauri, such as the , extend this tradition by mapping word usage across centuries to trace semantic shifts and cultural changes, underscoring the thesaurus's enduring value in linguistic scholarship and education.

Origins and Development

Etymology

The word thesaurus originates from the term thēsauros (θησαυρός), which denotes a "," "," or "storehouse." This root word, possibly derived from the Proto-Indo-European *dʰeh₁- ("to put" or "place"), carried connotations of a secure repository for valuables, both literal and figurative. In classical , including works by , thēsauros extended metaphorically to signify a storehouse of or intellectual riches, emphasizing the accumulation and preservation of wisdom. Adopted into Latin as thesaurus, the term retained its primary sense of a or collection throughout and the medieval period, often applied to compilations of or resources. By the , it began appearing in titles of scholarly works, such as Mario Nizzoli's 1535 Thesaurus Ciceronianus, a cataloging words from Cicero's writings, marking an early association with linguistic collections. The contemporary meaning of thesaurus as a reference book of synonyms and related terms emerged in the , largely through physician and philologist Peter Mark Roget's 1852 publication, Thesaurus of English Words and Phrases, Classified and Arranged so as to Facilitate the Expression of Ideas and Assist in Literary Composition. Roget's work transformed the term from a general repository into a structured "" of , influencing its standardized modern usage.

Historical Evolution

The concept of a thesaurus traces its origins to ancient systems and compilations that organized and thematically. Aristotle's Categories, composed around 350 BCE, provided an early by enumerating ten fundamental categories of predication—such as substance, quantity, and relation—offering a systematic approach to linguistic and conceptual organization that influenced subsequent semantic tools. In the Roman period, Nonius Marcellus's De Compendiosa Doctrina, written in the early , assembled excerpts from over 200 Latin authors under topical headings covering , , and daily life, serving as a precursor to topical thesauri through its structured aggregation of related terms and phrases. Medieval and Renaissance scholars advanced these ideas by emphasizing vernacular expression and lexical precision. Dante Alighieri's De Vulgari Eloquentia, composed between 1303 and 1305, advocated for the use of Italian vernacular in while analyzing word selection for poetic effect, including implicit discussions of synonyms to achieve stylistic elevation, marking an early step toward organized synonymy in European linguistics. By the early , English developments included John Harris's Lexicon Technicum: Or, An Universal English Dictionary of Arts and Sciences (), which explained technical terms alongside related concepts and synonyms, bridging encyclopedic reference with lexical grouping in a way that prefigured dedicated synonym works. The modern thesaurus emerged in the 19th century with Peter Mark Roget's Thesaurus of English Words and Phrases, Classified and Arranged so as to Facilitate the Expression of Ideas and Assist in Literary Composition (1852), which introduced a system dividing words into six primary classes—abstract relations, space, matter, intellect, volition, and affection—drawing on taxonomies and Aristotelian principles to group synonyms and antonyms thematically rather than alphabetically. This innovation shifted the focus from mere lists to conceptual networks, profoundly impacting linguistic resources. In the , thesauri underwent significant expansions and standardization. The 1911 edition of Roget's work, edited by C. O. Sylvester Mawson, reorganized the entries for greater accessibility while preserving the original classification, adding contemporary terms and refining cross-references to enhance usability. Concurrently, international efforts culminated in UNESCO's Guidelines for the Establishment and Development of Monolingual Thesauri for Information Retrieval (1971), which provided standards for constructing controlled vocabularies in indexing systems, emphasizing hierarchical relationships, synonym control, and scope notes to support in libraries and databases.

Structural Organization

Alphabetical Formats

Alphabetical formats in thesauri organize entries by headwords arranged in standard dictionary-like sequence, where each primary term serves as the followed by grouped lists of synonyms, antonyms, related terms, and sometimes idiomatic expressions. This structure facilitates direct access to lexical variants without requiring through conceptual categories, making it a linear, word-centric approach to synonym discovery. The primary advantage of this organization lies in its familiarity and efficiency for users accustomed to dictionary navigation, enabling rapid lookup of specific words and their alternatives through simple alphabetical scanning. For instance, the 1936 edition of of the English Language in Dictionary Form, published by in the United States, exemplifies this format by presenting synonym clusters and antonym notes in strict , marking an early American adaptation for quick reference. Similarly, Merriam-Webster's Collegiate Thesaurus, first issued in 1976 and revised in subsequent editions, employs this method to include synonyms, antonyms, and related words alongside brief definitions to clarify shared meanings. However, alphabetical formats have limitations in supporting intuitive exploration of semantic relationships, as terms are isolated by rather than meaning, potentially hindering users seeking broader conceptual connections. This reflects a historical shift in thesaurus editions after , where publishers increasingly prioritized alphabetical arrangements over classified systems to enhance , as seen in the transition from Roget's original 1852 conceptual model to dictionary-form versions by . Typical entry structures in these formats begin with a bolded headword, followed by bulleted or numbered lists of synonyms categorized by nuance (e.g., formal vs. informal), antonyms in a separate , and usage providing contextual examples or warnings about connotations. Cross-references, such as "see also" pointers to related headwords, further link entries, allowing limited while maintaining the alphabetical backbone. In contrast to conceptual formats that emphasize thematic grouping, this design prioritizes precision in word substitution over idea exploration.

Conceptual and Thematic Formats

Conceptual and thematic formats in thesauri organize vocabulary around abstract ideas, semantic categories, and relational networks, prioritizing conceptual interconnections over alphabetical sequencing. This approach structures entries into broad classes or themes that group synonyms, related terms, and antonyms under overarching concepts, facilitating a deeper exploration of meaning beyond isolated words. For instance, Peter Mark Roget's original 1852 Thesaurus of English Words and Phrases classified approximately 1,000 conceptual categories into six primary divisions, such as "Abstract Relations" encompassing subcategories like "Existence" and "Relation," to reflect the universe's semantic architecture. Central to these formats are hierarchical and associative elements that map relationships between terms. Broader terms () represent superordinate concepts, while narrower terms () denote specific subtypes or instances, forming a tree-like structure where, for example, "animal" serves as a for "," which in turn is a for "." Associative links connect non-hierarchical but semantically related terms, such as "" to "antonym," enabling cross-references across themes. Polyhierarchies allow multifaceted concepts to have multiple , accommodating like "apple" linking to both "fruit" (in ) and "" (in ), which enhances flexibility in knowledge representation. These relations align with standards like ISO 25964, which defines hierarchical (/) and associative () links to ensure in indexing systems. A prominent example of this format in practice is the Art & Architecture Thesaurus (AAT), developed by the Getty Research Institute starting in the and refined through the . The AAT employs a system integrated with hierarchies, dividing content into eight facets—such as "Associated Concepts" for abstract ideas like "style" and "Physical Attributes" for materials—allowing users to navigate from broad themes to precise terms like "" under multiple relational paths. This structure supports indexing in art historical databases by emphasizing thematic depth over linear word lists. The benefits of conceptual and thematic formats lie in their support for exploratory knowledge discovery, as users can traverse semantic networks to uncover interconnections that alphabetical arrangements might obscure, thereby reducing biases toward common or literal word usages. This organization promotes conceptual exploration in fields like , where thematic clustering aids in disambiguating polysemous terms and expanding queries. Evolving from Roget's class-based system, modern thematic thesauri draw inspiration from ontologies, incorporating formal semantics and machine-readable relations as seen in SKOS extensions, to bridge traditional with computational graphs.

Handling Contrasting Senses and Synonyms

In thesauri, synonyms are managed through equivalence relations that link preferred terms to non-preferred terms, ensuring consistent indexing and retrieval. The preferred term serves as the primary descriptor, while non-preferred terms, including synonyms, are directed to it via "USE" references (from the non-preferred to the preferred) and "USED FOR" (UF) entries (from the preferred to the non-preferred). To group synonyms by nuance, thesauri employ scope notes or explanatory text to delineate contextual differences; for instance, under the preferred term "happy," synonyms like "joyful" may be grouped for emotional intensity, while "content" is distinguished for a state of satisfaction, preventing conflation in retrieval applications. This approach aligns with international standards that emphasize clarity in semantic mapping. Contrasting senses, particularly in polysemous or homonymous words, are handled by establishing separate concept entries to avoid ambiguity. For homonyms like "bank," one sense (financial institution) is assigned a distinct entry with its own relations, while another (river edge) receives a separate entry, often using codes, subentries, or compound pre-coordinated terms (e.g., "river bank") to disambiguate. Scope notes further clarify each sense's domain, such as specifying "bank (finance)" versus "bank (geography)," ensuring users select appropriate terms without cross-contamination in searches. This separation is a core recommendation in thesaurus construction to maintain monosemy where possible. Antonyms and related terms are typically incorporated via associative relations, denoted as "related terms" (RT), to highlight contrasts or gradations without implying . For example, "hot" may list "cold" as an antonym under RT, with near-synonyms like "warm" shown in gradations to indicate partial opposition or similarity. While not all thesauri mandate antonyms, they are explicitly listed when relevant to conceptual contrast, aiding in broader semantic navigation. Hyponyms (narrower terms, NT) and hypernyms (broader terms, BT) extend this by embedding words within hierarchies, such as "hot" as a hyponym of "," providing relational depth. Additional elements enhance precision, including usage labels for terms like "formal," "archaic," or "slang" to guide appropriate application, and notes for idioms (e.g., treating "kick the bucket" as a non-preferred term under "die" with a dedicated scope note). These features, along with hyponyms and hypernyms, follow standardized protocols for consistency across entries. Such methods integrate seamlessly into both alphabetical and conceptual thesaurus formats, supporting varied user needs.

Types and Variations

Monolingual Thesauri

Monolingual thesauri serve as specialized lexical tools confined to a single , primarily aiding in the identification of , antonyms, and related terms to enrich and support precise expression. Unlike broader dictionaries, they emphasize semantic relationships over or pronunciation, functioning as controlled vocabularies that map conceptual networks within the language's idiomatic framework. Prominent examples include in English, originally compiled in 1852 to group words by ideas for writers seeking expressive alternatives, and the Trésor de la Langue Française informatisé (TLFi), a comprehensive French resource from the 19th and 20th centuries that facilitates exploration through its detailed historical entries on over 100,000 terms. In specialized domains, the (MeSH) exemplifies a monolingual thesaurus tailored to English biomedical , indexing millions of articles with hierarchical descriptors to ensure consistent . Design principles of monolingual thesauri prioritize semantic coherence, establishing explicit relationships such as equivalence (synonyms), (broader/narrower terms), and (related concepts) to reflect the language's unique structures, including idioms and cultural nuances that shape meaning. For instance, Roget-style thesauri organize entries thematically to capture contextual subtleties, avoiding rigid alphabetical listings in favor of conceptual clusters that align with native speakers' intuitive associations. Domain-specific adaptations, like MeSH's tree structures with over 30,000 descriptors updated annually to incorporate evolving , ensure relevance without redundancy by selecting terms based on common English usage in . These features enable thesauri to address language-specific variations, such as phrasal idioms in English or culturally embedded expressions in , while maintaining a standardized yet flexible framework. In usage contexts, monolingual thesauri function as essential writing aids, helping authors diversify phrasing and avoid repetition, as seen in Roget's enduring role since its . They support by enhancing vocabulary acquisition and , particularly for learners navigating a language's nuances through synonym mapping. In , they serve as foundational references for compiling dictionaries, providing relational data that informs entry organization and sense disambiguation. The shift to digital formats has amplified their utility, with searchable online versions like the TLFi offering advanced query functions for rapid term exploration and integration into writing software. Similarly, MeSH's browser enables precise retrieval in academic and professional settings, evolving from to dynamic tools with updates. Key challenges in monolingual thesaural development include balancing with the inclusion of and regional variations, which can fragment semantic unity if overemphasized. For example, English thesauri like Roget's often prioritize formal or general usage, sidelining dialectal terms from regions like the American South or dialects to preserve core relational integrity, though this risks excluding dynamic, culturally vital expressions. In resources such as the TLFi, historical aids stability but complicates incorporating contemporary without cross-referencing evolving usages. Domain-specific thesauri like mitigate this by restricting scope to standardized scientific English, yet still face updates for emerging in global biomedical discourse. Overall, these issues demand ongoing curation to reflect a language's living diversity without undermining retrieval efficacy.

Bilingual and Multilingual Thesauri

Bilingual and multilingual thesauri extend the principles of term organization beyond a single by establishing mappings between synonyms, near-synonyms, and broader across linguistic boundaries, enabling the identification of equivalents and semantic correspondences. These structures typically align entries through relations, where a represented by a preferred in one , such as "" in English, is linked to corresponding terms like "democracia" in or "démocratie" in , while preserving hierarchical and associative relationships from monolingual bases. This cross-lingual alignment facilitates consistent representation of ideas in diverse linguistic contexts, often treating as language-independent nodes with multiple lexical realizations. Construction of these thesauri commonly involves leveraging parallel corpora—aligned texts in multiple languages—to extract and validate term pairs, followed by the formation of equivalence classes that group translation variants under a shared . For instance, statistical alignment techniques process bilingual texts to identify co-occurring terms, refining them into classes that account for variations in usage or . Handling non-equivalent terms, particularly culture-specific ones without direct counterparts, requires strategies such as borrowing the original term (e.g., the Danish "," denoting a sense of cozy , is often retained as a in English thesauri rather than translated) or providing descriptive approximations to approximate the . These methods ensure robustness but demand expert validation to avoid misalignment due to idiomatic or contextual differences. Notable examples include Eurodicautom, the European Commission's pioneering multilingual terminology database launched in 1975, which covered up to 12 official languages and supported translation by linking domain-specific terms across languages until its succession by IATE in 2004. In modern contexts, tools like OmegaT, an open-source application, incorporate bilingual glossaries and translation memories that function as dynamic thesauri, allowing users to manage and query term equivalents during localization workflows. These resources highlight the evolution from static databases to interactive systems for practical multilingual term handling. Applications of bilingual and multilingual thesauri span systems, where they provide lexical resources to resolve ambiguities and improve output fidelity by supplying aligned equivalents, and international indexing in global databases, enabling unified subject access across languages in institutions like the EU's portals. However, challenges persist, including the potential loss of idiomatic nuances during mapping, as cultural embeddings in phrases may not transfer seamlessly, leading to reduced precision in cross-lingual retrieval or .

Contemporary Applications

Role in Information Science

In information science, thesauri function as controlled vocabularies that standardize terminology for indexing and retrieving information in libraries and databases, ensuring consistency and precision in . A foundational example is the (LCSH), which originated in 1898 when the adopted the American Library Association's List of Subject Headings for Use in Dictionary Catalogs to support cataloging in its dictionary-based system. First published between 1910 and 1914, LCSH has since become a globally adopted thesaurus for subject access in library catalogs, facilitating the assignment of authorized terms to resources and improving retrieval accuracy. This historical role underscores thesauri's evolution as essential tools for managing large-scale information collections, from print-era catalogs to modern digital environments. Key functions of thesauri in information science include term normalization, which enforces the use of preferred descriptors over synonyms or variants to maintain uniformity; disambiguation, achieved through hierarchical and associative relationships that clarify multiple meanings of terms; and enabling faceted search in digital libraries by allowing users to navigate results via multifaceted categories like broader/narrower terms. These capabilities address challenges in vocabulary control, reducing retrieval noise and enhancing user access to relevant content. The ANSI/NISO Z39.19-2005 (R2010) standard establishes guidelines for constructing, formatting, and managing monolingual controlled vocabularies, including thesauri, to support interoperability and effective indexing in knowledge organization systems. Thesauri find practical applications in metadata tagging for archival collections and semantic interoperability across heterogeneous data systems, where standardized terms bridge disparate sources. For example, the Getty Art & Architecture Thesaurus (AAT), developed by the Getty Research Institute, provides hierarchical terminology for describing , , and cultural objects, aiding catalogers in tagging and archival records consistently. This enables cross-institutional and discovery in research, as seen in its integration with databases for precise resource description. Over time, thesauri have transitioned from static print indexes to dynamic digital ontologies, incorporating principles post-2010 to support applications. This evolution aligns with standards like ISO 25964, with revisions underway as of 2025, allowing thesauri to express complex relationships as RDF triples for machine-readable interoperability, thus addressing limitations in traditional indexing by enabling automated linking and enhanced data reuse across domains.

Integration with Natural Language Processing

Thesauri play a pivotal role in (NLP) by providing structured lexical knowledge that enhances computational understanding of language semantics. A seminal example is , a large lexical database developed starting in the 1980s at , which organizes English words into synsets—sets of cognitive synonyms—linked by semantic relations such as hypernymy and meronymy. This structure has influenced modern NLP models, including , where integrations combine WordNet's explicit relations with BERT's contextual embeddings to improve tasks like by supplementing neural representations with relational knowledge. In practical applications, thesauri facilitate synonym expansion in search engines, where queries are augmented with related terms to broaden retrieval and improve ; for instance, Google's search incorporates rewriting to handle lexical variations. They also underpin measures, enabling the computation of term relatedness through path-based metrics in hierarchical structures, such as the Wu-Palmer method, which assesses similarity based on the depth of shared subsumers in a thesaurus . is often applied to representations derived from thesauri to quantify this relatedness efficiently. Additionally, in question-answering systems, thesauri enrich query processing by mapping user questions to synonymous or related concepts, thereby expanding answer candidates and boosting precision in retrieval. Google's Knowledge Graph exemplifies thesauri integration on a large scale, leveraging semantic relations akin to those in thesauri to connect entities and infer contextual links, which powers enhanced search results with structured knowledge. Advancements in the 2020s have incorporated lexical resources into large language models (LLMs) to aid word sense disambiguation. Bilingual thesauri further support multilingual NLP by enabling cross-lingual synonym mapping in disambiguation tasks. Despite these benefits, challenges persist in scalability for environments, where manual thesaurus maintenance struggles against the volume of textual corpora, and in handling dynamic changes, such as emerging or domain shifts that render static relations obsolete without automated updates.

References

  1. [1]
    Thesauri (Chapter 3) - The Cambridge Handbook of the Dictionary
    Oct 19, 2024 · A thesaurus is a book or other resource which groups words according to their meanings. It is an onomasiological dictionary: that is, a person ...
  2. [2]
    Thesauri - an overview | ScienceDirect Topics
    A thesaurus is defined as a resource that provides a list of words grouped together according to similarity of meaning, aiding in the selection of appropriate ...
  3. [3]
    THESAURUS Definition & Meaning - Merriam-Webster
    Oct 8, 2025 · Word History ; Etymology. New Latin, from Latin, treasure, collection, from Greek thēsauros ; First Known Use. circa 1823, in the meaning defined ...
  4. [4]
    Peter Mark Roget: physician, scientist, systematist; his thesaurus ...
    Peter Mark Roget (1779-1869) is best known for his Thesaurus, a project completed late in his long life. He trained as a physician, practiced medicine, and was ...
  5. [5]
    A History of Roget's Thesaurus: Origins, Development, and Design
    Nov 27, 2003 · This book gives the first history of its genesis and publication, and investigates the principles of its structural design.Missing: scholarly | Show results with:scholarly
  6. [6]
    (PDF) What's in a Thesaurus - ResearchGate
    We first describe four varieties of thesaurus: (1) Roget-style, produced to help people find synonyms when they are writing; (2) WordNet and EuroWordNet.
  7. [7]
    Historical Thesaurus - Start page - Oxford English Dictionary
    The Historical Thesaurus groups senses and words into categories, and orders them by date of first use. It functions as a taxonomic index of language history.
  8. [8]
    Thesaurus - Etymology, Origin & Meaning
    From Latin thesaurus, meaning "treasury, storehouse," originating from Greek thēsauros "treasure, chest," possibly from PIE root *dhe- "to put," or a ...
  9. [9]
    G2344 - thēsauros - Strong's Greek Lexicon (kjv) - Blue Letter Bible
    θησαυρός · a casket, coffer, or other receptacle, in which valuables are kept · a treasury · storehouse, repository, magazine.
  10. [10]
    The Cult of Cicero: Have Latinists Been Brainwashed? – Antigone
    Feb 3, 2022 · ... Thesaurus Ciceronianus by Mario Nizzòli (Nizolius) in 1535. This work was a complete dictionary of words used by Cicero. It allowed ...
  11. [11]
    A History of the Thesaurus - Book Riot
    Jan 18, 2022 · From the Greek thēsauros, meaning “treasury” or “storehouse.” Those in the Middle Ages used the word “Thesaurer” to refer to a treasurer. The ...
  12. [12]
  13. [13]
    Aristotle's Categories - Stanford Encyclopedia of Philosophy
    Sep 7, 2007 · In the Predicamenta, Aristotle discusses in detail the categories of substance (2a12–4b19), quantity (4b20–6a36), relatives (6a37–8b24), and ...
  14. [14]
    Nonius Marcellus, early 4th cent. CE? | Oxford Classical Dictionary
    Nonius Marcellus (early 4th cent. ce?), author of an encyclopaedic dictionary in twenty books (De compendiosa doctrina, ed. W. M. Lindsay, 3 vols. (1903); book ...Missing: thesaurus | Show results with:thesaurus
  15. [15]
    Introduction - Dante's Works - CUNY
    Dante wrote the De vulgari eloquentia (On the eloquence of the vernacular) sometime during the years 1304-1307, more than ten years after the Vita nuova.
  16. [16]
    John Harris Issues the First English Encyclopedia Arranged in ...
    Harris was the first to make the distinction between “word-books” (dictionaries) and “subject-books (encyclopedias).
  17. [17]
    [PDF] A history of Roget's thesaurus: Origins, development, and design
    Roget's Thesaurus is a unique topical dictionary of synonyms, integrating two types of dictionaries, and is a linguistics-based achievement.
  18. [18]
    Roget's Thesaurus by Peter Mark Roget - Project Gutenberg
    The text notes the first edition's derivation from a version published in 1911 while highlighting recent supplemental updates to include contemporary terms.
  19. [19]
    [PDF] Guidelines for Multilingual Thesauri - IFLA Repository
    Dec 12, 2008 · The WG initiated a project to draft new Guidelines for Multilingual Thesauri, to replace the 1976 UNESCO Guidelines for the Establishment.
  20. [20]
    Vocabulary Control
    Mar 17, 2017 · In an alphabetical thesaurus, the descriptors followed by their relationships are listed in alphabetical sequences. In a classified thesaurus, ...
  21. [21]
    The evolution of thesauri and the history of knowledge organization
    Abstract. Thesauri are considered as an optimum between maximum ontological modelling (best knowledge mapping) and minimal alphabetic ordering (less expensive ...
  22. [22]
    Publication History of Roget's Thesaurus
    The first edition was published in 1852. The first American attempt was in 1854. The 1st International Edition was in 1886. The 6th International Edition was ...
  23. [23]
    Merriam Webster's Collegiate Thesaurus - Amazon.com
    Alphabetical lists include more than 340,000 synonyms, antonyms, related and contrasted words, and idioms. Brief definitions describe the meanings shared by ...
  24. [24]
    Merriam-Webster's collegiate thesaurus. -- University of Wisconsin
    An alphabetically-arranged reference lists synonyms, related and contrasted words, idiomatic equivalents, and antonyms, and provides a concise definition ...
  25. [25]
    The Remarkable Roget's Thesaurus | Merriam-Webster
    More than just a collection of related words—Peter Mark Roget intended his Thesaurus to be a classification of all knowledge.
  26. [26]
    [PDF] Unlocking the Semantics of Roget's Thesaurus Using Formal ...
    Roget's Thesaurus is a semantic dictionary that is organized by con- cepts rather than words. It has an elaborate implicit structure that has not, in the 150 ...
  27. [27]
    SKOS Simple Knowledge Organization System Primer - W3C
    Aug 18, 2009 · ... thesauri [ ISO2788 ], SKOS supplies three standard properties: skos:broader and skos:narrower enable the representation of hierarchical links ...
  28. [28]
    Frequently Asked Questions about the USGS Thesaurus
    Hierarchy: A term always has an "is a" relationship with its broader term (BT); a narrower term (NT) can always be said to be "a type of", "a part of", or "an ...
  29. [29]
    [PDF] Art & Architecture Thesaurus ® - Getty Museum
    ... Facets are the upper levels of the AAT structure. • AAT is not organized by subject matter or discipline. AAT, the Art & Architecture Thesaurus®. The AAT is a ...
  30. [30]
    Art & Architecture Thesaurus (Getty Research Institute)
    The Art & Architecture Thesaurus© (AAT) is a structured vocabulary for describing and indexing the visual arts and architecture.
  31. [31]
    [PDF] Controlled Vocabularies - Getty Museum
    The hierarchical relationship is the primary feature that distinguishes a thesaurus or taxonomy from simple controlled lists and synonym rings. Hierarchical ...
  32. [32]
    A Dialectic Perspective on the Evolution of Thesauri and Ontologies
    Aug 30, 2025 · The purpose of this article is to identify the most important factors and features in the evolution of thesauri and ontologies through a ...
  33. [33]
    ISO 25964-1:2011 - Information and documentation — Thesauri and ...
    ISO 25964-1:2011 gives recommendations for the development and maintenance of thesauri intended for information retrieval applications.Missing: antonyms polysemy
  34. [34]
    Thesaurus (for information retrieval)
    The prime function of a thesaurus is to support information retrieval by guiding the choice of terms for indexing and searching. According to ISO 25964-1 ( ...
  35. [35]
    DISCOVERING SEMANTIC REGULARITY IN LEXICAL RESOURCES
    Abstract. We first describe four varieties of thesaurus: (1) Roget-style thesauruses, produced to help people find synonyms and antonyms when they are writ.
  36. [36]
    [PDF] Le Trésor de la langue française informatisé (TLFi) - ACL Anthology
    It is a dictionary of the 19th and 20th century vocabulary, in 16 volumes. The first volume was published in 1971 and the last one in 1993.
  37. [37]
    Introduction to MeSH
    ### Summary of MeSH as a Monolingual Thesaurus
  38. [38]
    [PDF] Thesauri in the Digital Ecosystem - JLIS.it
    Jan 15, 2022 · ABSTRACT. In recent years, thesauri have taken on new roles, new functions, and have shown some advantages over other knowledge.
  39. [39]
    [PDF] An industry perspective: dealing with language variation in Collins ...
    Sep 24, 2020 · Collins publishes a very wide range of monolingual and bilingual diction- aries, as well as language-learning courses and phrase books.
  40. [40]
    [PDF] Multilingual Equivalency: - Getty Museum
    Loan terms. • If there is no equivalent in the target language, one option = loan term. • loan term is a foreign word or phrase that is routinely used instead ...Missing: bilingual | Show results with:bilingual
  41. [41]
    [PDF] Using statistical methods to create a bilingual ... - UT Student Theses
    8.3.1 Equivalence classes. Defining equivalence classes of words using Morphological analysis may help the system in two ways. The number of entries of both ...
  42. [42]
    [PDF] Equivalence and Translation Strategies in Multilingual Thesaurus ...
    The fifth case is non-equivalence, when target language does not contain a term ... lot of the terminology is very specific and there may not be a precise ...
  43. [43]
    [PDF] The IATE Project - Towards a Single Terminology Database
    Nov 30, 2001 · So far the following databases have been imported into the EU term base: Eurodicautom (Commission), TIS (Council),. Euterpe (EP), Euroterms ...<|control11|><|separator|>
  44. [44]
    OmegaT - The Free Translation Memory Tool - OmegaT
    OmegaT is a translation memory application that works on Windows, macOS, Linux. It is a tool intended for professional translators.Download · OmegaT · Documentation · Resources
  45. [45]
    [PDF] Multilingual Thesauri and Ontologies in Cross-Language Retrieval
    Abstract. This paper sets forth a framework for the use of thesauri and ontoiogies as knowledge bases in cross-language retrieval. It.
  46. [46]
    Library of Congress Subject Headings (LCSH)
    Sep 19, 2023 · A subject heading system is an alphabetical list of terms used for indexing documents. It is different from a → thesaurus, which is also an ...Introduction · A brief history and description... · Criticisms of LCSH · Conclusion
  47. [47]
    The Role of Thesaurus Designers in Information Retrieval
    Apr 16, 2024 · The foundation of thesaurus design begins with thoroughly understanding the subject domain. Designers immerse themselves in the field's ...<|separator|>
  48. [48]
    Controlled Vocabularies, Taxonomy and Faceted Search - Claravine
    Faceted search is when the search tool is configured to present results based on facets and categories presented in the taxonomy. When a user types a term that ...<|control11|><|separator|>
  49. [49]
  50. [50]
    (PDF) Thesauri and Semantic Web: Discussion of the Evolution of ...
    Aug 6, 2025 · The evolution of thesauri toward their integration with the Semantic Web is examined. Elements and structures in the thesaurus standard, ISO ...
  51. [51]
    WordNet: A Lexical Database for English - ACL Anthology
    1992. WordNet: A Lexical Database for English. In Speech and Natural Language: Proceedings of a Workshop Held at Harriman, New York, February 23-26, ...
  52. [52]
    Integrating WordNet and BERT for Lexical Semantics in Natural ...
    Dec 31, 2021 · We propose an integration of BERT and WordNet to supplement BERT with explicit semantic knowledge for natural language understanding (NLU).Missing: influence | Show results with:influence
  53. [53]
    How a Search Engine Might Use Synonyms to Rewrite Search Queries
    Dec 29, 2008 · Google uses a few different methods to rewrite search queries, using things like stemming, synonyms and statistical machine translation.
  54. [54]
    Calculating the semantic distance between two documents using a ...
    Jul 5, 2023 · The Wu-Palmer measure, introduced in 1994, is a semantic similarity measure based on the depth of nodes in a hierarchical thesaurus, generally ...
  55. [55]
    [PDF] Extending Thesauri Using Word Embeddings and the Intersection ...
    Word embeddings provide an easy access to word relatedness via the cosine similarity measure. Kiela et al. proposed that during the training phase,. Page 3 ...
  56. [56]
    Enriching a thesaurus as a better question-answering tool and ...
    Mar 5, 2025 · The findings imply that thesauri enriched with semantic relations are useful in question-answering and modern information retrieval, although ...
  57. [57]
    Google Knowledge Graph and How it Works - Search Engine Journal
    Mar 31, 2021 · Knowledge Graphs can help search engines like Google leverage structured data about topics. Semantic data and markup, in turn, help to connect concepts and ...
  58. [58]
    Using Language Models to Disambiguate Lexical Choices in ...
    We evaluate recent LLMs and neural machine translation systems on DTAiLS, with the best-performing model, GPT-4, achieving from 67 to 85% accuracy across ...
  59. [59]
    [PDF] LLM-Aided Translation of SKOS Thesauri - arXiv
    Jul 29, 2025 · Multilingual thesauri, often used for metadata, en- able collective benefit, especially for those communities directly connected to the research ...
  60. [60]
    [PDF] Improvements in Automatic Thesaurus Extraction - ACL Anthology
    Unfortunately, thesauri are expensive and time- consuming to create manually, and tend to suffer from problems of bias, inconsistency, and limited coverage. In ...