Knowledge engineering
Knowledge engineering is a subdiscipline of artificial intelligence focused on the acquisition, representation, validation, and application of specialized human knowledge within computer systems to solve complex, domain-specific problems that typically require expert-level expertise.[1] It involves the systematic design, development, and maintenance of knowledge-based systems (KBS), such as expert systems, which emulate human reasoning processes to provide intelligent decision support.[2] At its core, knowledge engineering bridges human cognition and computational processes, transforming tacit expert insights into explicit, machine-readable formats such as rules, ontologies, and semantic models.[3]

The field originated in the 1970s alongside early expert systems, such as MYCIN for medical diagnosis, marking a shift from general-purpose programming toward knowledge-driven AI.[3] By the 1980s, knowledge engineering gained prominence as a distinct engineering practice, emphasizing the challenges of eliciting high-quality knowledge from domain experts amid uncertainties in process requirements.[1] Its evolution through the 1990s and 2000s incorporated ontologies, the Semantic Web, and linked data, expanding beyond isolated expert systems to interconnected knowledge networks.[2] Since the late 2010s, the integration of large language models (LLMs) has reshaped the discipline, enabling hybrid neuro-symbolic approaches that automate knowledge extraction from natural language and improve scalability in knowledge generation and maintenance.[3]

Central processes in knowledge engineering include knowledge acquisition, in which experts' insights are gathered through methods such as structured interviews, observation, and protocol analysis; knowledge representation, which uses formal structures such as production rules, frames, semantic networks, or ontologies to encode relationships and inferences; and knowledge validation and maintenance, which ensures accuracy, consistency, and adaptability through iterative testing and refinement.[1] These steps form a cyclic, iterative modeling process that integrates elements from software engineering, cognitive science, and data mining to build robust KBS.[2] Problem-solving methods (PSMs) and conceptual modeling further guide the structuring of knowledge for reusable, domain-independent applications.[1]

Knowledge engineering plays a pivotal role in advancing AI by enabling inference-based reasoning, knowledge management, and intelligent automation across diverse fields, including medicine, engineering, finance, and biodiversity informatics.[3] Its importance lies in addressing the "knowledge bottleneck" in AI development: human expertise is formalized to create scalable systems that support decision-making under uncertainty and facilitate human-machine collaboration.[1] With the rise of LLMs, contemporary knowledge engineering has become more accessible, allowing non-experts to contribute to knowledge bases while preserving symbolic rigor for reliable AI outcomes.[3]

Overview
Definition and Scope
Knowledge engineering is the discipline that involves eliciting, structuring, and formalizing knowledge from human experts to develop computable models capable of emulating expert decision-making in intelligent systems.[4] This process integrates human expertise and reasoning into computer programs, enabling them to address complex problems that traditionally require specialized human judgment.[5] At its core, it emphasizes the transfer of domain-specific knowledge to create systems that reason and solve problems in a manner analogous to human experts.[6]

The scope of knowledge engineering centers on human-centric approaches to knowledge transfer, where explicit rules and heuristics derived from experts are encoded into systems, distinguishing it from data-driven methods in artificial intelligence that rely primarily on statistical patterns in large datasets.[7] For instance, rule-based systems in knowledge engineering apply predefined if-then rules to mimic expert logic, whereas statistical learning techniques, such as those in machine learning, infer behaviors from probabilistic models without direct expert input.[8] This focus makes knowledge engineering particularly suited to domains where interpretable, verifiable reasoning is essential, such as medical diagnosis or engineering design, rather than purely predictive tasks.[4]

Central to knowledge engineering are the distinctions between explicit and tacit knowledge, as well as its foundational role in constructing knowledge-based systems (KBS). Explicit knowledge consists of articulable facts, rules, and procedures that can be readily documented and formalized, while tacit knowledge encompasses intuitive, experience-based insights that are difficult to verbalize and often require interactive elicitation techniques to uncover.[4] Knowledge engineering bridges the two by converting tacit elements into explicit representations, thereby powering KBS: computer programs that use a structured knowledge base and inference mechanisms to solve domain-specific problems autonomously.[9]

The term "knowledge engineering" originated in the late 1960s and early 1970s within artificial intelligence research, evolving from John McCarthy's concept of "applied epistemology" and formalized by Edward Feigenbaum during work on projects like DENDRAL at Stanford University, with parallel developments at Carnegie Mellon University.[6] This etymology reflects its emergence as a practical application of AI principles to expert knowledge capture.[10]

Relation to Artificial Intelligence
Knowledge engineering occupies a central position within the broader field of artificial intelligence (AI), specifically as a core discipline of symbolic AI, which emphasizes the explicit representation and manipulation of knowledge using logical rules and structures.[11] This approach contrasts with connectionist methods, such as neural networks, that rely on statistical patterns derived from data rather than formalized symbolic reasoning.[12] During the 1970s, the "knowledge is power" paradigm emerged as a foundational principle in symbolic AI, asserting that the depth and quality of encoded domain knowledge, rather than raw computational power or general search methods, were key to achieving intelligent behavior.[13]

A pivotal milestone in this relation was the 1969 paper by John McCarthy and Patrick J. Hayes, which proposed dividing the AI problem into an epistemological part, concerned with formalizing what is known about the world, and a heuristic part, concerned with search and problem-solving.[14] This division underscored knowledge engineering's role in bridging the philosophical underpinnings of intelligence with practical computational implementation, influencing subsequent developments in AI by prioritizing structured knowledge as essential for reasoning.[15]

Knowledge engineering intersects with contemporary AI through its integration into hybrid systems that combine symbolic methods with sub-symbolic techniques, such as neuro-symbolic architectures, to enhance explainability and reasoning in complex tasks.[12] For instance, it supports natural language processing by providing ontologies and rule-based formalisms for semantic understanding, and bolsters decision support systems through encoded expert logic that guides probabilistic inferences.[16] In contrast to machine learning, knowledge engineering is inherently expert-driven, relying on human specialists to elicit and formalize domain knowledge, whereas machine learning is predominantly data-driven, inducing patterns from large datasets without explicit rule encoding.[17] It also differs from knowledge management, which focuses on organizational strategies for capturing, storing, and sharing information across enterprises, by emphasizing computational formalization tailored for automated inference in AI applications.[18]

Historical Development
Early Foundations (1950s–1970s)
The foundations of knowledge engineering emerged in the mid-20th century as part of the nascent field of artificial intelligence, where researchers sought to encode and manipulate human-like reasoning in computational systems. One of the earliest milestones was the Logic Theorist program, developed by Allen Newell, Herbert A. Simon, and Cliff Shaw in 1956, which demonstrated automated theorem proving by manipulating symbolic knowledge structures derived from Whitehead and Russell's Principia Mathematica. The system represented knowledge as logical expressions and applied heuristic search to generate proofs, marking the first deliberate attempt to engineer a machine capable of discovering new mathematical knowledge through rule-based inference.[19]

Building on this, Newell, Simon, and Shaw introduced the General Problem Solver (GPS) in 1959, a program designed to tackle a broad class of problems by separating domain-specific knowledge from general problem-solving strategies. GPS employed means-ends analysis, in which it identified discrepancies between the current and goal states and selected operators to reduce them, effectively treating knowledge as a set of objects, goals, and transformations applicable to tasks like theorem proving and puzzle solving. This approach laid groundwork for knowledge engineering by emphasizing the modular representation of expertise, allowing the system to simulate human problem-solving without exhaustive search.[20]

Theoretical advances in the 1960s further solidified these ideas through heuristic programming, pioneered by Edward Feigenbaum at Stanford University. Feigenbaum's work focused on capturing domain-specific expertise in programs like DENDRAL (initiated in 1965), which used heuristic rules to infer molecular structures from mass spectrometry data, introducing knowledge engineering as the process of eliciting and formalizing expert heuristics for scientific discovery. Complementing this, James Slagle's SAINT program (1961–1963) incorporated production rules—condition-action pairs—to solve symbolic integration problems in calculus, representing mathematical knowledge as a hierarchy of rules that guided heuristic selection and application, and achieving performance comparable to a skilled undergraduate.[21]

Institutional developments accelerated these efforts, with newly established AI laboratories providing dedicated venues for knowledge-focused research. The MIT Artificial Intelligence Project was founded in 1959 by John McCarthy and Marvin Minsky, fostering early experiments in symbolic knowledge manipulation. The Stanford Artificial Intelligence Laboratory (SAIL) followed in 1963 under McCarthy, while the University of Edinburgh's Experimental Programming Unit, led by Donald Michie, began AI work the same year, emphasizing pattern recognition and rule-based systems. These labs were bolstered by substantial funding from the Defense Advanced Research Projects Agency (DARPA), which allocated $2.2 million in 1963 to MIT's Project MAC for AI research, enabling interdisciplinary teams to engineer complex knowledge representations in areas like natural language and planning.[22][23][24]

Despite these advances, the era faced significant challenges from the computational limitations of early hardware, which struggled to process intricate knowledge structures involving large search spaces and symbolic manipulations. Systems like GPS and SAINT required substantial memory and processing time for even modest problems, highlighting the gap between theoretical promise and practical scalability. These constraints, coupled with overly optimistic projections from researchers, led to funding cuts that culminated in the first AI winter from 1974 to 1980: in the UK, the 1973 Lighthill Report criticized AI's progress and recommended reallocating resources away from machine intelligence, while in the US, DARPA reduced its AI funding in 1974 following internal reviews that questioned the field's progress.[25]

Rise of Expert Systems (1980s–1990s)
The 1980s marked a pivotal era in knowledge engineering, characterized by the proliferation of expert systems that applied domain-specific knowledge to solve complex problems previously handled by human specialists. Building on earlier theoretical work, this period saw the transition from experimental prototypes to practical implementations, driven by advances in rule-based reasoning and inference mechanisms. Expert systems encoded expert knowledge as production rules (if-then statements) combined with inference engines to mimic decision-making processes, enabling applications in medicine, chemistry, and manufacturing.[26]

Key projects exemplified this surge. MYCIN, developed at Stanford University in the mid-1970s, reached its peak influence in the 1980s as a consultation system for diagnosing bacterial infections and recommending antibiotic therapies; it demonstrated diagnostic accuracy comparable to or exceeding human experts in controlled tests, influencing subsequent medical AI efforts.[27] DENDRAL, initiated in 1965 at Stanford, expanded significantly in the 1980s with broader dissemination to academic and industrial chemists, automating mass spectrometry analysis to infer molecular structures from spectral data through heuristic rules.[28] Similarly, XCON (also known as R1), deployed by Digital Equipment Corporation in 1980, configured VAX computer systems from customer orders, reducing configuration errors by over 95% and saving millions of dollars in operational costs annually.[29]

Methodological advances facilitated scalable development. Frederick Hayes-Roth and colleagues introduced structured knowledge engineering cycles in 1983, outlining iterative phases of knowledge acquisition, representation, validation, and refinement to systematize the building of expert systems beyond ad hoc programming.[30] Complementing this, shell-based tools like EMYCIN, derived from MYCIN's domain-independent components, provided reusable frameworks for rule-based consultations, accelerating the creation of new systems in domains such as pulmonary function diagnosis (e.g., PUFF).[31]

Commercially, the expert systems boom fueled rapid industry growth, with the AI sector expanding from a few million dollars in revenue in 1980 to billions by 1988, driven by demand for knowledge-intensive applications.[32] Companies like Teknowledge, founded in 1981, specialized in developing and consulting on expert systems for business and engineering, securing contracts with firms such as General Motors.[33] Inference Corporation, established in 1979, commercialized tools like the ART shell for building rule-based systems, supporting deployments in finance and manufacturing.[34]

By the late 1980s, however, limitations emerged, particularly the knowledge acquisition bottleneck—the labor-intensive process of eliciting and formalizing expert knowledge, which Feigenbaum identified as early as 1983 and which hindered scaling beyond narrow domains.[35] This, combined with overhyped expectations and the collapse of specialized AI hardware markets such as Lisp machines, contributed to the second AI winter from 1987 to 1993, slashing funding and stalling expert systems development.[36]

Contemporary Evolution (2000s–Present)
The early 2000s witnessed a pivotal shift in knowledge engineering from domain-specific expert systems to ontology engineering, catalyzed by the Semantic Web initiative. In 2001, Tim Berners-Lee, James Hendler, and Ora Lassila outlined a vision for a web in which data would be given well-defined meaning, enabling computers to process information more intelligently through structured ontologies and the Resource Description Framework (RDF).[37] This approach emphasized interoperability and machine readability, transforming knowledge representation into a web-scale endeavor that built upon but extended earlier symbolic methods. Tools like Protégé, originally developed in the 1990s, matured significantly after 2000 with the release of Protégé-2000, an open-source platform for constructing ontologies and knowledge bases with intuitive interfaces for modeling classes, properties, and relationships.[38]

The rise of the internet and big data further propelled this evolution, providing vast, heterogeneous datasets that demanded robust knowledge engineering techniques for extraction, integration, and utilization. By the mid-2000s, the explosion of online information—estimated to grow from petabytes to zettabytes annually—highlighted the need for scalable structures to handle unstructured web content, prompting a revival of graph-based representations.[39] Google's introduction of the Knowledge Graph in 2012 exemplified this trend, deploying a massive entity-relationship database containing 500 million entities and more than 3.5 billion facts about them to improve search relevance by connecting concepts rather than relying solely on keyword matching.[40] Similarly, Facebook's Graph Search, launched in 2013, extended the company's social graph into a knowledge-oriented query system, allowing users to explore connections across people, places, and interests using natural language.

Research trends in the 2010s increasingly focused on hybrid approaches that merged symbolic knowledge engineering with statistical machine learning, addressing challenges such as knowledge acquisition from noisy data and enabling neuro-symbolic systems for explainable AI. These methods combined rule-based reasoning with data-driven inference, as seen in frameworks for ontology learning from text corpora.[41] The European Knowledge Acquisition Workshop (EKAW), established in 1987, remained influential during this period, with proceedings from 2000 onward contributing seminal works on ontology alignment, knowledge validation, and collaborative engineering practices that shaped interdisciplinary applications.[42]

In the 2020s, knowledge engineering has advanced through the integration of large language models (LLMs) with symbolic methods, enabling automated knowledge extraction and validation at scale. Neuro-symbolic AI systems, such as those combining LLMs with ontologies for enhanced reasoning, have addressed the knowledge bottleneck by facilitating hybrid models that leverage both neural pattern recognition and logical inference, as demonstrated in applications like medical diagnostics and scientific discovery.[43] By 2025, knowledge engineering has become integral to enterprise AI, powering recommendation systems, natural language processing, and decision support tools across industries. The global knowledge management software market, which encompasses core knowledge engineering functionality, reached approximately $13.7 billion in value, reflecting widespread adoption of scalable, AI-enhanced platforms.[44]

Core Processes
Knowledge Acquisition
Knowledge acquisition is the initial and often most challenging phase of knowledge engineering, involving the systematic elicitation, capture, and organization of expertise from human sources to build knowledge-based systems. This process transforms tacit knowledge—such as heuristics, decision rules, and domain-specific insights held by experts—into explicit, structured forms suitable for computational use. It requires close collaboration between knowledge engineers and domain experts, with iterative refinement to ensure that the captured knowledge reflects real-world problem-solving accurately.

The process typically unfolds in distinct stages: first, identifying suitable experts through criteria such as experience level and reputation within the domain; second, conducting elicitation sessions to gather knowledge; and third, structuring the elicited information into preliminary models or hierarchies. Common pitfalls include expert inconsistency, where individuals provide varying responses due to contextual factors or memory biases, and the "knowledge bottleneck," where a single expert's limited availability constrains progress. To mitigate these, knowledge engineers often employ multiple experts for cross-validation and document sessions meticulously.

Key techniques for knowledge acquisition include structured interviews, in which engineers pose targeted questions to probe decision-making processes; protocol analysis, in which experts verbalize their thoughts during task performance to reveal underlying reasoning; and repertory grids, originally developed within George Kelly's personal construct theory for psychological assessment and adapted for eliciting hierarchical knowledge structures. Additionally, machine induction methods, such as decision tree learning algorithms like ID3, automate rule extraction from examples provided by experts, generating if-then rules that approximate human expertise (a minimal sketch of this appears at the end of this subsection). These techniques are selected based on the domain's complexity, with manual methods suiting nuanced, qualitative knowledge and automated ones handling large datasets efficiently.

Early tools for facilitating acquisition included HyperCard-based systems in the 1980s, which enabled interactive card stacks for visual knowledge mapping and prototyping. Modern software, such as OntoWizard, supports ontology-driven acquisition by guiding users through graphical interfaces to define concepts, relations, and axioms collaboratively. As of 2025, generative AI and large language models (LLMs) have emerged as tools for semi-automated knowledge extraction from unstructured text, using prompting techniques such as few-shot learning to generate knowledge graph triples and datasets efficiently. These tools improve efficiency by reducing the cognitive load on experts and allowing real-time feedback during sessions.[45]

Success in knowledge acquisition is evaluated through metrics such as completeness, which assesses whether all relevant rules or concepts have been captured, often measured by coverage rates in downstream validation tests in which the knowledge base must reproduce expert decisions on unseen cases. Accuracy is gauged by comparing system outputs to expert judgments, with thresholds set according to domain requirements to ensure reliability. These metrics underscore the iterative nature of acquisition, in which initial captures are refined until they achieve sufficient fidelity for practical deployment.
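The machine-induction step described above can be made concrete with a short sketch. The following Python example, which assumes scikit-learn is installed and uses an invented symptom-classification dataset, trains a decision tree with the entropy criterion (an ID3-style information-gain heuristic; scikit-learn itself implements CART) and prints the induced branches as if-then rules that a knowledge engineer could review with the expert.

```python
# Minimal sketch of machine induction for knowledge acquisition: a decision
# tree is trained on hypothetical expert-classified cases and its branches
# are exported as human-readable if-then rules.
from sklearn.tree import DecisionTreeClassifier, export_text

# Hypothetical expert-labelled cases: [fever, cough, sore_throat] -> diagnosis
features = ["fever", "cough", "sore_throat"]
cases = [
    [1, 1, 0],
    [1, 0, 1],
    [0, 1, 1],
    [0, 0, 1],
    [1, 1, 1],
    [0, 0, 0],
]
labels = ["flu", "flu", "cold", "cold", "flu", "healthy"]

# The entropy criterion approximates ID3's information-gain splitting.
clf = DecisionTreeClassifier(criterion="entropy", random_state=0)
clf.fit(cases, labels)

# export_text renders each root-to-leaf path, i.e. one induced rule per leaf.
print(export_text(clf, feature_names=features))
```

Each root-to-leaf path corresponds to one candidate rule; in practice such rules are validated against the expert's judgment before entering the knowledge base.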
Knowledge Representation

Knowledge representation in knowledge engineering involves formalizing acquired knowledge into structures that machines can process, reason over, and utilize effectively. This process transforms unstructured or semi-structured information from domain experts into symbolic or graphical forms that support inference, enabling systems to mimic human-like decision-making. Central to this are several paradigms that balance the need to capture complex relationships against computational feasibility.

One foundational paradigm is rule-based representation, which encodes knowledge as conditional statements in the form of production rules, typically structured as "IF conditions THEN actions". This format allows straightforward encoding of heuristic knowledge, where conditions test facts in a working memory and actions modify the state or infer new facts. Production rules gained prominence in early expert systems due to their modularity and ease of acquisition from experts, facilitating forward or backward chaining for inference.[46][47]

Frame-based representation, introduced by Marvin Minsky, organizes knowledge into hierarchical structures called frames, each consisting of slots and fillers that represent stereotypical situations or objects. Frames support inheritance: a child frame automatically acquires the slots of its parent frame unless they are explicitly overridden. This paradigm excels at modeling default knowledge and contextual expectations, such as filling in missing details during comprehension tasks. It provides a natural way to represent structured objects and their attributes, though it requires careful handling of exceptions to inheritance.[48][49]

Semantic networks represent knowledge as directed graphs, with nodes denoting concepts or entities and labeled edges indicating relationships, such as "is-a" for inheritance or "has-part" for composition. Originating in efforts to model associative memory, these networks enable efficient traversal for retrieval and inference, such as spreading activation to find related concepts. They are particularly useful for capturing taxonomic hierarchies and associative links in domains like natural language processing.[50]

Logic-based representation employs formal logics, notably first-order logic (FOL), to express knowledge declaratively as axioms that support deductive reasoning. Languages like Prolog implement a subset of FOL through Horn clauses, allowing queries to derive conclusions via resolution. This approach provides precise semantics and sound inference, but standard FOL is monotonic and must be extended to handle default or non-monotonic reasoning. Prolog's syntax, for instance, uses predicates such as parent(X, Y) to define facts and rules for inference.[51][52]
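As a concrete illustration of the rule-based paradigm described above, the following minimal Python sketch (with invented rules and facts, not drawn from any cited system) represents production rules as condition-action pairs over a working memory and applies naive forward chaining until no new facts can be derived.

```python
# Minimal forward-chaining sketch: a production rule fires when all of its
# condition facts are present in working memory, adding its action fact,
# until no rule can add anything new.
rules = [
    # (conditions, action) pairs -- illustrative domain knowledge only
    ({"has_fever", "has_cough"}, "suspect_flu"),
    ({"suspect_flu", "is_elderly"}, "recommend_doctor_visit"),
    ({"has_rash"}, "suspect_allergy"),
]

def forward_chain(facts: set) -> set:
    """Repeatedly apply rules whose conditions are satisfied (forward chaining)."""
    derived = set(facts)
    changed = True
    while changed:
        changed = False
        for conditions, action in rules:
            if conditions <= derived and action not in derived:
                derived.add(action)   # the rule "fires"
                changed = True
    return derived

working_memory = {"has_fever", "has_cough", "is_elderly"}
print(forward_chain(working_memory))
# -> includes 'suspect_flu' and 'recommend_doctor_visit'
```

Backward chaining would instead start from a goal fact and search for rules whose actions could establish it, recursively treating those rules' conditions as subgoals.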
Among formalisms, ontologies expressed in the Web Ontology Language (OWL) standardize knowledge representation for the Semantic Web, enabling interoperability across systems. OWL, a W3C recommendation, builds on description logics to define classes, properties, and individuals with constructs for cardinality restrictions and transitive relations, supporting automated reasoning over web-scale data. Description logics (DLs) underpin OWL, providing decidable fragments of FOL for tasks like subsumption and consistency checking; for example, the ALC concept Animal ⊓ ∃hasLeg.Leg describes the animals that have at least one leg. By restricting FOL's expressive power, DLs keep reasoning over expressive ontologies decidable.[53][54]
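To show what such axioms look like in practice, the sketch below assumes the Python rdflib library and encodes a tiny OWL fragment (a class hierarchy, an object property echoing the hasLeg example, and one individual), then serializes it as Turtle; the namespace and names are illustrative only. In real projects this modeling is more often done in an editor such as Protégé and checked with a DL reasoner.

```python
# Sketch of encoding a tiny OWL ontology as RDF triples with rdflib.
# The namespace and class names are illustrative, not a standard vocabulary.
from rdflib import Graph, Namespace, RDF, RDFS, OWL

EX = Namespace("http://example.org/zoo#")
g = Graph()
g.bind("ex", EX)
g.bind("owl", OWL)

# Classes: Animal, Dog (a subclass of Animal), and Leg
for cls in (EX.Animal, EX.Dog, EX.Leg):
    g.add((cls, RDF.type, OWL.Class))
g.add((EX.Dog, RDFS.subClassOf, EX.Animal))

# Object property hasLeg relating Animals to Legs
g.add((EX.hasLeg, RDF.type, OWL.ObjectProperty))
g.add((EX.hasLeg, RDFS.domain, EX.Animal))
g.add((EX.hasLeg, RDFS.range, EX.Leg))

# An individual: rex is a Dog with one asserted leg
g.add((EX.rex, RDF.type, EX.Dog))
g.add((EX.leg1, RDF.type, EX.Leg))
g.add((EX.rex, EX.hasLeg, EX.leg1))

print(g.serialize(format="turtle"))
```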
For handling uncertainty, Bayesian networks offer a probabilistic paradigm, representing knowledge as directed acyclic graphs where nodes are random variables and edges denote conditional dependencies. Inference computes posterior probabilities via methods like belief propagation, as formalized in Pearl's framework. This is briefly noted here as a complementary approach, though its primary application lies in uncertain reasoning.[55]
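A self-contained Python sketch of this paradigm is given below, using the textbook rain-sprinkler-wet-grass network (the probabilities are illustrative, not drawn from the cited sources); it encodes each node's conditional probability table and computes a posterior by exhaustive enumeration, which is exact but practical only for very small networks.

```python
# Tiny Bayesian network (Rain -> Sprinkler, Rain -> WetGrass <- Sprinkler)
# with inference by exhaustive enumeration over the joint distribution.
from itertools import product

P_rain = {True: 0.2, False: 0.8}
P_sprinkler_given_rain = {True: {True: 0.01, False: 0.99},
                          False: {True: 0.4, False: 0.6}}
# P(WetGrass = true | Sprinkler, Rain)
P_wet_given = {(True, True): 0.99, (True, False): 0.9,
               (False, True): 0.8, (False, False): 0.0}

def joint(rain: bool, sprinkler: bool, wet: bool) -> float:
    """Chain rule along the network's edges: P(R) * P(S|R) * P(W|S,R)."""
    p_wet = P_wet_given[(sprinkler, rain)]
    return (P_rain[rain]
            * P_sprinkler_given_rain[rain][sprinkler]
            * (p_wet if wet else 1.0 - p_wet))

# Posterior P(Rain = true | WetGrass = true) by enumerating the hidden variable
numerator = sum(joint(True, s, True) for s in (True, False))
evidence = sum(joint(r, s, True) for r, s in product((True, False), repeat=2))
print(f"P(rain | wet grass) = {numerator / evidence:.3f}")
```

Belief propagation and related algorithms achieve the same result without enumerating the full joint distribution, which is what makes larger networks tractable.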
A key consideration across these paradigms is the trade-off between expressiveness and efficiency: highly expressive formalisms like full FOL enable rich descriptions but lead to undecidable or computationally expensive reasoning (e.g., EXPTIME-complete for ALC), while restricted ones like production rules or basic DLs ensure polynomial-time inference at the cost of limited modeling power. Designers must select based on domain needs, prioritizing tractability for real-time systems.[56] As of 2025, large language models (LLMs) assist in ontology engineering and knowledge graph structuring through prompting, integrating with tools like NeOn and HermiT to improve consistency, though prompting expertise is required to mitigate inconsistencies.[45]
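As an illustrative sketch of the prompting workflow just mentioned, the Python fragment below builds a few-shot prompt asking an LLM to emit subject-predicate-object triples and parses the reply; call_llm is a hypothetical placeholder for whatever model client is actually used, and the example text and triples are invented.

```python
# Sketch of few-shot prompting for knowledge-graph triple extraction.
# `call_llm` is a hypothetical stand-in for a real LLM client; extracted
# triples would still need validation against the target ontology.
FEW_SHOT_PROMPT = """Extract (subject, predicate, object) triples from the text.

Text: "Aspirin is used to treat mild pain."
Triples:
(Aspirin, treats, mild pain)

Text: "{text}"
Triples:
"""

def call_llm(prompt: str) -> str:
    """Hypothetical LLM call; replace with an actual client library."""
    raise NotImplementedError

def extract_triples(text: str) -> list:
    """Prompt the model and parse '(s, p, o)' lines from its reply."""
    reply = call_llm(FEW_SHOT_PROMPT.format(text=text))
    triples = []
    for line in reply.splitlines():
        line = line.strip()
        if line.startswith("(") and line.endswith(")"):
            parts = [p.strip() for p in line[1:-1].split(",")]
            if len(parts) == 3:
                triples.append(tuple(parts))
    return triples
```

Extracted triples would then be checked against the target ontology and, where needed, reviewed by a domain expert before being added to the knowledge graph.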