Multi-model database
A multi-model database is a type of database management system (DBMS) that natively supports multiple data models—such as relational, document (e.g., JSON or XML), graph, key-value, and spatial—within a single, integrated backend, allowing diverse data types to be stored, queried, and managed without requiring separate specialized databases.[1][2] This approach, which delivers the benefits of polyglot persistence within a single system, addresses the challenges of handling heterogeneous data in modern applications by providing unified administration, security, scalability, and high availability features across all supported models.[1][3]

Key benefits of multi-model databases include simplified data integration and reduced operational complexity, as organizations avoid the overhead of maintaining multiple siloed systems for different data formats.[2][3] They enable efficient querying using a common language or extensions, such as SQL with added support for graph patterns (e.g., MATCH clauses), JSON functions, XQuery for XML, and spatial operators, often leveraging in-memory processing and indexing tailored to each model.[2][1]

Notable implementations include Oracle AI Database 26ai (as of 2025), which supports JSON via Simple Oracle Document Access (SODA), property graphs with analytics, RDF semantic graphs, and spatial data; Azure SQL, which integrates these capabilities into its relational engine using Transact-SQL extensions; and Azure Cosmos DB, a NoSQL multi-model service supporting document, key-value, wide-column, graph, and spatial models.[4][1][2][5]

The rise of multi-model databases reflects the evolution of data management to accommodate big data, cloud-native applications, and polyglot programming, with benchmarks emerging to evaluate performance across models such as document, graph, and key-value stores.[3][6] These systems prioritize optimized storage formats, such as binary JSON representations, and cross-model query capabilities to support complex, real-world workloads in industries like finance, healthcare, and e-commerce.[1][7]
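As an illustration of such cross-model querying, the following is a minimal sketch in the style of the SQL:2023 property-graph (SQL/PGQ) and SQL/JSON extensions; the graph, labels, and column names are hypothetical, and exact syntax varies by vendor.

    -- Hedged sketch: one SQL statement touching two models.
    -- GRAPH_TABLE/MATCH applies a graph pattern; JSON_VALUE reads a document field.
    SELECT p.person_name,
           JSON_VALUE(p.profile, '$.city') AS city   -- SQL/JSON function over a document column
    FROM GRAPH_TABLE (
           social_graph                              -- hypothetical property graph
           MATCH (a IS Person)-[e IS FRIEND_OF]->(b IS Person)
           WHERE b.person_name = 'Alice'
           COLUMNS (a.person_name AS person_name,
                    a.profile     AS profile)
         ) p;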
Overview

Definition and Characteristics
A multi-model database is a database management system (DBMS) that natively supports multiple data models—such as relational, document, graph, and key-value—within a single, integrated backend, enabling seamless storage, querying, and management of diverse data types without requiring separate systems for each model.[8][9] This approach allows applications to leverage specialized data structures and access methods tailored to specific needs while maintaining a unified platform for all data operations.[1]

Key characteristics of multi-model databases include a unified storage engine that efficiently manages various data formats and structures in one repository, model-agnostic querying that supports operations across different models via a single interface or query language, and the elimination of data silos by consolidating heterogeneous data sources.[9][10]

Unlike polyglot persistence, which relies on multiple specialized databases, leading to increased complexity, integration overhead, and potential inconsistencies, multi-model databases achieve polyglot capabilities within a single system, simplifying administration, security, and scalability.[1][11] These databases evolved to overcome the rigidity of traditional single-model systems, such as relational DBMSs limited to structured data or NoSQL silos optimized for one paradigm but inflexible for others, by enabling hybrid data handling that supports the varied workloads of modern applications.[9][11] This unified flexibility addresses the challenges of data diversity in big data environments without the drawbacks of fragmented architectures.[10]

Historical Development
The concept of multi-model databases emerged in the early 2010s, building on innovations in NoSQL databases to address the growing need for handling diverse data types within a unified system, rather than relying on separate specialized databases. This development responded to the challenges of polyglot persistence, a term popularized by software architect Martin Fowler in his 2011 bliki post, which described using multiple database technologies tailored to specific application needs to manage varying data storage requirements.[12] One of the pioneering systems, OrientDB, was first released in 2010 by Luca Garulli, integrating document, graph, key-value, and object-oriented models into a scalable NoSQL database.[13] The term "multi-model database" itself was formally introduced by Garulli in May 2012 during his keynote at the NoSQL Matters Conference in Cologne, Germany, envisioning an evolution of first-generation NoSQL products to support broader use cases through integrated backends.[14]

Between 2014 and 2018, multi-model databases gained traction with key releases that demonstrated practical viability and enterprise appeal. ArangoDB, initially launched as AvocadoDB in 2011 and renamed in 2012, established itself as an open-source option supporting document, graph, and key-value models with a focus on query flexibility via its AQL language.[15] Similarly, Microsoft introduced Azure Cosmos DB in 2017 as a globally distributed, multi-model service, evolving from the internal Project Florence started in 2010 to handle large-scale, multi-tenant applications across key-value, document, graph, and column-family models.[16]

Post-2020, the adoption of multi-model databases accelerated, driven by the demands of cloud-native architectures and AI-driven workloads that require seamless integration of structured, semi-structured, and unstructured data. Systems like SurrealDB, first released in 2022, have advanced this trend through ongoing developments up to 2025, emphasizing real-time querying, extensibility, and deployment in edge computing environments to support distributed AI applications.[17]

This growth reflects broader shifts in data management, including the transition from rigid relational database management systems (RDBMS), which dominated from the 1970s to the 2000s, to the scalable but fragmented NoSQL paradigms of the 2000s.[18] The rise of multi-model approaches was further influenced by the big data explosion, where frameworks like Apache Hadoop—initially released in April 2006—exposed the "variety" challenge in processing heterogeneous datasets, prompting hybrid designs that unify storage and querying without sacrificing performance.[19] By consolidating models into single engines, these databases mitigated the operational overhead of polyglot persistence while adapting to the unstructured data surge in modern ecosystems.[20]

Supported Data Models
Common Models
Multi-model databases typically support a variety of standard data models to accommodate diverse application needs, including relational, document, graph, key-value, column-family, spatial, vector, and time-series structures. These models allow users to store and manage different types of data within a unified system, leveraging each model's strengths for specific use cases such as structured queries, semi-structured storage, or relationship traversals.

The relational model organizes data into tabular structures with rows and columns, supporting SQL-like querying, ACID transactions for data integrity, and operations like joins to relate multiple tables efficiently. This model is particularly suited for applications requiring strict schema enforcement and complex analytical queries, as implemented in systems like Azure SQL Database, which extends traditional relational capabilities to multi-model environments.[2]

The document model stores data as self-contained, semi-structured units in formats like JSON or BSON, offering schema flexibility to handle varying data shapes without rigid predefined structures. It excels in scenarios involving hierarchical or nested data, such as content management or user profiles, where rapid ingestion and retrieval are prioritized over fixed schemas, as seen in ArangoDB's native document collections.

The graph model represents data as nodes, edges, and properties to capture complex relationships and interconnections, enabling efficient traversals and pattern matching for relationship-heavy datasets like social networks or recommendation engines. This approach facilitates queries that follow paths through connected entities, providing insights into networks that tabular models struggle with, as supported natively in databases like OrientDB.[21]

The spatial model handles geographic and geometric data, supporting queries for location-based analysis, proximity searches, and mapping applications using standards like GeoJSON or Well-Known Text (WKT). It is ideal for use cases in logistics, urban planning, and environmental monitoring, with native support in systems like Oracle Database and ArangoDB.[1]

Key-value and column-family models provide foundational storage for high-performance access patterns. The key-value model uses simple pairs for fast lookups and caching, ideal for session data or configuration stores with minimal overhead. Column-family models, akin to wide-column stores, organize data into dynamic columns within rows for scalable handling of sparse, semi-structured information like logs or sensor readings, as exemplified by Azure Cosmos DB's Cassandra API.

Emerging support for vector and time-series models addresses modern demands in AI/ML and real-time analytics as of 2025. The vector model stores high-dimensional embeddings for similarity searches and machine learning applications, such as semantic retrieval in large language models, integrated in systems like ArangoDB. The time-series model manages timestamped sequential data for temporal analysis, supporting efficient aggregation and forecasting in IoT or financial applications, as provided by SurrealDB.[22]
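The interplay of several of these models within one engine can be sketched in SQL; the example below uses PostgreSQL-flavored syntax with the PostGIS extension as one concrete illustration, and all table, column, and value names are hypothetical.

    -- Hypothetical schema mixing three models in one relational engine:
    CREATE TABLE stores (
        id       SERIAL PRIMARY KEY,           -- relational model: surrogate key
        profile  JSONB,                        -- document model: flexible attributes
        location GEOMETRY(Point, 4326)         -- spatial model: WGS 84 point (PostGIS)
    );

    -- Cross-model query: a document predicate and a spatial predicate
    -- evaluated in the same statement.
    SELECT id, profile->>'name' AS name
    FROM stores
    WHERE profile @> '{"open": true}'          -- JSONB containment
      AND ST_DWithin(location::geography,      -- proximity search: within 5 km
                     ST_MakePoint(-122.42, 37.77)::geography,
                     5000);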
Extensibility and User-Defined Models

Multi-model databases enhance flexibility by supporting user-defined models, which allow developers to create custom data structures tailored to specific application needs without altering the core system. These models are typically defined through mechanisms such as schema extensions, where users specify new item types, constraints, and relationships using declarative constructs like the TRIPLE format (<ITEM NAME, ITEM TYPE, ITEM CONSTRAINT>). For instance, custom geospatial models can be built by extending graph-based structures with path filters to handle spatial queries, while event-sourced models leverage document-oriented schemas with matching filters for temporal event tracking.[23] This approach enables the integration of domain-specific semantics while preserving compatibility with built-in models.[23]

Extensibility features in multi-model databases further empower customization through plugin architectures, schema-on-read paradigms, and API hooks that facilitate the addition of new models without backend modifications. Plugin architectures permit the registration of characteristic filters or functions that extend query processing for novel data types, ensuring seamless incorporation of specialized logic. Schema-on-read approaches, such as those employing supply-driven inference, dynamically interpret heterogeneous data sources—ranging from relational to graph-based—allowing on-demand extensions of existing schemas with minimal upfront definition. API hooks provide entry points for injecting domain-specific behaviors, such as custom indexing or validation, directly into the query engine. These features collectively support scalable adaptation, as demonstrated by tools that unify schemas across models using record schema descriptions (RSD) to capture integrity constraints and inter-model references.[23][24][25]

In practice, these capabilities enable multi-model databases to adapt to industry-specific requirements, fostering innovation in dynamic environments. In finance, extensibility allows the creation of custom risk assessment models by extending multidimensional cubes with real-time market data feeds, improving OLAP analyses for volatile conditions. For IoT applications, hybrid sensor data models can be user-defined to integrate time-series and graph elements, supporting real-time analytics in scenarios like environmental monitoring. By 2025, the integration of AI into database management has advanced schema evolution and automation, reducing manual configuration in evolving data ecosystems.[24]
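A minimal sketch of how TRIPLE-style item definitions might be recorded is shown below; the catalog table and the example entry are hypothetical, as real systems expose such extensions through their own DDL or plugin APIs.

    -- Hedged sketch: a catalog table holding TRIPLE-style schema extensions
    -- (<ITEM NAME, ITEM TYPE, ITEM CONSTRAINT>). All names are hypothetical.
    CREATE TABLE custom_model_items (
        item_name       VARCHAR(128) NOT NULL,   -- ITEM NAME
        item_type       VARCHAR(64)  NOT NULL,   -- ITEM TYPE, e.g. 'node', 'edge', 'event'
        item_constraint TEXT                     -- ITEM CONSTRAINT, a declarative expression
    );

    -- A user-defined geospatial-graph item: an edge type whose path filter
    -- restricts traversals to road segments shorter than 10 km.
    INSERT INTO custom_model_items VALUES
        ('road_segment', 'edge', 'length_km < 10');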
System Architecture

Core Design Principles
Multi-model database systems are engineered around a unified backend that serves as a single, integrated storage layer capable of handling diverse data models such as relational, document, graph, and key-value without requiring separate engines or polyglot persistence approaches.[26] This design minimizes overhead by sharing core infrastructure services like transactions, recovery, and indexing across models, ensuring data consistency and reducing the complexity of managing multiple disparate systems.[27] By consolidating storage, these systems avoid the integration challenges of traditional polyglot setups, allowing for more efficient resource utilization and simpler administration.[26]

To facilitate seamless interaction with varied data models, multi-model databases employ abstraction layers, often in the form of unified APIs or intermediaries like object-relational mappers, that translate operations between models without exposing underlying complexities to applications.[7] These layers enable declarative access to multiple models through a common interface, supporting transformations such as SQL queries over graph data or JSON documents, which enhances developer productivity by abstracting model-specific details.[26] For instance, views and query rewriters act as logical intermediaries, permitting flexible data organization independent of physical storage while maintaining model fidelity.[27]

Scalability and consistency in multi-model databases involve strategic trade-offs guided by the CAP theorem: systems prioritize availability and partition tolerance for distributed workloads, often favoring eventual consistency to accommodate diverse model requirements like high-throughput key-value operations alongside ACID-compliant relational transactions.[28] This balance is achieved through tunable consistency models, such as BASE for scalable, fault-tolerant scenarios and stricter ACID guarantees for critical data, enabling horizontal scaling across large, semi-structured datasets without sacrificing overall system reliability.[26] In practice, in-memory processing and adaptive indexing support massive data volumes, ensuring performance under varying loads from common models like graphs and documents.[27]

Security and governance are reinforced through unified access controls that apply consistently across all supported models, typically via role-based access control (RBAC) policies that enforce fine-grained permissions and prevent unauthorized cross-model data exposure.[29] This centralized approach simplifies compliance by providing a single governance framework for auditing, encryption, and policy enforcement, reducing the risks associated with fragmented security in multi-model environments.[30] For example, attribute-based controls can restrict intra-document access using standards like XPath, ensuring secure handling of hybrid data while maintaining operational efficiency.[26]
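The security principle can be illustrated with generic SQL: because documents and relational rows share one backend, a single role-based policy and ordinary views govern both. All object names below are hypothetical.

    -- Hedged sketch of unified, cross-model access control in one engine.
    CREATE ROLE analyst;

    -- Relational data is governed by the ordinary GRANT machinery:
    GRANT SELECT ON orders TO analyst;            -- relational table

    -- JSON documents live in product_reviews; analysts see them only
    -- through a view that projects selected fields, giving fine-grained,
    -- intra-document access control.
    CREATE VIEW public_reviews AS
    SELECT JSON_VALUE(doc, '$.rating') AS rating,
           JSON_VALUE(doc, '$.text')   AS review_text
    FROM product_reviews;                         -- omits fields such as '$.reviewer_email'
    GRANT SELECT ON public_reviews TO analyst;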
Multi-model databases typically employ a unified storage engine to manage diverse data models such as documents, graphs, and key-value pairs, often building on document-oriented structures like JSON trees or extending key-value stores to accommodate relational and graph elements. For instance, systems like ArangoDB utilize RocksDB, an LSM-tree-based engine optimized for high write throughput, to persist all data models in a single layer, where documents serve as the foundational unit and graph edges are represented as specialized documents linking vertices.[31] In contrast, OrientDB leverages B-tree and hash-based storage for efficient read operations across its multi-model support, including object-oriented extensions for relational-like queries. These engines balance write-heavy workloads with LSM-trees for sequential appends and read-optimized B-trees for point lookups, enabling seamless integration of heterogeneous data without model-specific silos.[32]

Indexing strategies in multi-model databases are designed to support queries across models, incorporating composite indexes for relational joins, full-text indexes for document searches, and traversal indexes for graph navigation. Composite indexes, often built on multiple attributes, facilitate efficient relational operations by combining keys from document or key-value stores, as seen in ArangoDB's hash and skiplist indexes that span document and graph elements. Full-text indexes employ inverted structures to handle semi-structured document content, while graph-specific traversal indexes use adjacency lists or edge pointers to enable rapid pathfinding, with OrientDB's traversal mechanism supporting millisecond-level queries regardless of database scale. Adaptive indexing approaches dynamically adjust based on query patterns, selecting model-appropriate structures—such as B-trees for ordered relational access or bloom filters for probabilistic key-value lookups—to optimize across mixed workloads.[32]

Data representation in multi-model databases relies on unified serialization formats to store heterogeneous data efficiently, often using binary encodings like BSON or Protocol Buffers to embed diverse models within a common structure. For example, graphs are typically represented via adjacency lists embedded in document collections, allowing key-value pairs to serve as node properties and relational tuples to map onto composite keys, as implemented in systems like ArcNeural with its memory-mapped files for vectors and RocksDB for payloads.[33] Schema evolution tools, such as the prototype MM-evolver proposed in 2019, support propagating changes across models—such as adding attributes to documents or altering graph edges—while maintaining backward compatibility through versioned mappings and categorical transformations.[34] This enables flexible handling of evolving schemas without data migration disruptions, prioritizing extensibility in polyglot persistence environments.[35]
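A simplified sketch of this layout, in the spirit of engines that store edges as specialized documents linking vertices, is shown below using PostgreSQL-flavored SQL; the schema and key format are hypothetical.

    -- Hedged sketch: vertices and edges stored over a shared document layer.
    CREATE TABLE vertices (
        _key       VARCHAR(64) PRIMARY KEY,    -- key-value access path
        properties JSONB                       -- document payload (node properties)
    );

    CREATE TABLE edges (
        _from      VARCHAR(64) REFERENCES vertices(_key),
        _to        VARCHAR(64) REFERENCES vertices(_key),
        label      VARCHAR(64),
        properties JSONB
    );

    -- A composite "traversal" index approximating an adjacency list:
    CREATE INDEX edges_out ON edges (_from, label);

    -- One-hop graph traversal expressed relationally over the unified store:
    SELECT v2.properties->>'name'
    FROM edges e
    JOIN vertices v2 ON v2._key = e._to
    WHERE e._from = 'user/123' AND e.label = 'follows';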
Querying and Interfaces

Query Languages
Multi-model databases employ a variety of query languages to handle operations across diverse data models, typically through unified languages that abstract underlying complexities or model-specific subsets routed via a single interface. Unified query languages, such as ArangoDB Query Language (AQL), enable seamless querying of key-value, document, and graph models within a single syntax, supporting declarative operations like traversals and joins without requiring model-specific switches.[36] Similarly, extensions to SQL, including SQL/JSON as standardized in ISO/IEC 9075:2016, allow relational databases like PostgreSQL to query JSON documents alongside tabular data using operators like containment (@>) and path expressions, effectively supporting hybrid relational-document models.[36][37]
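For example, a hybrid relational-document query in PostgreSQL-style SQL/JSON might look like the following sketch, with hypothetical table and column names:

    -- Hedged sketch: relational filtering combined with document operators.
    SELECT id,
           doc->>'title' AS title                       -- extract a document field
    FROM articles                                       -- relational table holding JSONB documents
    WHERE doc @> '{"status": "published"}'              -- containment operator
      AND jsonb_path_exists(doc, '$.tags[*] ? (@ == "databases")');  -- SQL/JSON path expression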
Model-specific query subsets are often integrated into multi-model systems to leverage specialized paradigms while maintaining a unified access point. For instance, in databases like ArcadeDB, SQL handles relational queries, Cypher supports pattern matching for property graphs (e.g., MATCH (n:Hero)-[:IsFriendOf]->(m) RETURN n, m), and Gremlin enables traversal-based graph operations, all executable through a consistent interface such as the system's Java API or web console.[38] These subsets allow developers to apply graph-specific languages like Cypher or Gremlin for complex relationship queries without abandoning relational SQL for structured data, with the database routing requests internally across models.[36]
Advanced features in these languages facilitate cross-model interactions, such as joins between graph edges and JSON documents or aggregation pipelines that summarize data from multiple sources. In AQL, for example, queries can perform graph traversals followed by aggregations like counting connected components across document collections, optimizing for multi-model storage targets. SQL++ variants extend this by incorporating path queries and object-relational mappings for unified aggregations over JSON and relational data.[36] The Graph Query Language (GQL), standardized as ISO/IEC 39075 in 2024, defines a standalone declarative language for property graphs, while the closely related SQL/PGQ extension (ISO/IEC 9075-16:2023) integrates property graph pattern matching into SQL itself, enabling multi-model systems to handle graph patterns alongside relational and document data.[39]
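A cross-model aggregation of this kind might look like the following hedged sketch, combining SQL/PGQ graph matching with SQL/JSON extraction; the graph, tables, and columns are hypothetical.

    -- Hedged sketch: a graph traversal feeding a relational join
    -- against JSON documents, summarized with GROUP BY.
    SELECT g.customer_id,
           COUNT(*) AS order_count,
           SUM(CAST(JSON_VALUE(o.doc, '$.total') AS DECIMAL(10,2))) AS total_spend
    FROM GRAPH_TABLE (
           referral_graph
           MATCH (r IS Customer)-[e IS REFERRED]->(c IS Customer)
           WHERE r.customer_id = 42
           COLUMNS (c.customer_id AS customer_id)
         ) g
    JOIN orders o ON o.customer_id = g.customer_id
    GROUP BY g.customer_id;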
As of November 2025, natural language interfaces using large language models (LLMs) are an emerging trend in database querying, primarily through tools that translate plain English prompts into SQL (NL2SQL), with growing exploration for broader data models. These tools aim to enable non-experts to query enterprise-scale databases while balancing accuracy and latency, though adoption for cross-model operations across graphs, documents, and vectors remains in early stages.[40][41]
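In an NL2SQL workflow, the translation layer maps a plain-English prompt to a conventional query; the sketch below pairs a hypothetical prompt with the kind of SQL such a tool might emit against an assumed schema.

    -- Prompt: "Show the five customers who spent the most this year."
    -- Hypothetical generated query (schema assumed, not from any specific tool):
    SELECT customer_id,
           SUM(amount) AS total_spend
    FROM payments
    WHERE paid_at >= DATE '2025-01-01'
    GROUP BY customer_id
    ORDER BY total_spend DESC
    FETCH FIRST 5 ROWS ONLY;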