Fact-checked by Grok 2 weeks ago

Cosmos DB

Azure Cosmos DB is a fully managed, globally distributed, multi-model NoSQL database service developed by Microsoft as part of the Azure cloud platform, offering seamless scalability, low-latency access, and support for multiple data models including document, key-value, graph, column-family, and vector data to enable modern application development.^[1] Launched in general availability on May 10, 2017, it evolved from the earlier Azure DocumentDB service and was announced at Microsoft Build 2017 as the industry's first globally distributed, multi-model database.^[2] The service provides single-digit millisecond response times at any scale through automatic and independent partitioning, with throughput and storage scaling elastically to handle petabyte-scale workloads. It supports multiple APIs for flexibility, including the native Azure Cosmos DB API for NoSQL (SQL querying), compatibility with MongoDB vCore and API for MongoDB, Apache Cassandra, Apache Gremlin for graph data, Azure Table API for key-value, and Azure Cosmos DB for PostgreSQL, allowing developers to use familiar tools without data migration. A core strength is its turnkey global distribution, enabling data replication across any number of Azure regions with configurable consistency levels (strong, bounded staleness, session, consistent prefix, or eventual) and multi-region writes for active-active architectures, ensuring data sovereignty and low-latency access worldwide. Azure Cosmos DB guarantees 99.999% high availability via a comprehensive SLA, enterprise-grade security features like customer-managed keys, private endpoints, and compliance with standards such as GDPR, HIPAA, and ISO 27001, and integrates with Azure AI services for vector search and generative AI applications.^[1] Notably, it powers high-scale workloads for organizations like OpenAI, supporting the reliability and scalability of applications such as ChatGPT.^[3] Recent enhancements include serverless throughput options for cost efficiency and the general availability of Cosmos DB in Microsoft Fabric, announced in November 2025, extending its capabilities to data analytics workflows.^[4]

History and Overview

Development History

Azure Cosmos DB originated as Project Florence in 2010, an internal Microsoft initiative designed to resolve scalability challenges encountered in large-scale applications, such as elastic scaling of throughput and storage to manage unpredictable workload spikes.^[2] This project addressed pain points in services requiring high availability and global distribution, evolving through internal iterations to support Microsoft's own workloads before external release.^[2] The technology underlying Cosmos DB saw extensive internal deployment at Microsoft for nearly a decade prior to its public launch, powering critical systems and refining its multi-model and distributed capabilities in production environments.^[5] On May 10, 2017, Microsoft officially announced Azure Cosmos DB and made it generally available as a fully managed, globally distributed, multi-model database service, building on the foundations of the earlier Azure DocumentDB while introducing turnkey global replication and multiple consistency models.^[6]^[7] Key milestones in its evolution include the introduction of serverless mode in 2020, which enabled on-demand throughput provisioning without fixed capacity commitments, simplifying development for bursty workloads.^[8] In 2023, Cosmos DB added vector search capabilities, initially for the MongoDB vCore API, allowing efficient storage, indexing, and querying of high-dimensional vectors to support AI-driven applications.^[9] That same year, support for the PostgreSQL API reached general availability in October 2022, extending Cosmos DB's relational capabilities with distributed scaling via the Citus extension; however, as of October 2025, Azure Cosmos DB for PostgreSQL is no longer supported for new projects, with Microsoft recommending alternatives such as Azure Database for PostgreSQL.^[10]^[11] By 2025, Cosmos DB continued to advance with integrations to Microsoft Fabric and enhanced support for AI workloads, announced at Microsoft Build in May, including automatic scaling optimizations tailored for generative AI scenarios.^[12] These updates featured expanded vector indexing, such as the general availability of DiskANN for large-scale similarity searches up to 4,096 dimensions, further embedding Cosmos DB in AI ecosystems.^[13]

Key Features and Benefits

Azure Cosmos DB provides turnkey global distribution, enabling automatic data replication to any number of Azure regions with a 99.999% availability service level agreement (SLA) for both reads and writes in multi-region configurations.^[14] This high availability is maintained even during regional outages, ensuring business continuity without manual intervention.^[14] The service guarantees read latency under 10 ms and write latency under 15 ms at the 99th percentile using direct connectivity, delivering low-latency performance worldwide.^[15] Elastic scalability allows seamless adjustment of throughput and storage capacity without downtime, supporting instant scaling to meet varying workloads.^[16] Multi-model support accommodates diverse data types, including document, key-value, graph, column-family, and vector models, without enforcing schemas for flexible storage and querying via multiple APIs.^[16] Built-in security features encompass encryption at rest and in transit, role-based access control (RBAC) via Microsoft Entra ID, and private endpoints for secure connectivity.^[17] For cost efficiency, Azure Cosmos DB offers serverless throughput, which charges only for consumed request units, and provisioned throughput modes with autoscale options to match predictable or variable demands.^[18] It integrates deeply with the Azure ecosystem, facilitating serverless computing via Azure Functions, real-time analytics with Azure Synapse Link, and AI workloads through vector search capabilities.^[16]

Architecture and Data Model

Resource Model

Azure Cosmos DB organizes its resources in a hierarchical model, with the account serving as the top-level resource that scopes and manages all underlying databases and containers. An account represents a unit of administrative division within an Azure subscription, enabling features such as multi-region replication and consistency configuration. By default, a subscription supports up to 250 accounts, though this limit can be increased to 1,000 via a support request.^[19] Accounts are created and managed through the Azure portal, Resource Manager templates, PowerShell, CLI, SDKs, or REST API.^[20] Databases function as logical namespaces within an account, grouping one or more containers to provide isolation and organization for related data sets. Each database can optionally provision shared throughput, allowing up to 25 containers to draw from a common pool of request units (RUs) per second, with a maximum of 1,000,000 RUs (increasable via support). This shared model simplifies provisioning for workloads with varying demands across containers, while dedicated throughput can be assigned directly to individual containers outside of databases. Databases map to concepts like keyspaces in the Cassandra API but are not applicable in the Table API.^[20]^[19] Containers represent the core scalable units for data storage and operations in Cosmos DB, holding items and supporting horizontal scaling through partitioning. Each container requires a partition key to distribute data across logical partitions, with no upper limit on total storage capacity per container—enabling petabyte-scale datasets when properly partitioned. However, individual logical partitions are capped at 20 GB of storage (temporarily increasable via support), and physical partitions, which are managed internally, hold up to 50 GB. Containers support schema-agnostic data ingestion, automatic indexing, time-to-live (TTL) policies, change feeds, and server-side programmability via stored procedures, triggers, and user-defined functions. Throughput for containers can be provisioned as dedicated (standard or autoscale) or shared via the parent database.^[20]^[19]^[21] Items are the atomic units of data within containers, typically represented as JSON documents in the NoSQL API or equivalents such as rows in the Table API, documents in MongoDB, or vertices/edges in Gremlin. Each item must have a unique id (user-provided, up to 1,023 bytes) within its logical partition and an optional partition key value (up to 2,048 bytes). The maximum size for an item is 2 MB (measured by the UTF-8 length of its JSON representation), beyond which performance and cost efficiency may degrade. Cosmos DB automatically adds system-generated properties to items for management and operations: _rid provides a unique resource identifier, _ts records the last update timestamp (in Unix epoch seconds), and _etag enables optimistic concurrency control by tagging resource versions. Items support standard CRUD operations (create, read, update, delete, upsert) across all APIs.^[20]^[19]

Multi-model Support

Azure Cosmos DB provides multi-model support by natively accommodating various data models within a single, globally distributed database service, allowing developers to select the most suitable model and API for their applications without managing multiple disparate databases.^[16] This capability is enabled through compatibility with multiple open-source APIs, each tailored to specific data paradigms, while leveraging a shared infrastructure for storage and operations.^[22] The service supports a document model using JSON-like structures for handling semi-structured data, primarily via the API for NoSQL, which stores items as flexible, hierarchical documents.^[16] For simple, high-performance lookups, it offers a key-value model through the Table API, where data is stored and retrieved using unique keys without complex querying needs.^[22] The column-family model, suited for wide-column stores and analytical workloads, is provided by the Apache Cassandra API, enabling efficient handling of sparse, multidimensional data.^[16] In addition, the graph model represents interconnected data as vertices and edges, supported by the Apache Gremlin API for traversing relationships in social networks, recommendation engines, or fraud detection systems.^[22] Cosmos DB also extends compatibility to MongoDB's BSON documents via the MongoDB API (in both request unit and vCore modes), preserving the binary JSON format for document-oriented applications, and to PostgreSQL's relational structures through the PostgreSQL API, allowing SQL-based relational modeling with ACID transactions; as of November 2025, the PostgreSQL API is no longer supported for new projects, though existing deployments remain supported—Microsoft recommends Azure Database for PostgreSQL Flexible Server for new relational workloads.^[16] For the NoSQL-compatible APIs (NoSQL, MongoDB, Cassandra, Gremlin, and Table), Cosmos DB employs a unified storage engine built on an atoms-records-sequences (ARS) design, which abstracts the underlying storage to map diverse data models seamlessly without requiring separate physical databases.^[22] The PostgreSQL API, however, uses a separate distributed engine based on Citus and native PostgreSQL storage. This enables developers to choose and switch among the NoSQL-compatible APIs per container—units within the resource hierarchy that hold model-specific items—without data migration, as the same data can be accessed via different APIs by adjusting the client configuration.^[20] For instance, a container provisioned for the NoSQL API stores JSON items, while one for Cassandra stores rows, yet both reside in the same account and share global distribution features.^[16] The PostgreSQL API follows a distinct resource model with clusters, databases, schemas, and distributed tables. The following table illustrates how containers adapt to different models by defining the entity types for data storage:

API	Container Entity	Data Entity
API for NoSQL	Container	Item (JSON)
API for Cassandra	Table	Row
API for MongoDB	Collection	Document (BSON)
API for Gremlin	Graph	Node or Edge
API for Table	Table	Item (key-value)
API for PostgreSQL	Table	Row (relational)

Cosmos DB's schema-agnostic approach further enhances flexibility, permitting mixed data types and evolving schemas within the same container, as all data is automatically indexed regardless of structure to support fast queries and development agility.^[16] This design is particularly beneficial for polyglot persistence scenarios, where applications require multiple data models to coexist efficiently.^[22]

APIs and Query Languages

Core (SQL) API

The Core (SQL) API in Azure Cosmos DB provides a JSON-based querying interface inspired by ANSI SQL, enabling developers to query semi-structured data stored as JSON documents in containers. This API supports core SQL clauses such as SELECT for projecting specific properties or entire documents, FROM to specify the container as the data source, and WHERE for filtering based on conditions like equality, range, or logical operators. Aggregate functions like COUNT, SUM, MIN, MAX, AVG, and DISTINCT are also available to perform computations over document sets, facilitating common analytical operations directly within queries. JOIN operations are supported but limited to intra-partition joins, allowing correlations between documents sharing the same partition key without cross-partition overhead.^[25] Indexing policies in the Core (SQL) API ensure efficient query performance by default applying range indexing to all paths in JSON documents, which supports equality, range, and order-by queries on strings and numbers. This default policy automatically indexes every property unless customized, promoting broad query flexibility. For optimization, users can define exclusion paths to omit non-queryable properties from indexing, reducing storage costs, write latency, and Request Unit (RU) consumption— for example, excluding system-generated fields like _etag. Custom policies may also include spatial or composite indexes for specialized queries, but the core focus remains on range indexing for general-purpose performance tuning.^[26] Server-side programmability in the Core (SQL) API allows execution of JavaScript-based stored procedures, triggers, and user-defined functions (UDFs) to encapsulate business logic within the database. Stored procedures enable transactional operations across multiple items in a single partition, such as bulk imports or updates, ensuring atomicity via optimistic concurrency control. Pre-triggers execute before document operations (e.g., validation), while post-triggers run afterward (e.g., logging), both participating in the same transaction to maintain consistency. UDFs extend query capabilities by allowing custom scalar functions invoked within SELECT clauses, such as complex calculations, without affecting transactions.^[27] Full-text search capabilities in the Core (SQL) API are integrated with Azure AI Search (formerly Azure Cognitive Search), enabling advanced text indexing and querying of JSON documents as a data source. This integration supports features like semantic ranking, faceting, and AI-enriched search (e.g., entity recognition), where Cosmos DB containers are indexed via an indexer to power relevance-based retrieval. Queries can combine full-text with SQL filters for hybrid scenarios, such as searching product descriptions while filtering by category.^[28] Query metrics provide detailed insights into execution performance, including total elapsed time (broken down by document load, index lookup, and VM execution phases), RU consumption per query, and the number of documents scanned or retrieved. These metrics help identify bottlenecks, such as inefficient filters leading to full scans, and include optimization recommendations like adding indexes or rewriting predicates. Access is available through SDK diagnostics or REST API responses, with index metrics specifically highlighting utilized versus missing indexes for query improvement.^[29] Geospatial queries in the Core (SQL) API support GeoJSON data types, including Point, LineString, Polygon, and MultiPolygon, adhering to the WGS-84 coordinate reference system for real-world location representation. Built-in functions like ST_DISTANCE, ST_WITHIN, and ST_INTERSECTS enable proximity searches, bounding box filters, and polygon containment checks, such as finding points within a radius or intersecting regions. Indexing for geospatial data is handled via the container's policy, with spatial indexes automatically applied to GeoJSON paths to ensure efficient query resolution without full scans.^[30]

MongoDB, Cassandra, Gremlin, and Table APIs

Azure Cosmos DB provides multiple APIs beyond its core SQL interface to ensure compatibility with popular open-source databases and ecosystems, allowing developers to use existing applications, drivers, and tools with minimal modifications. These include the API for MongoDB, API for Apache Cassandra, API for Apache Gremlin, and API for Table, each emulating the wire protocol and query languages of their respective origins while leveraging Cosmos DB's underlying distributed architecture for scalability and global replication. This multi-model support enables seamless integration into diverse workloads, such as document stores, wide-column stores, graph databases, and key-value systems, all mapped to a shared resource model of databases, containers, and items.^[24] The API for MongoDB offers full compatibility with the MongoDB wire protocol, supporting versions up to 8.0 in the vCore architecture and enabling the use of standard MongoDB drivers, SDKs, and tools. It accommodates document-oriented workloads with features like aggregation pipelines for complex data processing, automatic sharding for horizontal scaling across partitions, and replica sets for high availability through native replication. For instance, aggregation stages such as $match, $group, and $sort are fully supported, allowing pipelines to perform filtering, grouping, and sorting on large datasets without code changes. However, multi-document transactions are limited to operations within a single non-sharded collection. A key limit is the maximum document size of 16 MB, applicable to collections created after enabling the feature, aligning with MongoDB's standard constraints to prevent oversized payloads.^[31]^[32]^[31] Migration from open-source MongoDB to the Cosmos DB API for MongoDB typically involves no application code changes due to wire protocol compatibility, with data transfer facilitated by tools like the Azure Database Migration Service or the Cosmos DB Data Migration Tool, which handle bulk imports from JSON or MongoDB exports while preserving indexes and schemas. Pre-migration assessments focus on verifying compatibility for features like geospatial queries and change streams, ensuring workloads can leverage Cosmos DB's tunable consistency and global distribution post-migration.^[33]^[34] The API for Apache Cassandra serves as a drop-in replacement for Apache Cassandra versions 3.0 and later, supporting the Cassandra Query Language (CQL) v3.11 with backward compatibility to v2.x, and integrating seamlessly with existing Cassandra client drivers via the CQL Binary Protocol v4. It enables wide-column data modeling with commands like CREATE TABLE, INSERT, SELECT, [UPDATE](/page/Update), and DELETE, including support for data types such as bigint, uuid, and aggregates like avg and [count](/page/Count), while offering per-operation tunable consistency levels—such as eventual or strong—for reads and writes. This allows applications to maintain Cassandra-specific patterns, like denormalized tables for high-throughput queries, without refactoring. API-specific limits include restrictions on certain advanced commands, such as CREATE FUNCTION or ALTER ROLE, to align with Cosmos DB's managed service boundaries.^[35]^[35]^[36] For migration from Apache Cassandra, paths emphasize zero-downtime options using dual-write proxies or Spark-based ETL processes, where existing CQL queries and drivers connect directly to Cosmos DB endpoints, facilitating live data synchronization from on-premises or cloud clusters while optimizing for Cosmos DB's partitioning model. Tools like cqlsh's COPY command or Azure Databricks enable bulk data movement, with post-migration adjustments primarily for throughput provisioning to match Cassandra's eventual consistency defaults.^[37]^[38] The API for Apache Gremlin provides graph database capabilities compliant with Apache TinkerPop 3.4, supporting the Gremlin traversal language for querying property graphs with vertices, edges, and properties through operations like addV(), addE(), out(), and in(). It enables complex relationship traversals, such as pathfinding and pattern matching, using breadth-first search for optimized performance in distributed environments, and integrates with open-source Gremlin SDKs via wire protocol support. Key features include automatic indexing of graph elements and compatibility with graph analytics tools, though it diverges from pure TinkerPop in traversal strategy to prioritize scalability. Limits include a maximum operator depth of 400 unique steps per traversal and a repeat limit of 32 iterations for loops like .repeat(), capping effective traversal depth at 32 hops to manage resource consumption in large graphs. Recommended client libraries are version 3.4.13.^[39]^[40]^[40] Migration from open-source Gremlin graphs to Cosmos DB involves exporting data in formats like GraphSON or CSV and importing via the Bulk Executor library or Spark connectors, with no changes required for Gremlin query syntax due to protocol compatibility; this path supports gradual cutover by running parallel traversals during data sync.^[41] The API for Table emulates Azure Table Storage for simple key-value and tabular data operations, supporting existing SDKs and tools with wire protocol compatibility for tasks like entity insertion, updates, and deletions using partition keys and row keys. It facilitates OData querying for filtering, sorting, and selecting properties—such as $filter=PartitionKey eq 'value' and $orderby=RowKey—along with LINQ support in .NET, enabling efficient lookups on semi-structured data without schema enforcement. This API is suited for scenarios like IoT telemetry or session stores, with automatic indexing on all properties to accelerate queries. Unlike other APIs, it enforces a 2 MB limit per entity to maintain performance, distinct from MongoDB's larger document allowance.^[42]^[43]^[19] Migration from Azure Table Storage or similar key-value systems requires no code alterations, with the Cosmos DB Data Migration Tool handling bulk imports from CSV or direct connections, optimizing for Cosmos DB's global distribution while preserving OData query patterns for seamless application portability.^[34]

PostgreSQL API

Azure Cosmos DB for PostgreSQL provided a distributed relational database service compatible with PostgreSQL, supporting standard SQL queries and extensions like Citus for horizontal scaling across nodes. It allowed developers to use familiar PostgreSQL tools, drivers, and applications for relational workloads with Cosmos DB's global distribution and scalability. As of October 2025, Azure Cosmos DB for PostgreSQL is no longer supported for new projects; existing deployments continue, but Microsoft recommends migrating to Azure Database for PostgreSQL flexible server or Azure SQL Database for new relational applications.^[44]

Scaling and Partitioning

Logical Partitioning

In Azure Cosmos DB, logical partitioning organizes data by grouping items that share the same partition key value into a logical partition, which serves as the unit of distribution and scalability. A partition key, such as /userId, is a JSON property path defined when creating a container, ensuring all items with the same key value (e.g., "user123") are stored together in one logical partition. This design enables efficient single-partition transactions and queries, as operations within a logical partition do not require cross-partition coordination.^[21] Physical partitions are the underlying storage units managed by Azure Cosmos DB, each capable of holding up to 50 GB of data and providing up to 10,000 request units per second (RU/s) of throughput. Multiple logical partitions are distributed across these physical partitions based on an internal hashing mechanism, with the system automatically splitting physical partitions when they exceed storage or throughput limits to maintain performance and availability. Each logical partition itself is limited to 20 GB of storage and 10,000 RU/s, preventing any single partition from becoming a bottleneck.^[21]^[19] Selecting an effective partition key is crucial for optimal performance and involves choosing a property with high cardinality to ensure even data and workload distribution across logical partitions, thereby avoiding hotspots where one partition receives disproportionate traffic. The key should also be immutable throughout the item's lifecycle to prevent costly data movement between partitions, and it must align with common query patterns to minimize cross-partition operations. For example, using /customerId for e-commerce orders distributes load evenly if customer activity is balanced, while avoiding low-cardinality keys like /region that could concentrate data in few partitions.^[21] Cross-partition queries, which span multiple logical partitions, are supported but less efficient than single-partition queries, as they require fan-out to all relevant physical partitions, increasing latency and RU consumption (typically 2-3 additional RU/s per physical partition involved). To optimize, queries should include the partition key in the filter whenever possible. For legacy datasets lacking a natural high-cardinality key, synthetic partition keys—such as concatenating fields like /customerId_orderDate—can be used to artificially distribute data evenly without altering the original schema.^[21]^[45] Throughput provisioned to a container is distributed evenly across physical partitions, with each logical partition capped at 10,000 RU/s to enforce balanced scaling.^[21]

Throughput Provisioning Models

Azure Cosmos DB uses Request Units (RUs) as a normalized measure of throughput, representing the resources required for read and write operations regardless of the underlying infrastructure such as CPU, memory, and IOPS.^[46] For example, a point read of a 1 KB item costs 1 RU, while costs for indexed reads and writes vary based on item size, indexing policies, and query complexity.^[46] In the provisioned throughput model, users allocate a fixed amount of RU/s to either databases or containers, ensuring predictable performance for steady workloads.^[47] Throughput can be provisioned at the database level, where it is shared across up to 25 containers without per-container guarantees, or at the container level for dedicated allocation.^[47] This model supports two variants: standard (manual) throughput, which maintains a fixed RU/s rate (minimum 400 RU/s), and autoscale throughput, which automatically scales between 10% and 100% of a defined maximum RU/s (up to 10x the base), making it suitable for workloads with variable demand.^[48] Provisioned throughput is billed hourly based on the allocated or highest scaled RU/s, providing cost efficiency for consistent, high-volume applications.^[48] Serverless mode offers an alternative by eliminating upfront provisioning, charging only for RUs consumed per request, which is ideal for bursty, unpredictable, or low-traffic workloads such as development, testing, or sporadic usage.^[18] In this mode, operations are limited to a maximum of 5,000 RU/s per request, with billing at the end of the period (e.g., approximately $0.25 per million RUs).^[18] Unlike provisioned throughput, serverless does not support multi-region distribution and is more cost-effective when average traffic is below 10% of peak.^[49] RU consumption follows deterministic formulas: point reads cost 1 RU per KB, indexed reads vary with the number of properties and index coverage (e.g., up to 10+ RUs for complex queries), and writes typically cost 5-10 RUs per KB depending on storage overhead.^[46] Throughput limits include a maximum of 1,000,000 RU/s per container or database (increasable via support), with no more than 10,000 RU/s per logical partition to ensure even distribution.^[19] For shared database provisioning, the total RU/s applies across all containers, potentially requiring careful workload balancing.^[47] Overall, provisioned throughput suits steady, production loads with reserved capacity, while serverless minimizes costs for intermittent access without idle-time charges.^[18]

Global Distribution

Multi-region Replication

Azure Cosmos DB provides multi-region replication to distribute data globally across Azure regions, ensuring low-latency access for applications serving users worldwide. This feature automatically replicates data to all regions associated with the account, supporting both single-write and multi-write configurations. In single-write setups, one region handles all writes while others serve reads; multi-write mode enables active-active replication where multiple regions accept writes.^[14] Replication occurs synchronously within a local region to a majority of replicas for durability, followed by asynchronous propagation to other regions. This approach balances performance and availability, with writes acknowledging after local commitment while background processes sync changes across regions. In multi-master configurations, any writable region can process writes, distributing load and reducing latency for global users; enabling this involves adding regions via the Azure portal and activating multi-region writes, which can be done without downtime. In multi-region write accounts, use Session, Consistent Prefix, or Eventual consistency for optimal performance, as Strong and Bounded Staleness have limitations.^[50]^[51]^[52] When concurrent writes to the same item occur across regions, conflicts are resolved using built-in policies. The default last-writer-wins mechanism uses timestamps to determine the prevailing version, while custom handlers allow applications to define resolution logic, such as merging or selecting based on business rules. Read regions operate independently, enabling targeted placement for optimal access speeds without affecting write locations.^[53]^[54] Failover and recovery mechanisms include manual intervention or automatic service-managed failover during outages. For strong consistency, the recovery point objective (RPO) is effectively zero, as operations commit only after quorum acknowledgment across replicas. Accounts support adding multiple regions—up to the number available in Azure—for linear throughput scaling; each additional write region multiplies the effective write capacity proportionally to provisioned request units per second (RU/s).^[55]^[56]

Tunable Consistency Models

Azure Cosmos DB provides five tunable consistency levels for read operations, allowing developers to balance data freshness, performance, and availability based on application needs. These levels—Strong, Bounded Staleness, Session, Consistent Prefix, and Eventual—are configurable at the account level or overridden per request, with Session as the default. This tunability is enabled by Cosmos DB's multi-region replication, which supports varied quorum requirements across replicas. Strong consistency is available only for single-region write accounts.^[57] Strong consistency offers linearizability, ensuring reads always return the most recent committed version of an item with full ACID guarantees, including no dirty reads, non-repeatable reads, or phantom reads. It requires a global majority quorum for writes and a local minority quorum for reads, resulting in the highest latency (approximately twice the round-trip time plus 10 ms at the 99th percentile for writes) and reduced availability during network partitions. Performance-wise, it consumes more request units (RUs), roughly twice as many for reads compared to weaker levels, due to minority quorum reads.^[57] Bounded Staleness provides a form of session-like consistency with bounded lag, guaranteeing monotonic reads and writes where data is no older than a specified number of versions (minimum K=10 for single-region accounts) of an item or a time interval (minimum T=5 seconds), with higher minima for multi-region accounts (K=100,000, T=300 seconds). Bounded Staleness is not recommended for multi-region write configurations. It ensures reads reflect the newest committed writes within the staleness bounds and supports consistent prefix reads within the region. This level uses minority quorum reads and local majority writes, offering near-strong consistency but with higher RU consumption for reads (about twice that of eventual) and potential write throttling if lag exceeds limits. Latency is higher than weaker levels but lower than Strong.^[57] Session consistency ensures monotonic reads and monotonic writes within a client session, identified by a session token, making it ideal for user-specific sessions where read-your-writes and write-follows-reads semantics are needed. It prevents reading intermediate or uncommitted states across session operations but allows different sessions to see different views. This level achieves low latency (under 10 ms at the 99th percentile for reads) and high availability using single-replica reads and local majority writes, with efficient RU usage similar to weaker levels. Session tokens must be managed by the application to maintain consistency across requests.^[57] Consistent Prefix guarantees that reads never see writes from aborted transactions or "tombstones" (deleted items), preserving a partial order where updates appear in the order they were committed, especially for transactional batches. It does not provide monotonicity or bounded staleness but avoids out-of-order reads for single-item operations under eventual consistency. With single-replica reads and local majority writes, it offers low latency and high throughput, consuming fewer RUs than Strong or Bounded Staleness.^[57] Eventual consistency provides the weakest guarantees, with no ordering or monotonicity, where replicas may return stale data that eventually converges without bounds on freshness. It supports the highest availability and lowest latency (under 10 ms at the 99th percentile for reads) using single-replica reads and local majority writes, resulting in the lowest RU consumption for reads. This level is suitable for scenarios where occasional staleness is acceptable, such as non-critical caches.^[57] The default consistency level is Session, set at the account level and applying to all operations unless overridden for individual read requests via SDK headers or parameters; changes to the default require application redeployment. Strong and Bounded Staleness incur higher RU costs for reads due to stricter quorums, while Session, Consistent Prefix, and Eventual optimize for speed and cost efficiency.^[57]^[46]

Advanced Capabilities

Analytical Store

The Azure Cosmos DB analytical store is a fully managed, isolated columnar data store designed to enable hybrid transactional and analytical processing (HTAP) directly on operational data without requiring extract, transform, load (ETL) pipelines or data movement. Integrated via Azure Synapse Link, it allows organizations to perform large-scale analytics on the same dataset used for real-time transactions, ensuring performance isolation between operational and analytical workloads. This capability supports APIs such as NoSQL, MongoDB, and Gremlin (in preview), with data automatically synchronized in a one-way manner from the row-oriented transactional store to the analytical store.^[58]^[59]^[60] The synchronization process exports changes continuously and automatically to the analytical store in Parquet format, a columnar storage optimized for compression and query performance, resulting in a significantly smaller storage footprint compared to the transactional store due to the columnar compression in Parquet format. This near real-time export leverages change detection mechanisms to capture inserts, updates, and deletes with low latency, typically making data available for analytics within minutes. Users can query the analytical store using Apache Spark for distributed processing or T-SQL via serverless SQL pools in Azure Synapse Analytics, facilitating complex aggregations, joins, and historical analysis on operational data without impacting application performance.^[59]^[61]^[62] Retention in the analytical store is governed by the Analytical Time-to-Live (ATTL) property, configurable at the container level with a default value of -1 indicating infinite retention; setting a positive integer value in seconds enforces automatic expiration of data beyond that period, with storage costs applying based on the retained volume. The analytical store preserves time-to-live (TTL) settings from the transactional store, ensuring that expired items are removed from both stores to maintain consistency. It also handles schema evolution seamlessly, accommodating changes in data structure over time without requiring manual interventions or breaking existing queries, thanks to its schema-agnostic design that infers and adapts to evolving JSON documents.^[58]^[63]^[64] As of July 2025, in public preview, the analytical store integrates with Microsoft Fabric through a mirroring feature, enabling zero-ETL replication of Cosmos DB data directly into Fabric's OneLake for unified analytics across lakehouse, warehouse, and real-time intelligence workloads. This integration extends Synapse Link capabilities within Fabric, allowing seamless access to operational data for advanced analytics while maintaining data governance and security.^[59]^[65]

Vector Search and Change Feed

Azure Cosmos DB provides built-in vector search capabilities designed for handling high-dimensional vectors in AI-driven applications. This feature enables efficient indexing and querying of vector embeddings, supporting up to 4,096 dimensions using algorithms such as DiskANN, a Microsoft Research-developed approximate nearest neighbor index that optimizes for low-latency searches on large-scale datasets.^[13]^[66] DiskANN facilitates disk-based storage and retrieval, reducing compute overhead by up to 95% compared to traditional in-memory methods while maintaining high recall rates for similarity searches.^[67] Vector search in Cosmos DB supports hybrid queries that combine k-nearest neighbors (kNN) algorithms with full-text search, allowing developers to filter results using structured queries alongside semantic similarity based on metrics like cosine similarity, dot product, or Euclidean distance.^[13] This integration enhances applications in generative AI by storing embeddings directly within JSON documents, enabling semantic search over unstructured data such as text or images. Cosmos DB integrates natively with Azure OpenAI for generating these embeddings, streamlining workflows for retrieval-augmented generation (RAG) scenarios where relevant context is fetched to inform AI models.^[68]^[69] The Change Feed in Azure Cosmos DB serves as a time-ordered, persistent log capturing all changes to items in a container, including inserts, updates, and deletes, processed in the exact sequence they occur to ensure reliable event ordering.^[70] This feed can be consumed through various mechanisms, such as the Cosmos DB SDK for custom applications, Azure Functions for serverless event handling, or Apache Spark for distributed processing in analytics pipelines.^[71]^[72] To support distributed consumers, the Change Feed Processor library manages leases on feed progress, enabling multiple instances to share workload without duplication or loss, with configurable parallelism based on logical partitions.^[73] Consumers can filter the feed starting from a specific sequence number, allowing resumption from a known point in the change history for fault-tolerant processing.^[70] Reactive triggers in Azure Functions automatically invoke code on new changes, facilitating real-time serverless architectures for tasks like data synchronization or notifications.^[72] Announced on July 14, 2025, as part of the public preview of Cosmos DB in Microsoft Fabric, enhancements to vector search include improved DiskANN indexing for scalable deployments and deeper integration with CI/CD pipelines via Git for version-controlled management of vector indexes.^[74] These updates also expanded hybrid search to incorporate full-text scoring models like BM25 alongside vector kNN, boosting relevance in mixed-query scenarios for AI applications.^[74]^[75]

Monitoring and Use Cases

Diagnostics and Tools

Azure Cosmos DB provides a suite of built-in diagnostics and tools to facilitate performance tuning, debugging, and observability, enabling developers to monitor resource utilization, query efficiency, and operational health without external dependencies.^[76] These tools integrate seamlessly with the Azure portal and SDKs, offering granular insights into request processing and system behavior to optimize workloads and troubleshoot issues proactively.^[77] Query metrics deliver a detailed per-query breakdown, including Request Unit (RU) consumption, execution time, and index utilization, accessible directly through the Azure portal or via SDK methods such as ServerSideCumulativeMetrics in the .NET SDK.^[29] This allows users to analyze query performance at both aggregate and partitioned levels, identifying bottlenecks like inefficient indexing or suboptimal query patterns that impact RU usage in provisioned throughput models.^[78] Diagnostics logs capture comprehensive data on requests, errors, and resource metrics, with native integration into Azure Monitor for real-time and historical analysis.^[79] By enabling diagnostic settings, users can route these logs to storage accounts, event hubs, or directly to Azure Monitor Logs, providing visibility into data plane operations such as create, read, update, and delete activities.^[76] As of June 2025, diagnostics logging has been enhanced for Azure Cosmos DB for MongoDB vCore, supporting advanced querying via Azure Monitor Log Analytics.^[80] The Capacity Planner serves as a predictive tool for estimating required RU throughput based on workload characteristics, including document size, read/write ratios, and query complexity, helping to provision cost-effective resources.^[81] Available in the Azure portal, it simulates scenarios to forecast RU/s needs and associated costs, supporting informed scaling decisions for various API models like NoSQL or MongoDB.^[82] Early profiling capabilities in Cosmos DB have evolved, with the legacy profiler tool largely supplanted by modern Azure Cosmos DB Insights in the Azure portal, which highlight query hot spots, latency trends, and failure patterns through interactive visualizations.^[77] These insights consolidate metrics from multiple sources, offering a unified dashboard for debugging performance anomalies without requiring SDK instrumentation.^[83] Alerting mechanisms within Azure Monitor enable proactive notifications for key metrics such as throughput utilization, request latency, and failure rates, configured via metric-based rules in the Azure portal.^[84] Users can define thresholds for conditions like high RU consumption or availability drops, triggering actions such as email notifications or automated remediation through Azure Logic Apps.^[76] For advanced analysis, diagnostics logs can be exported to Log Analytics workspaces, where Kusto Query Language (KQL) enables complex querying of operational data, including filtering by request type, status codes, or geographic regions.^[79] This integration supports custom dashboards and long-term retention, enhancing observability for large-scale deployments.^[85]

Real-world Applications

Cosmos DB has been widely adopted in the gaming industry for managing high-throughput workloads such as leaderboards and player profiles. Microsoft's Xbox services leverage Cosmos DB to handle real-time updates for multiplayer games, supporting millions of writes per second during peak events like global tournaments.^[86]^[87] This capability ensures low-latency access to player statistics and session data, enabling seamless experiences for millions of concurrent users across distributed regions, facilitated by Cosmos DB's global distribution features. In the retail sector, Cosmos DB powers e-commerce platforms by storing dynamic catalogs and enabling personalized recommendations. It supports rapid querying of inventory and customer behavior data, helping businesses handle seasonal spikes in traffic without performance degradation, as demonstrated in various retail workloads.^[86] For Internet of Things (IoT) applications, Cosmos DB excels in ingesting and processing telematics data from connected devices. It facilitates event logging for vehicle fleets and smart sensors, where high-velocity streams of location, sensor readings, and diagnostics are stored and analyzed in real time.^[86] Automotive companies use this for predictive maintenance and route optimization, accommodating billions of events daily through automatic scaling. Social media platforms utilize Cosmos DB for handling user-generated content and dynamic feeds. LinkedIn employs it to store and retrieve profiles, posts, and connections, supporting personalized newsfeeds for over a billion users with consistent low-latency access worldwide.^[88]^[89] The database's multi-model support allows efficient modeling of graph-like relationships, such as social graphs, ensuring reliable delivery of real-time notifications and content updates. In artificial intelligence applications, Cosmos DB serves as a vector store for retrieval-augmented generation (RAG) in chatbots and generative AI tools. Demonstrations at Microsoft Build 2025 showcased its vector search capabilities for embedding storage and similarity queries, enabling context-aware responses in enterprise AI scenarios like knowledge retrieval systems.^[90]^[91] This integration with Azure AI services supports scalable RAG pipelines, improving accuracy in applications from customer support bots to content generation. As of April 2025, real-world AI use cases were highlighted at Azure Cosmos DB Conf, including change feed for real-time AI apps.^[92] Financial services firms apply Cosmos DB for real-time fraud detection, processing transaction streams with sub-millisecond query latencies. It stores payment events and user patterns, allowing machine learning models to flag anomalies instantly during high-volume operations like online banking.^[93]^[94] This approach enhances security by correlating global transaction data across regions for comprehensive risk assessment. As of 2025, over 3,600 companies across industries, including finance, utilize Cosmos DB for mission-critical workloads.^[95]

Limitations and Considerations

Performance Constraints

Azure Cosmos DB imposes a maximum size limit of 2 MB per item, measured as the UTF-8 length of its JSON representation, to ensure efficient storage and query performance across its distributed architecture.^[19] This constraint applies uniformly to documents in the NoSQL API, preventing oversized items that could degrade replication and indexing processes.^[19] Transactions in Azure Cosmos DB are strictly limited to operations within a single logical partition, providing full ACID compliance with snapshot isolation for reads and writes confined to that scope.^[96] Multi-item transactions, such as those executed via stored procedures or triggers, must involve items sharing the same partition key; attempts to span multiple logical partitions require client-side coordination, as native support for distributed transactions across partitions is not available.^[96] This design prioritizes consistency within partitions while leveraging horizontal scaling for overall throughput.^[97] Azure Cosmos DB does not support native joins across partitions, necessitating data denormalization or application-level logic to assemble related information from multiple documents or containers.^[98] Similarly, there is no enforcement of complex relationships or foreign keys, as the service lacks schema constraints; instead, it favors embedding related data within documents to minimize query latency, though referencing patterns can be used for frequently updated or unbounded relationships at the cost of additional read operations.^[98] These approaches encourage denormalized models to optimize performance in a distributed environment.^[98] Throughput in Azure Cosmos DB is constrained by the capacity of physical partitions, each of which can handle up to 10,000 request units per second (RU/s), effectively capping container throughput based on the number of partitions allocated—typically allowing up to 100,000 RU/s for a container with 10 physical partitions, though higher scales are possible with more partitions and no strict account-level upper limit beyond practical distribution.^[21] Accounts can provision up to 1 million RU/s or more, but exceeding default thresholds may require support requests to adjust quotas.^[19] In serverless mode, additional constraints apply, including a maximum of 20,000 RU/s per container and lack of support for features such as multi-region writes.^[19] By default, Azure Cosmos DB indexes all property paths in items using an inverted index structure, which incurs additional overhead on write operations as each property must be processed and stored in the index.^[99] This automatic indexing enhances query efficiency for range scans and lookups but can increase request unit consumption for inserts and updates, particularly in documents with many or complex properties; custom indexing policies allow exclusion of unused paths to mitigate this impact.^[99] Partitioning strategies can help distribute these loads evenly, reducing hotspots and enabling higher overall performance within the defined constraints.^[21]

Cost and Best Practices

Azure Cosmos DB employs a consumption-based pricing model centered on Request Units (RUs), which measure the computational resources required for database operations, along with separate charges for storage and data transfer. For provisioned throughput, users are billed at $0.008 per 100 RU/s per hour for standard configurations, with autoscale options offering equivalent monthly rates such as $5.84 per 100 RU/s per month for general-purpose workloads in a single region. In serverless mode, suitable for intermittent workloads, billing occurs per request at $0.25 per 1,000,000 RUs consumed, without minimum commitments.^[100] Storage is charged at $0.25 per GB per month for transactional data across replicated regions, while analytical storage costs $0.03 per GB per month; a free tier provides 1,000 RU/s and 25 GB of storage monthly for one account per subscription.^[101] To optimize costs, developers should select partition keys that distribute data and workload evenly, preventing hotspots that lead to inefficient RU consumption and potential throttling.^[102] Point reads, which retrieve a single item by its ID and partition key, consume minimal RUs—typically 1 RU for a 1 KB item—and are preferable over queries for known items.^[103] Enabling Time to Live (TTL) on items automates cleanup of expired data, reducing storage charges without manual intervention.^[104] Denormalizing data to align with common query patterns minimizes cross-partition operations, which can increase RU usage, while customizing indexing policies excludes unnecessary paths to lower both storage and query costs.^[103]^[104] Best practices include continuous monitoring of RU consumption via Azure Monitor to identify over-provisioning, where unused throughput results in wasted expenditure, and scaling provisioned RU/s dynamically to match workload patterns.^[105] Selecting consistency levels per operation—such as eventual consistency for reads to halve RU costs compared to strong consistency—balances performance and expense without compromising application needs.^[103] For multi-tenant applications, allocating dedicated throughput to individual tenants or using logical partitioning isolates resource usage, avoiding interference and enabling precise cost attribution.^[102] Common pitfalls involve hot partitions, where uneven data distribution concentrates requests on few partitions, inflating RU demands and triggering throttling that indirectly raises costs through retries.^[102] Over-provisioning throughput beyond actual needs leads to unnecessary hourly charges, while neglecting to optimize queries can multiply RU usage for complex operations.^[105] In 2025, integrations like autoscale throughput in Microsoft Fabric (in preview as of July 2025) enable automatic scaling for lakehouse workloads by provisioning resources only during active use, enhancing cost efficiency for variable patterns compared to fixed provisioning.^[106]^[12]

References

[1]
Azure Cosmos DB
Azure Cosmos DB is a fully managed, serverless, non-relational NoSQL vector database for document data, with built-in vector search and dynamic scaling.
[2]
Azure Cosmos DB: The industry's first globally-distributed, multi ...
May 10, 2017 · Today, we're excited to announce the general availability of Azure Cosmos DB. Azure Cosmos DB is the first globally-distributed data service ...
[3]
Azure Cosmos DB documentation - Microsoft Learn
Azure Cosmos DB documentation ... Fully managed, distributed NoSQL, relational, and vector database for modern app development. High performance, high ...Azure Cosmos DB for NoSQLUnified AI DatabaseResource modelSee moreApache Gremlin
[4]
Announcing Cosmos DB in Microsoft Fabric Featuring New ...
Jul 14, 2025 · The Preview of Cosmos DB in Microsoft Fabric, is now available to all users. Following its initial announcement at Microsoft Build 2025, ...
[5]
Cosmos DB - Revision #10
May 15, 2018 · It started as an internal project called "Florence" at Microsoft and then later expanded to Document DB and then finally to Cosmos DB. It is ...Missing: 2010 | Show results with:2010
[6]
Microsoft announces new tools and services to help developers ...
May 10, 2017 · Introducing Azure Cosmos DB, built from the ground up to power ... General availability of Visual Studio 2017 for Mac, which enables ...
[7]
New innovations at Microsoft Build 2017: Helping developers ...
May 10, 2017 · Today, we announced Azure Cosmos DB, the first globally distributed, multi-model database service delivering turnkey global distribution ...
[8]
Azure updates | Microsoft Azure
May 20, 2020 · Azure Cosmos DB will introduce a new serverless pricing model in the coming months. Serverless will handle traffic bursts on demand, making ...Missing: date | Show results with:date
[9]
Introducing Integrated Vector Database in Azure Cosmos DB for ...
Jun 14, 2023 · You can store, index, and query high dimensional vector data stored directly in Azure Cosmos DB for MongoDB vCore, eliminating the need to transfer your data.
[10]
Postgres 16 available in Azure Cosmos DB for PostgreSQL ...
Sep 30, 2023 · Today we're announcing that Postgres 16 is generally available for production workloads on Azure Cosmos DB for PostgreSQL.
[11]
What's new in Azure Cosmos DB for NoSQL at Microsoft Build 2025
May 19, 2025 · Explore Azure Cosmos DB for NoSQL, offering automatic scaling and high availability for modern applications and generative AI workloads.
[12]
What's New at Microsoft Build 2025 - Azure Cosmos DB Blog
May 19, 2025 · DiskANN vector indexing (GA): Enable large-scale vector similarity search for AI production workloads, supporting up to 16,000 dimensions ...
[13]
Distribute Data Globally with Azure Cosmos DB | Microsoft Learn
Jul 8, 2025 · Azure Cosmos DB is a globally distributed database system that allows you to read and write data from the local replicas of your database.Missing: launch | Show results with:launch
[14]
Consistency level choices - Azure Cosmos DB | Microsoft Learn
Sep 3, 2025 · For multi-region write accounts with two or more regions, data is replicated from the region it was originally written in to all other writable ...Guarantees Associated With... · Bounded Staleness... · Session Consistency
[15]
Unified AI Database - Azure Cosmos DB - Microsoft Learn
Sep 3, 2025 · Azure Cosmos DB is a fully managed NoSQL, relational, and vector database. It offers single-digit millisecond response times, automatic and instant scalability.Azure Cosmos DB for NoSQL · How to choose an API · NET (NoSQL)Missing: 2019 | Show results with:2019<|separator|>
[16]
Security overview and guidance - Azure Cosmos DB - Microsoft Learn
Microsoft Entra authentication using role-based access control is the recommended way to implement authentication in Azure Cosmos DB.Security baseline · Data plane security reference · Microsoft Ignite
[17]
Compare Provisioned Throughput and Serverless | Microsoft Learn
Aug 1, 2025 · Azure Cosmos DB is available in two different capacity modes: provisioned throughput and serverless. You can perform the exact same database operations in both ...Missing: 2019 | Show results with:2019
[18]
Azure Cosmos DB service quotas and default limits - Microsoft Learn
Sep 3, 2025 · Azure Cosmos DB supports CRUD and query operations for resources like containers, items, and databases. It also supports transactional batch ...
[19]
Databases, containers, and items - Azure Cosmos DB
Sep 3, 2025 · To manage your data and provisioned throughput, you create one or more databases within your account and then create one or more containers to store your data.Elements in an Azure Cosmos... · Azure Cosmos DB databases
[20]
Partitioning and horizontal scaling - Azure Cosmos DB
Oct 8, 2025 · The total data storage (each individual physical partition can store up to 50 gigabytes of data). Note. Physical partitions are an internal ...
[21]
Frequently asked questions about Azure Cosmos DB - Microsoft Learn
Multi-model refers to Azure Cosmos DB's support for multiple APIs and data models. In this paradigm, different APIs use different data formats for storage and ...General · Migrating Azure Cosmos DB...
[22]
Queries in Azure Cosmos DB for NoSQL - Microsoft Learn
Azure Cosmos DB for NoSQL has the ability to query data by writing queries using the Structured Query Language (SQL) as a JSON query language.Subqueries · Self-joins · Keywords · Count
[23]
Indexing policies - Azure Cosmos DB - Microsoft Learn
May 8, 2025 · Learn how to configure and change the default indexing policy for automatic indexing and greater performance in Azure Cosmos DB.Overview of Indexing · Microsoft Ignite · Manage indexing policies
[24]
Write Stored Procedures, Triggers, and UDFs - Azure - Microsoft Learn
Jul 10, 2025 · You can create and execute triggers, stored procedures, and UDFs by using the Azure portal, the JavaScript query API in Azure Cosmos DB, and ...
[25]
Azure Cosmos DB NoSQL indexer - Azure AI Search - Microsoft Learn
Oct 8, 2025 · Set up a search indexer to index data stored in Azure Cosmos DB for vector and full text search in Azure AI Search.
[26]
SQL query metrics for Azure Cosmos DB for NoSQL | Microsoft Learn
Aug 14, 2024 · Queries that need to be executed on all partitions have higher latency, and can consume higher RUs.About SQL query execution · Best practices for query...
[27]
Geospatial and GeoJSON location data - Azure Cosmos DB for ...
Jun 11, 2025 · Azure Cosmos DB for NoSQL has built-in geospatial functionality to represent geometric shapes or actual locations/polygons on a map.Azure Cosmos DB for NoSQL
[28]
Choose an API in Azure Cosmos DB - Microsoft Learn
Sep 3, 2025 · Learn how to choose between APIs for NoSQL, MongoDB, Cassandra, Gremlin, and Table in Azure Cosmos DB based on your workload requirements.
[29]
MongoDB Query Language (MQL) features and compatibility
Sep 23, 2025 · Azure Cosmos DB for MongoDB (vCore) service uses the MongoDB wire protocol, which provides seamless compatibility with existing tools and ...Network Protocol Support · Query Language Support
[30]
7.0 Supported Features and Syntax - Azure Cosmos DB for MongoDB
Aug 20, 2025 · Azure Cosmos DB for MongoDB 7.0 supports familiar MongoDB features, comprehensive query language, and database commands like find, insert, and ...
[31]
Pre-data migration steps - Azure Cosmos DB for MongoDB
Aug 14, 2024 · The first pre-migration step is to discover your existing MongoDB resources and assess the readiness of your resources for migration.<|control11|><|separator|>
[32]
Migrate Data Using the Desktop Data Migration Tool - Microsoft Learn
Jul 8, 2025 · Use the desktop data migration tool to migrate data from JSON, MongoDB, SQL Server, or Azure Table storage to Azure Cosmos DB.Prerequisites · Install the desktop data...
[33]
Apache Cassandra features supported by Azure Cosmos DB for ...
Jun 6, 2025 · The Azure Cosmos DB for Apache Cassandra is compatible with Cassandra Query Language (CQL) v3.11 API. It's backward-compatible with version 2.x.Cassandra protocol · Azure Managed Instance for...
[34]
Apache Cassandra and Azure Cosmos DB consistency levels
Sep 26, 2024 · Apache Cassandra provides a write consistency level and a read consistency level, to enable the high availability, consistency, and latency tradeoffs.
[35]
Live migrate data from Apache Cassandra to the Azure Cosmos DB ...
Jun 6, 2025 · Ability to use existing code and tools: Azure Cosmos DB provides wire protocol level compatibility with existing Cassandra SDKs and tools. This ...
[36]
Migrate data from Apache Cassandra to the Azure Cosmos DB for ...
Feb 14, 2025 · Ability to use existing code and tools: The Azure Cosmos DB provides wire protocol-level compatibility with existing Cassandra SDKs and tools.Add Dependencies · Create Scala Notebook For... · Rate Limiting (429 Error)
[37]
Azure Cosmos DB for Gremlin graph support and compatibility with ...
Azure Cosmos DB supports Gremlin, closely following TinkerPop, but has some differences, such as using breadth-first traversal and not supporting Gremlin ...Compatible client libraries · Supported Graph Objects
[38]
Azure Cosmos DB for Gremlin limits - Microsoft Learn
Oct 28, 2025 · This article talks about the limits of Azure Cosmos DB for Gremlin engine and explains how they may impact customer traversals.
[39]
Migrate hundreds of terabytes of data into Azure Cosmos DB
Aug 14, 2024 · This article describes the challenges involved in moving large-scale data to Azure Cosmos DB and introduces you to the tool that helps with the challenges.
[40]
Introduction/Overview - Azure Cosmos DB for Table | Microsoft Learn
Key features of the API for Table include: Wire protocol compatibility: Use your existing Azure Table Storage SDKs and tools with minimal or no code changes.
[41]
Tutorial: Query Azure Cosmos DB by using the API for Table
Aug 14, 2024 · The Azure Cosmos DB for Table supports OData and LINQ queries against key/value (table) data. This article covers the following tasks:Prerequisites · Query on PartitionKey and...
[42]
https://learn.microsoft.com/en-us/azure/cosmos-db/table/overview
[43]
Request Units in Azure Cosmos DB - Microsoft Learn
Sep 3, 2025 · Azure Cosmos DB supports many APIs, such as SQL, MongoDB, Cassandra, Gremlin, and Table. Each API has its own set of database operations.
[44]
Introduction to provisioned throughput in Azure Cosmos DB
Jul 24, 2025 · APPLIES TO: NoSQL MongoDB Cassandra Gremlin Table. Azure Cosmos DB allows you to set provisioned throughput on your databases and containers ...
[45]
How to choose between manual and autoscale on Azure Cosmos DB
Aug 14, 2024 · Azure Cosmos DB supports two types or offers of provisioned throughput: standard (manual) and autoscale. Both throughput types are suitable for ...Overview of provisioned... · Understand your traffic patterns
[46]
Azure Cosmos DB serverless account type - Microsoft Learn
Jul 24, 2025 · The Azure Cosmos DB serverless option best fits scenarios in which you expect intermittent and unpredictable traffic and long idle times.Missing: introduction 2019
[47]
Redundancy, replication, and backup | Azure Docs
Jun 23, 2025 · Those write operations are synchronously replicated within the region, and then replicated asynchronously across other regions. Azure Cosmos DB ...
[48]
Configure Multi-Region Writes - Azure Cosmos DB - Microsoft Learn
Jul 15, 2025 · To use multi-region writes, enable your Azure Cosmos DB account for multiple regions by using the Azure portal. Specify which regions your application can ...
[49]
Multi-region writes - Azure Cosmos DB | Microsoft Learn
Sep 3, 2025 · In a multi-region-write database account with two or more regions, the first region where your account is created is called the "hub" region.
[50]
Conflict resolution types and resolution policies in Azure Cosmos DB
Aug 14, 2024 · For Azure Cosmos DB accounts configured with multiple write regions, update conflicts can occur when writers concurrently update the same item ...
[51]
Manage conflict resolution policies in Azure Cosmos DB
Aug 14, 2024 · With multi-region writes, when multiple clients write to the same item, conflicts might occur. When a conflict occurs, you can resolve the ...Create a last-writer-wins... · Create a custom conflict...
[52]
High Availability (Reliability) in Azure Cosmos DB for NoSQL
Jul 21, 2025 · Azure Cosmos DB ensures high availability with at least three replicas per region, automatic maintenance, and availability zones for failover.Resiliency of compute nodes · Availability zone support
[53]
Optimize multi-region cost in Azure Cosmos DB - Microsoft Learn
Aug 14, 2024 · This article explains how to manage costs of multi-region deployments in Azure Cosmos DB.
[54]
What is Azure Cosmos DB analytical store? - Microsoft Learn
Jun 11, 2025 · Azure Cosmos DB analytical store is a fully isolated column store for enabling large-scale analytics against operational data in your Azure Cosmos DB.Automatically Handle Schema... · Schema Constraints · Schema Representation
[55]
Analytics with Azure Synapse Link - Azure Cosmos DB
Jul 25, 2025 · Azure Synapse Link allows you to directly access Azure Cosmos DB analytical store using Azure Synapse Analytics without complex data movement.Training · Configure and use Azure... · Microsoft Ignite
[56]
Frequently asked questions about Azure Synapse Link for Azure ...
Azure Synapse Link is supported for the Azure Cosmos DB APIs for NoSQL, Gremlin, and MongoDB. The support for Azure Cosmos DB API for Gremlin is now in preview.
[57]
Query Azure Cosmos DB data using serverless SQL pool
Apr 14, 2025 · A serverless SQL pool allows you to analyze data in your Azure Cosmos DB containers that are enabled with Azure Synapse Link in near real time.
[58]
Interact with Azure Cosmos DB using Apache Spark 2 in Azure ...
In this article, you'll learn how to interact with Azure Cosmos DB using Synapse Apache Spark 2. With its full support for Scala, Python, SparkSQL, and C#,
[59]
Configure and use Azure Synapse Link for Azure Cosmos DB
Aug 14, 2024 · Learn how to enable Synapse Link for Azure Cosmos DB accounts, create a container with analytical store enabled, connect the Azure Cosmos DB ...
[60]
Expire Data in Azure Cosmos DB with Time to Live | Microsoft Learn
Jul 22, 2025 · Learn how to use time to live in Azure Cosmos DB to automatically purge documents from the system after a period of time.Time To Live For Containers... · Time To Live Configurations · Examples
[61]
Mirroring Azure Cosmos DB (Preview) - Microsoft Learn
Aug 22, 2025 · Mirroring in Microsoft Fabric provides a seamless no-ETL experience to integrate your existing Azure Cosmos DB data with the rest of your ...
[62]
Vector search in Azure Cosmos DB for NoSQL - Microsoft Learn
Azure Cosmos DB for NoSQL now offers efficient vector indexing and search. This feature is designed to handle multi-modal, high-dimensional vectors.Missing: 2023 | Show results with:2023
[63]
Azure Cosmos DB Vector Search with DiskANN Part 1: Full Space ...
Sep 5, 2024 · Azure Cosmos DB NoSQL provides scalable, low-latency vector search with DiskANN. Build modern AI apps with integrated vector indexing, ...
[64]
Vector Search using 95% Less Compute | DiskANN with Azure ...
Jun 7, 2024 · Ensure high-accuracy, efficient vector search at massive scale with Azure Cosmos DB. Leveraging Microsoft's DiskANN, more IO traffic moves ...
[65]
Vector Similarity Search - Azure Cosmos DB | Microsoft Learn
Oct 17, 2025 · Vector search is a method that helps you find similar items based on their data characteristics rather than by exact matches on a property field.
[66]
https://devblogs.microsoft.com/cosmosdb/azure-cosmos-db-vector-search-with-diskann-part-1-full-space-search/
[67]
Work with the Change Feed - Azure Cosmos DB | Microsoft Learn
Jul 8, 2025 · The change feed in Azure Cosmos DB is a persistent record of changes to a container in the order the changes occur.Change Feed Design Patterns · Change feed processor · Azure Functions trigger
[68]
Read Azure Cosmos DB Change Feed | Microsoft Learn
Jul 22, 2025 · Learn how to work with the change feed in Azure Cosmos DB by using either a push model or a pull model.
[69]
Use Azure Cosmos DB Change Feed with Azure Functions
Jul 8, 2025 · You can create small, reactive Azure Functions that are automatically triggered on each new event in your Azure Cosmos DB container's change feed.
[70]
Change Feed Processor in Azure Cosmos DB | Microsoft Learn
Oct 15, 2025 · Learn how to use the Azure Cosmos DB change feed processor to read the change feed, and learn about the components of the change feed ...
[71]
Announcing Cosmos DB in Microsoft Fabric Featuring New Capabilities! (Preview) | Microsoft Fabric Blog | Microsoft Fabric
### Summary of 2025 Updates on Cosmos DB Vector Search in Microsoft Fabric
[72]
Azure Cosmos DB Conf 2025 Agenda
Explore how Azure Cosmos DB empowers AI workflows by integrating vector storage, semantic layering, and mirroring capabilities with Microsoft Fabric for ...
[73]
Monitor Azure Cosmos DB | Microsoft Learn
Jul 1, 2025 · Azure Cosmos DB server-side metrics include throughput, storage, availability, latency, consistency, and system level metrics. On the client ...Analyze Monitoring Data · Kusto Queries · Alerts
[74]
Monitor and debug with insights in Azure Cosmos DB | Microsoft Learn
Nov 11, 2024 · This article walks through common use cases and how Azure Cosmos DB insights can be used to analyze and debug these issues.
[75]
Get SQL query performance & execution metrics - Microsoft Learn
Aug 14, 2024 · This article presents how to profile SQL query performance on Azure Cosmos DB using ServerSideCumulativeMetrics retrieved from the .NET SDK.Get Query Metrics · Cumulative Metrics · Partitioned Metrics
[76]
Monitor Data using Diagnostic Settings - Azure Cosmos DB
May 1, 2021 · Learn how to use Azure diagnostic settings to monitor the performance and availability of data stored in Azure Cosmos DB.Prerequisites · Create diagnostic settings
[77]
Estimate RU/s using the Azure Cosmos DB capacity planner
Jul 15, 2025 · This article describes how to use the Azure Cosmos DB capacity planner to estimate the required RU/s and cost of your workload when using Azure Cosmos DB for ...
[78]
Estimate costs using the Azure Cosmos DB capacity planner
Aug 14, 2024 · This article describes how to use the Azure Cosmos DB capacity planner to get an estimate of the required RU/s and cost of your workload.
[79]
Azure Monitor integration with Azure Cosmos DB - Microsoft Learn
May 19, 2025 · Azure Cosmos DB insights provide a view of the overall performance, failures, capacity, and operational health of all your Azure Cosmos DB resources.
[80]
Create alerts for Azure Cosmos DB using Azure Monitor
Aug 14, 2024 · Alerts are used to set up recurring tests to monitor the availability and responsiveness of your Azure Cosmos DB resources.
[81]
Troubleshoot issues with diagnostics queries - Azure Cosmos DB
Aug 14, 2024 · In this article, we cover how to write simple queries to help troubleshoot issues with your Azure Cosmos DB account using diagnostics logs sent ...
[82]
Common Use Cases and Scenarios for Azure Cosmos DB
May 30, 2025 · Azure Cosmos DB offers tunable consistency levels that allow fast reads with low latency writes. Hence, storing UI layout data including ...
[83]
PlayFab Roadmap Update – July 2021 - Microsoft Developer
Now, PlayStream is powered by Azure Kubernetes Service, Event Hubs and Cosmos DB. ... Stats & Leaderboards v2 in early 2022. ... In this case study, Microsoft ...
[84]
Adobe uses global cloud to create, deliver, and manage better ...
Oct 23, 2017 · Many online retailers use Adobe Experience Manager content management to power e-commerce websites. ... Azure Cosmos DB. Read the story.Missing: study | Show results with:study
[85]
Top reasons why LinkedIn uses Azure Cosmos DB - YouTube
Mar 28, 2023 · ... LinkedIn chooses Azure Cosmos DB. Learn more from Ankit Gupta, principal software engineer at LinkedIn, in this video from Azure Cosmos DB ...Missing: case | Show results with:case
[86]
Azure Cosmos DB design pattern: Social media apps | Microsoft Learn
Aug 14, 2024 · This article guides you into modeling your social platform's data with Azure's NoSQL database Azure Cosmos DB cost-effectively.Missing: study | Show results with:study
[87]
Foundry connection for Azure Cosmos DB, Global Secondary Index ...
May 20, 2025 · Learn more about all the latest features and functionality announced by the Azure Cosmos DB team at Microsoft Build 2025.
[88]
Retrieval-augmented generation (RAG) in Azure Cosmos DB
Oct 17, 2025 · RAG combines the power of large language models (LLMs) with robust information retrieval systems to create more accurate and contextually relevant responses.Missing: demo | Show results with:demo
[89]
AI in action: 5 real-world intelligent apps you can build on Azure
Mar 18, 2024 · Detect fraud in real-time with Azure Cosmos DB, AKS, and Azure AI services. In this scenario, payment transactions are processed by Azure Cosmos ...
[90]
Real-world success stories of building intelligent apps with Azure
Oct 30, 2024 · Use case #5: Prevent fraud and detect anomalies. Increasingly, organizations leverage AI to identify suspicious financial transactions, false ...
[91]
Transactions and optimistic concurrency control - Microsoft Learn
Jul 15, 2025 · All the database operations within the scope of a container's logical partition are transactionally executed within the database engine that's ...
[92]
Transactional Batch Operations in Azure Cosmos DB - Microsoft Learn
Jul 22, 2025 · The Azure Cosmos DB request size limit constrains the size of the transactional batch payload to not exceed 2 MB, and the maximum execution time ...
[93]
Data Modeling - Azure Cosmos DB for NoSQL - Microsoft Learn
Sep 3, 2025 · Discover how to model data in Azure Cosmos DB, compare relational and document databases, and optimize for performance and scalability.Numbers in JSON · Embed data
[94]
Overview of Indexing - Azure Cosmos DB - Microsoft Learn
Jul 8, 2025 · This article explains how Azure Cosmos DB indexes data and how it uses indexes to improve query performance.Indexing policies · Manage indexing policies · Count · AVG (NoSQL query)
[95]
Azure Cosmos DB Lifetime Free Tier | Microsoft Learn
Jul 8, 2025 · Account with two regions that has one container with a max of 500 RU/s provisioned throughput. Create an account with free tier. You can create ...Free Tier With Shared... · Free Tier With Azure... · Create An Account With Free...Missing: writable | Show results with:writable
[96]
Optimizing throughput cost in Azure Cosmos DB | Microsoft Learn
Aug 14, 2024 · APPLIES TO: NoSQL MongoDB Cassandra Gremlin Table. By offering provisioned throughput model, Azure Cosmos DB offers predictable performance ...<|control11|><|separator|>
[97]
Optimize Request Cost in Azure Cosmos DB | Microsoft Learn
Jul 24, 2025 · The easiest way to find out how much time the query execution is spent in the UDF and the number of RUs consumed, is by enabling the query ...Read Data: Point Reads And... · Queries · Best Practices To Optimize...
[98]
Optimize storage cost in Azure Cosmos DB | Microsoft Learn
Aug 14, 2024 · Azure Cosmos DB expects the item size to be 2 MB or less for optimal performance and cost benefits. If you need any item to store larger ...
[99]
Optimize your costs - Azure Cosmos DB - Microsoft Learn
Optimize your Azure Cosmos DB costs · Understand our pricing model · Plan for optimization · Optimize actual costs · Develop for free.
[100]
Autoscale throughput in Cosmos DB in Microsoft Fabric (preview)
Jul 14, 2025 · Cost-effective: Autoscale helps optimize your RU/s usage and cost usage by scaling down when not in use. You only pay for the resources that ...