Amazon ElastiCache
Amazon ElastiCache is a fully managed web service from Amazon Web Services (AWS) that simplifies the setup, operation, and scaling of distributed in-memory caches and data stores in the cloud, providing microsecond latency for high-performance applications.[1][2] It supports open-source compatible engines including Redis OSS, Memcached, and Valkey, allowing developers to use familiar APIs and data structures like hashes, lists, and sets with minimal code changes.[2][1] Launched in 2011, ElastiCache has evolved to handle demanding workloads, scaling to hundreds of millions of operations per second while abstracting infrastructure management.[3][4] The service operates in two primary modes: serverless, which automates capacity planning, hardware provisioning, and cluster design for instant scaling and zero-downtime maintenance; and node-based clusters, offering granular control over node types, quantities, and placement across multiple Availability Zones for customized high availability.[1][5] Key features include automatic software patching, monitoring, and backups, along with cross-Region replication through Global Datastore for Redis OSS and Valkey to ensure data durability and low-latency global access.[1] Security is integrated via Amazon Virtual Private Cloud (VPC) isolation, AWS Identity and Access Management (IAM) controls, and compliance certifications such as HIPAA eligibility, FedRAMP authorization, and PCI DSS.[2][1] ElastiCache accelerates applications by caching frequently accessed data from databases, data lakes, and analytics pipelines, reducing costs and latency in scenarios like generative AI inference, gaming leaderboards, e-commerce personalization, and real-time messaging.[2][6] For instance, it enables semantic caching in large language model (LLM) workflows to minimize redundant computations and supports pub/sub patterns for scalable event-driven architectures.[6] Recent enhancements, such as the serverless option introduced in 
November 2023 and Valkey support in October 2024, further emphasize its focus on open-source compatibility and effortless performance optimization.[7][8]

History and Development
Initial Launch
Amazon ElastiCache was initially launched on August 22, 2011, as a fully managed service providing distributed in-memory caching capabilities using the Memcached open-source engine version 1.4.5. This release introduced the ability to create cache clusters consisting of one or more cache nodes, each with configurable memory sizes ranging from 6 GB to 67 GB, deployable across AWS Availability Zones for high availability. The service was designed to integrate seamlessly with existing Memcached-compatible applications, allowing developers to leverage the AWS Management Console, APIs, or command-line tools for provisioning and management without handling underlying infrastructure.[9] The foundational purpose of ElastiCache addressed the growing demand among developers for a scalable, low-latency caching layer to accelerate data access in web applications, particularly for read-heavy workloads where repeated queries to backend data stores could create bottlenecks. By caching frequently accessed items such as session data, user profiles, or results from expensive computations and database operations, ElastiCache enabled applications to achieve sub-millisecond response times, significantly offloading relational databases like Amazon RDS and reducing their query load. In typical setups, this caching approach could deliver up to 80 times faster read performance compared to direct database access alone.[9][10] At launch, the early architecture emphasized provisioned clusters with online scalability, permitting the addition or removal of cache nodes without downtime to handle varying workloads dynamically. Key operational features included integration with Amazon EC2 for hosting applications, Amazon CloudWatch for monitoring metrics like CPU utilization and eviction rates, and Amazon Simple Notification Service (SNS) for alerts on cluster events. 
While initial support focused on Memcached's stateless, key-value data model without built-in persistence or failover, the service laid the groundwork for AWS ecosystem compatibility, with later expansions such as Virtual Private Cloud (VPC) integration in December 2012 enhancing security and isolation. Support for the Redis engine was added on September 4, 2013, introducing advanced data structures and replication capabilities to broaden ElastiCache's utility beyond simple caching.[9][11][12]

Key Evolutions and Updates
In 2023, Amazon ElastiCache introduced a serverless deployment option, enabling zero-management scaling for Redis OSS and Memcached caches that automatically adjust capacity based on application demands without requiring manual provisioning of nodes.[7] This update, launched on November 27, 2023, allows caches to be created in under a minute and supports seamless handling of variable traffic patterns, reducing operational overhead for developers.[13] Earlier, ElastiCache had added data tiering capabilities in 2021, which were expanded in subsequent years to include support for newer engines like Valkey, allowing cost-effective scaling by combining in-memory storage with solid-state drives (SSDs) for infrequently accessed data.[14] This feature enables clusters to handle up to hundreds of terabytes of data at lower costs—up to 60% savings in some workloads—while maintaining low-latency access through least-recently-used (LRU) eviction policies that promote hot data to memory.[15] By 2024, data tiering became integral to Valkey-compatible deployments, enhancing price-performance for large-scale caching scenarios.[16] A significant shift occurred in 2024 with the introduction of support for Valkey, an open-source fork of Redis OSS 7.2.4 created in response to licensing changes by Redis Inc., ensuring continued compatibility with Redis OSS 7.1 and later versions as well as Memcached 1.6.21 and above.[8] Announced on October 8, 2024, ElastiCache for Valkey version 7.2.6 provides a drop-in replacement for existing Redis OSS workloads, with upgrades available without downtime.[17] In 2025, this support advanced further with Valkey 8.1 in July, introducing memory efficiency improvements for up to 20% more data storage per node, and Valkey 8.2 in October, adding native vector search capabilities.[18][19] The Global Datastore feature, launched in 2020 for multi-Region replication, saw ongoing enhancements through 2025, including broader node type support and 
integration with Valkey for read replicas across up to two secondary Regions with sub-millisecond latencies for reads in active-passive configurations.[20] This enables disaster recovery and low-latency global reads, with data automatically synchronized from a primary cluster while allowing writes in the primary Region only.[21] By mid-2025, it extended to M5, R5, R6g, and R7g instances, making it eligible for AWS Free Tier usage.[22]

Integration expansions in recent years have tied ElastiCache more closely to AWS AI and machine learning services, particularly Amazon Bedrock and Amazon SageMaker. For Bedrock, ElastiCache's vector search in Valkey 8.2, released October 13, 2025, supports indexing and querying high-dimensional embeddings generated by Bedrock models, facilitating retrieval-augmented generation (RAG) for generative AI applications at scale.[23] With SageMaker, ElastiCache serves as a near-real-time feature store for ML inferences, caching features from SageMaker processing jobs to achieve ultra-low latency—under 10 milliseconds—for online predictions in recommendation systems and personalization workloads.[24] These native ties, highlighted in 2023–2025 documentation, enable seamless data flow between caching layers and AI pipelines without custom middleware.[25] In October 2025, ElastiCache added support for dual-stack (IPv4 and IPv6) service endpoints, improving connectivity for applications transitioning to IPv6.[26]

Architecture and Components
Supported Engines
Amazon ElastiCache supports three primary in-memory data store engines: Memcached, Redis OSS, and Valkey, each designed to handle caching and data storage with varying levels of complexity and functionality.[27] These engines can be deployed in node-based or serverless modes, allowing flexibility based on workload requirements.[27] Memcached serves as a simple, distributed key-value store optimized for basic caching operations without built-in persistence, replication, or support for advanced data structures.[27] It operates in a multi-threaded manner to achieve high-throughput reads and writes, making it suitable for non-durable caching scenarios where data loss on failure is acceptable.[27] ElastiCache supports Memcached versions 1.4.5 and later, with the latest being 1.6.22, including features like in-transit encryption starting from version 1.6.12.[28] Redis OSS provides a full-featured in-memory data store that extends beyond basic key-value operations to include persistence options such as RDB snapshots and AOF logs, pub/sub messaging, and rich data structures like sorted sets, lists, hashes, and geospatial indexes.[27] It also supports Lua scripting for custom server-side logic and clustering for horizontal sharding and high availability through automatic failover.[27] ElastiCache offers Redis OSS versions 4.0.10 and later, with the current major version at 7.1, enabling advanced capabilities like data tiering for cost optimization.[28] Valkey, introduced to ElastiCache in 2024 as a community-driven fork of Redis OSS, maintains identical APIs and compatibility while emphasizing open-source governance following changes in Redis licensing.[28] It inherits Redis OSS features such as persistence, replication, pub/sub, complex data structures, and clustering, with enhancements like sharded pub/sub and access control lists available from version 7.2 onward, as well as vector search in version 8.2 for handling vector embeddings in AI and machine learning 
applications.[27][29] Supported versions in ElastiCache start from 7.2 and extend to the latest 8.2, ensuring seamless upgrades from compatible Redis OSS clusters.[28] When selecting an engine, Memcached is preferred for applications requiring simplicity, the lowest latency, and straightforward scalability without the overhead of persistence or replication.[27] In contrast, Redis OSS or Valkey are chosen for workloads involving complex operations, such as transactions, scripting, or geospatial indexing, where data durability and advanced querying are essential; Valkey may be favored for its commitment to open-source principles.[27][28]

Core Components and Operations
The following describes the core components and operations for node-based clusters in Amazon ElastiCache, which enable efficient in-memory caching through user-managed infrastructure. Serverless mode abstracts these elements, automatically handling capacity and scaling without node or cluster management.[1] At the foundation are nodes, which serve as the basic compute units responsible for memory allocation and input/output operations. Each node provides a fixed-size chunk of RAM and is selected based on instance types ranging from small options like cache.t4g.micro to large configurations such as cache.r7g.16xlarge, all within the same cluster to ensure consistency.[30][31] Clusters represent logical groupings of one or more nodes, allowing for flexible deployment configurations. A single-node cluster offers simplicity for basic caching needs, while multi-node clusters incorporate primary-replica replication to enhance data durability and read scalability, with the primary node handling writes and replicas serving reads.[32][33] For horizontal scaling in larger deployments, ElastiCache supports sharding, which partitions data across multiple shards when cluster mode is enabled, particularly for Valkey and Redis OSS engines. Each shard consists of a primary node and up to five read replicas, enabling distribution of data and workload across 1 to 500 shards to manage high-volume applications effectively.[34][33] Key operations in ElastiCache ensure reliability and maintenance with minimal disruption. Automatic failover promotes a read replica to primary in multi-AZ deployments, typically completing in under 30 seconds to maintain availability during node failures. Backups provide point-in-time recovery, with automatic snapshots retained for up to 35 days, and manual backups stored indefinitely until deleted. 
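The slot-based partitioning behind sharding can be made concrete with a short sketch. In cluster mode, Redis OSS and Valkey assign every key to one of 16,384 hash slots using a CRC16 checksum, and each shard owns a range of slots; keys sharing a {...} hash tag are guaranteed to land on the same shard. The following is an illustrative Python implementation of that mapping, not code from any official client:

```python
def crc16_xmodem(data: bytes) -> int:
    """CRC16-CCITT (XModem variant) as used by Redis cluster key hashing."""
    crc = 0
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            if crc & 0x8000:
                crc = ((crc << 1) ^ 0x1021) & 0xFFFF
            else:
                crc = (crc << 1) & 0xFFFF
    return crc


def hash_slot(key: str) -> int:
    """Map a key to one of 16384 cluster hash slots, honoring {hash tags}."""
    start = key.find("{")
    if start != -1:
        end = key.find("}", start + 1)
        if end != -1 and end != start + 1:
            # Only the non-empty substring inside the first {...} is hashed,
            # so related keys can be forced onto the same shard.
            key = key[start + 1:end]
    return crc16_xmodem(key.encode()) % 16384
```

Because hash_slot("{user:1}:profile") and hash_slot("{user:1}:cart") hash only the tag "user:1", both keys resolve to the same slot, which is what allows multi-key operations on them within one shard.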
Patching and maintenance activities, such as engine version updates, are performed in a rolling manner across nodes in Multi-AZ setups to avoid downtime.[35][36][37][38] In typical data flow, clients connect to the cluster via a configuration endpoint, directing queries to the cache for fast retrieval; on a cache miss, the application forwards the request to a backend data store like Amazon DynamoDB before storing the result in the cache. When memory limits are reached, eviction occurs based on policies such as least recently used (LRU), which removes the least accessed items to free space while preserving frequently used data.

Features
Caching and Data Structures
Amazon ElastiCache employs several caching strategies to balance performance, consistency, and data freshness in in-memory operations. The cache-aside pattern, often implemented as lazy loading, allows applications to query the cache first and fetch data from a backing persistent store only on cache misses, with the application responsible for subsequent writes to keep the cache updated.[39] Write-through caching updates the cache and the persistent store together whenever data is written, ensuring immediate consistency, though this pattern must be implemented at the application level in ElastiCache for Redis OSS and Valkey engines.[40] Additionally, time-to-live (TTL) settings enable automatic expiration of cache entries to prevent stale data, with configurable durations that support jitter to avoid thundering herds during evictions.[41] For ElastiCache clusters using the Redis OSS or Valkey engines, a variety of advanced in-memory data structures enhance caching capabilities beyond simple key-value storage. Strings function as versatile building blocks, supporting atomic increments and decrements for use as counters in real-time analytics.[25] Lists provide efficient append and pop operations, making them suitable for implementing queues or stacks in message processing workflows.[42] Sets maintain unique unordered collections, enabling fast membership checks and set operations like unions or intersections for deduplication tasks. Sorted sets, with scored elements, facilitate ordered rankings such as leaderboards. Hashes organize field-value pairs to represent complex objects compactly. 
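The cache-aside and jittered-TTL strategies above can be sketched in plain Python. In this illustrative stand-in, a dict plays the role of the ElastiCache cluster and loader is a hypothetical read from the backing database; a real deployment would issue GET/SET with an EXPIRE against the cluster endpoint instead:

```python
import random
import time


class LazyCache:
    """Cache-aside (lazy loading) with jittered TTLs.

    A plain dict stands in for an ElastiCache cluster; this is a sketch of
    the pattern, not a client library.
    """

    def __init__(self, ttl=300.0, jitter=60.0):
        self.store = {}        # key -> (value, expires_at)
        self.ttl = ttl
        self.jitter = jitter   # spreads expirations to avoid thundering herds

    def get(self, key, loader):
        entry = self.store.get(key)
        if entry is not None and entry[1] > time.monotonic():
            return entry[0]                        # cache hit
        value = loader(key)                        # miss: read backing store
        expires_at = time.monotonic() + self.ttl + random.uniform(0, self.jitter)
        self.store[key] = (value, expires_at)      # populate for later reads
        return value
```

The random jitter added to each TTL means entries written together do not all expire in the same instant, which is the thundering-herd mitigation the TTL discussion above refers to.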
Bitmaps offer space-efficient manipulation of binary data for aggregation in user behavior analytics, while HyperLogLog structures approximate the cardinality of large sets with minimal memory overhead.[42] In contrast, ElastiCache for Memcached focuses on simplicity and high-throughput key-value operations, supporting only basic string data types with commands limited to get, set, increment, and decrement for counter-like functionality.[27] ElastiCache also supports semantic caching through vector search, available in Valkey version 8.2 on node-based clusters and compatible with the Redis OSS protocol (announced October 2025): applications store vector embeddings of prompts and responses to identify and reuse semantically similar content in generative AI workflows.[23] This approach reduces redundant large language model (LLM) inferences by matching query vectors against cached ones using similarity metrics, with configurable thresholds and metadata filtering to ensure relevance.[23] In LLM applications, semantic caching can yield significant cost savings—up to 88% with a 90% cache hit ratio—while improving response times from seconds to milliseconds by avoiding repeated computations on similar inputs.[23] These structures, such as sorted sets for leaderboards or publish/subscribe for real-time notifications, further extend caching utility in diverse scenarios.[42]

Scaling and Availability
Amazon ElastiCache supports vertical scaling through online node type modifications, allowing users to upgrade or downgrade instance types, such as from t3 to r6g, to adjust compute and memory capacity without significant disruption.[43] This process involves creating new nodes with the updated type, synchronizing data from existing nodes, and replacing old nodes while keeping the cluster operational, typically resulting in minimal downtime of seconds during the switchover.[43] Vertical scaling is available for Valkey 7.2+ and Redis OSS 3.2.10+ clusters and can be performed via the AWS Management Console, CLI, or API, either immediately or during a maintenance window.[43] Horizontal scaling in ElastiCache varies by deployment mode. In node-based clusters, auto-scaling automatically adds or removes shards and replicas based on CloudWatch metrics like CPU utilization or database capacity, enabling elastic adjustment to workload demands without manual intervention.[44] For serverless caches, scaling is instantaneous and automatic, monitoring ECPUs per second and data storage to add capacity as needed, supporting up to 5 million requests per second with sub-millisecond p50 read latency.[45] This serverless approach eliminates provisioning overhead and ensures seamless elasticity up to 90,000 ECPUs per second when using read replicas.[45] Availability in ElastiCache is enhanced through Multi-AZ deployments, which distribute nodes across multiple Availability Zones for fault tolerance and provide a 99.99% monthly uptime SLA when configured with automatic failover.[46] Automatic failover promotes a read replica with the lowest replication lag to primary status in seconds if the primary node fails, minimizing downtime without requiring manual intervention.[35] Read replicas further support availability by offloading read traffic for load balancing, distributing queries across nodes to improve throughput and resilience.[35] Data tiering optimizes availability and cost by 
automatically offloading infrequently accessed (cold) data to lower-cost SSD storage while keeping hot data in memory, using an LRU algorithm to manage eviction.[15] This feature, available on r6gd nodes for Valkey version 7.2 or later and Redis OSS version 6.2 or later, retains up to 20% of the dataset in DRAM for fast access, adds approximately 300 microseconds of latency for SSD-retrieved items, and delivers over 60% cost savings compared to memory-only nodes at full utilization by expanding effective capacity up to 4.8 times.[15] Global replication via Global Datastore enables asynchronous cross-Region data copying for disaster recovery, with primary clusters handling writes and secondary clusters providing low-latency reads.[21] Replication latency is typically under 1 second, allowing applications to access local replicas for sub-second response times while maintaining data consistency across Regions.[25] In failure scenarios, a secondary cluster can be promoted to primary in less than 1 minute, enabling rapid recovery.[25]

Security and Compliance
Amazon ElastiCache provides robust network security through integration with Amazon Virtual Private Cloud (VPC), which isolates cache clusters in a private network environment to prevent unauthorized access from the public internet.[47] Security groups act as virtual firewalls to control inbound and outbound traffic to ElastiCache clusters, allowing administrators to define rules based on IP addresses, ports, and protocols.[47] Additionally, ElastiCache supports private endpoints via VPC peering and AWS PrivateLink for secure, private connectivity to the service API without traversing the public internet.[48] For data in transit, ElastiCache enables TLS encryption, which secures communications between clients and cache nodes or among nodes within a cluster; this feature is available for Redis OSS versions 3.2.6 and later, Valkey 7.2 and later, and Memcached 1.6.12 and later, requiring deployment in a VPC and compatible client libraries.[49] Data protection in ElastiCache includes at-rest encryption using AWS Key Management Service (KMS), which encrypts data on disk during synchronization, swap operations, and backups stored in Amazon S3; customers can use either AWS-managed keys or their own customer-managed KMS keys for greater control.[50][16] This encryption is supported on specific node types and is mandatory for serverless caches, with Redis OSS 4.0.10 and later, Valkey 7.2 and later, and Memcached on serverless configurations.[50] Authentication mechanisms encompass AWS Identity and Access Management (IAM) for API-level access, role-based access control (RBAC) for fine-grained permissions on user operations, and the Redis AUTH command, which requires a password for cluster access when in-transit encryption is enabled.[51][52] ElastiCache adheres to several compliance standards, making it eligible for HIPAA to handle protected health information when configured appropriately, authorized under FedRAMP Moderate for U.S. 
government use, and compliant with PCI DSS for payment card data processing.[53][2] These validations are conducted by third-party auditors and cover all supported engines including Valkey, Memcached, and Redis OSS.[54] Audit logging is facilitated through integration with AWS CloudTrail, which captures API calls and management events for ElastiCache to support compliance monitoring and forensic analysis.[54] Advanced security features in ElastiCache for Redis include Access Control Lists (ACLs) implemented via RBAC, which allow creation of user groups with specific permissions defined by access strings to restrict commands and keys, thereby enforcing least-privilege access.[52] Parameter groups enable enforcement of security policies, such as disabling data persistence by setting parameters like appendonly to no in Redis OSS or equivalent in Valkey, preventing sensitive data from being written to disk and reducing exposure risks.[55] These configurations apply to node-based clusters and can be modified via the AWS Management Console, CLI, or SDK to tailor security postures without downtime in many cases.[56]
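As an illustration of such an access string, a user restricted to read-only commands on one key prefix could be defined along these lines (the app: prefix here is a hypothetical example; the actual string is supplied when the RBAC user is created):

```text
on ~app:* -@all +@read
```

In Redis ACL syntax, "on" enables the user, "~app:*" limits key access to the given prefix, and "-@all +@read" first revokes every command category and then grants back only the read commands, yielding least-privilege access.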
Use Cases
Application Performance Enhancement
Amazon ElastiCache enhances application performance by serving as a high-speed in-memory cache that offloads frequently accessed data from primary databases such as Amazon RDS and Amazon DynamoDB, thereby reducing database load and improving response times. By caching query results, ElastiCache can decrease the load on underlying databases by up to 90%, as demonstrated in e-commerce scenarios where read operations are offloaded to the cache. This offloading shifts latency from milliseconds typical of disk-based databases to microseconds in ElastiCache, enabling up to 80x faster read performance when integrated with Amazon RDS for MySQL.[57][25] For web applications, ElastiCache supports efficient session storage by using Redis-compatible data structures like hashes to store user sessions, including authentication details and preferences. This approach allows applications to scale statelessly across multiple instances without relying on sticky sessions or server-local storage, facilitating horizontal scaling and improved availability during traffic spikes. In practice, such session management reduces the need for database round-trips for transient data, contributing to sub-millisecond access times and seamless user experiences in high-traffic environments.[10] ElastiCache also enables robust rate limiting to prevent API abuse and maintain system stability, leveraging atomic operations such as incrementing counters for request tracking per user or endpoint. Developers can implement complex throttling logic using Lua scripts executed atomically on the server side, ensuring consistency without race conditions even under concurrent loads. 
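The counter-based throttling just described can be sketched in plain Python. This is a fixed-window variant, one of several common schemes; a dict stands in for the cluster, and the increment models what would be an atomic INCR with an EXPIRE set on first touch:

```python
import time


class FixedWindowLimiter:
    """Per-key fixed-window rate limiter; an illustrative stand-in where a
    dict replaces the cache and increments model atomic INCR operations."""

    def __init__(self, limit, window=60.0):
        self.limit = limit
        self.window = window
        self.counters = {}  # key -> (count, window_start)

    def allow(self, key):
        now = time.monotonic()
        count, start = self.counters.get(key, (0, now))
        if now - start >= self.window:
            count, start = 0, now      # window elapsed: counter "expired"
        count += 1                     # atomic INCR on a real cluster
        self.counters[key] = (count, start)
        return count <= self.limit
```

On ElastiCache the same window logic is typically a single INCR plus EXPIRE per request, or a Lua script when a sliding window or token bucket is needed.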
This capability supports millions of operations per second with microsecond response times, protecting backend resources while enforcing fair usage policies.[58] Beyond performance gains, ElastiCache contributes to cost optimization by mitigating the need to over-provision databases for peak read demands, allowing rightsizing of RDS or DynamoDB instances. For instance, in e-commerce applications handling 80% read-heavy workloads, caching can reduce database queries by up to 95%, leading to significant savings—such as a 6x cost reduction in DynamoDB capacity through targeted read offloading. These efficiencies arise from ElastiCache's ability to handle transient data at a fraction of the cost of persistent storage, without compromising scalability.[57][10]

Real-Time Data Processing
Amazon ElastiCache for Redis enables real-time data processing by leveraging its in-memory data structures to handle live data streams and interactive applications with sub-millisecond latency. This capability is particularly valuable for event-driven workloads where immediate data ingestion, updates, and retrieval are essential, such as in gaming, social platforms, and IoT systems. By supporting atomic operations and high-throughput commands, ElastiCache ensures consistency in concurrent environments without the overhead of traditional databases.[6] For leaderboards, ElastiCache utilizes Redis sorted sets to maintain real-time rankings, such as top scores in multiplayer games. Each entry consists of a unique member (e.g., user ID) associated with a score (e.g., points earned), automatically sorted in ascending order for efficient querying. Commands like ZADD update scores atomically, while ZRANGEBYSCORE or ZREVRANGE retrieve ordered ranges, such as the top 10 players, with logarithmic time complexity (O(log N + M), where M is the number of elements returned). This approach offloads computational complexity from the application to the cache, enabling updates and queries for millions of records in under a millisecond, far outperforming relational databases for similar tasks.[59][6]
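The sorted-set leaderboard pattern can be illustrated in plain Python, with a dict of member scores standing in for the sorted set: zadd becomes an assignment, zincrby an increment, and the top-N query a heap selection. On a real cluster each of these is a single server-side command, which is where the performance advantage comes from:

```python
import heapq


class Leaderboard:
    """Illustrative stand-in for a sorted set: member -> score."""

    def __init__(self):
        self.scores = {}

    def zadd(self, member, score):
        self.scores[member] = score            # ZADD: set/overwrite a score

    def zincrby(self, member, delta):
        # ZINCRBY: atomically bump a member's score on a real server
        self.scores[member] = self.scores.get(member, 0) + delta
        return self.scores[member]

    def top(self, n):
        # Equivalent of ZREVRANGE 0 n-1 WITHSCORES: highest scores first
        return heapq.nlargest(n, self.scores.items(), key=lambda kv: kv[1])
```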
Pub/sub messaging in ElastiCache facilitates broadcasting updates across channels, ideal for real-time features like chat rooms or live notifications. Publishers send messages via the PUBLISH command to a specific channel, while subscribers use SUBSCRIBE for exact matches or PSUBSCRIBE for pattern-based subscriptions (e.g., news.sports.*). Messages are fire-and-forget, delivered only to active subscribers without persistence, and channels are bound to shards for scalability. In cluster mode, ElastiCache supports horizontal scaling across multiple shards, handling high concurrency and large subscriber bases through sharding and replication.[6][60]
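The channel and pattern semantics above can be mimicked in-process. In this illustrative stand-in (not a network client), fnmatch approximates Redis-style glob patterns, messages are fire-and-forget, and the return value mirrors PUBLISH reporting how many receivers got the message:

```python
from fnmatch import fnmatchcase


class PubSub:
    """In-process sketch of pub/sub: channels and glob patterns map to
    subscriber callbacks; undelivered messages are simply dropped."""

    def __init__(self):
        self.subs = {}    # channel -> [callback]
        self.psubs = {}   # pattern -> [callback]

    def subscribe(self, channel, callback):
        self.subs.setdefault(channel, []).append(callback)

    def psubscribe(self, pattern, callback):
        self.psubs.setdefault(pattern, []).append(callback)

    def publish(self, channel, message):
        delivered = 0
        for cb in self.subs.get(channel, []):          # exact matches
            cb(channel, message)
            delivered += 1
        for pattern, cbs in self.psubs.items():        # glob patterns
            if fnmatchcase(channel, pattern):
                for cb in cbs:
                    cb(channel, message)
                    delivered += 1
        return delivered   # PUBLISH returns the receiver count
```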
Time series data processing benefits from ElastiCache's lists, sorted sets, or streams for ingesting IoT or sensor data in chronological order. For instance, sorted sets store timestamps as scores with sensor readings as members, allowing range queries via ZRANGEBYSCORE to fetch recent data points efficiently. Redis streams append entries as time-sequenced records, supporting consumer groups for parallel processing and trimming old data to manage memory. Aggregation, such as averaging sensor values over intervals, can be performed using Lua scripts for custom, atomic computations executed server-side, reducing network round trips and ensuring consistency in high-velocity streams.[61][60]
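The timestamps-as-scores approach can be sketched with a sorted list standing in for the sorted set; zrangebyscore and the windowed average below mirror what ZRANGEBYSCORE plus a small server-side Lua aggregation would compute on a real cluster (an illustrative sketch, not a client):

```python
import bisect


class TimeSeries:
    """Sorted-set sketch for sensor data: timestamp as score, reading as member."""

    def __init__(self):
        self.samples = []  # kept sorted as (timestamp, reading)

    def zadd(self, timestamp, reading):
        bisect.insort(self.samples, (timestamp, reading))

    def zrangebyscore(self, t_min, t_max):
        # Inclusive range query over timestamps, like ZRANGEBYSCORE t_min t_max
        lo = bisect.bisect_left(self.samples, (t_min, float("-inf")))
        hi = bisect.bisect_right(self.samples, (t_max, float("inf")))
        return self.samples[lo:hi]

    def average(self, t_min, t_max):
        # Interval aggregation; server-side this would be a Lua script
        window = self.zrangebyscore(t_min, t_max)
        if not window:
            return None
        return sum(reading for _, reading in window) / len(window)
```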
Message boards and threaded discussions leverage Redis hashes to store post details, enabling atomic updates in concurrent scenarios. A hash key (e.g., post:123) holds fields like content, timestamp, and reply counts, with commands such as HSET for setting values and HINCRBY for incrementing metrics like likes or views atomically. This structure supports nested threads by linking child posts via set membership or additional hash fields, ensuring thread-safe operations without locks. Multi-key transactions via the MULTI/EXEC block or Lua scripts further guarantee atomicity across related updates, such as incrementing a reply counter while appending to a list.[6][60]
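The hash-per-post layout can be mimicked with nested dicts, where hset and hincrby mirror the HSET and HINCRBY commands; on a real server each single-key command executes atomically, which is what makes the counter updates safe without locks. The post:123 key and field names below are hypothetical examples:

```python
class PostStore:
    """Illustrative stand-in for Redis hashes: key -> {field: value}."""

    def __init__(self):
        self.hashes = {}

    def hset(self, key, field, value):
        # HSET post:123 content "..." — set one field of a hash
        self.hashes.setdefault(key, {})[field] = value

    def hincrby(self, key, field, delta=1):
        # HINCRBY post:123 likes 1 — atomic on a real server
        h = self.hashes.setdefault(key, {})
        h[field] = int(h.get(field, 0)) + delta
        return h[field]
```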