VoltDB
VoltDB, now known as Volt Active Data since its rebranding in February 2022, is an in-memory NewSQL relational database management system (RDBMS) designed for high-velocity online transaction processing (OLTP) applications that require sub-millisecond latency and massive scalability.[1][2] It uses a distributed, shared-nothing architecture that stores data primarily in RAM to eliminate disk I/O bottlenecks, and it supports standard ANSI SQL queries, ACID-compliant transactions, and stored procedures written in Java.[3][2]

Developed as the commercial implementation of the academic H-Store project, VoltDB was founded in 2009 by database pioneer Michael Stonebraker, recipient of the 2014 ACM A.M. Turing Award, along with co-founder Scott Jarr and others, including researchers from MIT, Brown University, and Carnegie Mellon University.[4][5][6] The H-Store research, initiated in 2007, aimed to rethink traditional RDBMS architectures for multicore processors and cluster environments by serializing transactions to avoid locks and latches, achieving significantly higher throughput than disk-based systems such as PostgreSQL.[3] The company released the first version of VoltDB in 2010 as an open-source project under the AGPLv3 license for its community edition, with enterprise editions offering additional features such as advanced replication and support.[7][8]

Key to its design is a focus on real-time data processing, combining transactional and streaming workloads to handle millions of operations per second across clusters of commodity hardware.[9][2] VoltDB provides high availability through k-safety replication (up to five nines, or 99.999% uptime) and automatic failover, while partitioning data and procedures across nodes to enable linear scalability by adding servers.[2] It supports export to external systems such as Kafka for integration with analytics pipelines and provides tools for fault recovery without data loss.[9] Notable for powering mission-critical applications in finance, telecommunications, and IoT, such as fraud detection and ad tech, Volt Active Data addresses the limitations of legacy databases in handling explosive data growth and real-time demands.[4][10]

Overview
Definition and Core Characteristics
Volt Active Data, formerly known as VoltDB, is an ACID-compliant, in-memory relational database management system (RDBMS) designed specifically for real-time, high-throughput online transaction processing (OLTP) applications that demand sub-millisecond latency and massive scalability.[2][11] It operates as a NewSQL database, combining the familiarity of SQL with the performance of NoSQL systems to handle high-velocity data streams without compromising relational integrity.[12] This design addresses the limitations of traditional disk-based RDBMS in modern environments, where applications require processing millions of transactions per second while maintaining data consistency.[2]

At its core, Volt Active Data employs a shared-nothing architecture, where data and processing are distributed across independent nodes in a cluster, enabling horizontal scaling by adding servers without shared resources.[11] It uses horizontal partitioning to shard data automatically across nodes, supporting both partitioned and replicated tables for balanced load distribution and fault tolerance.[11] Transactions are executed serially on single-threaded engines per partition, eliminating the need for locks, latches, or multi-version concurrency control, which results in deterministic serialization and simplified ACID compliance.[2][11] SQL queries are supported through stored procedures written in Java, embedding ANSI-standard SQL for schema definition and data manipulation, allowing developers to leverage familiar relational paradigms while achieving high performance.[11]

The platform is optimized for OLTP workloads in domains requiring immediate data processing, such as financial trading for real-time fraud detection, telecommunications for call detail record (CDR) handling, and IoT for ingesting and analyzing sensor data streams.[12] In February 2022, the product was rebranded from VoltDB to Volt Active Data to better reflect its evolution into a comprehensive platform that integrates streaming data processing with transactional capabilities, enabling active decision-making on data in motion.[1]
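The stored-procedure model can be illustrated with a short sketch. The following is a minimal, hypothetical single-partition procedure; the accounts table and all names are invented for illustration and are not drawn from the product documentation.

```java
import org.voltdb.SQLStmt;
import org.voltdb.VoltProcedure;
import org.voltdb.VoltTable;

// Hypothetical single-partition procedure that debits an account.
// It would be registered in the schema with something like:
//   CREATE PROCEDURE PARTITION ON TABLE accounts COLUMN account_id
//     FROM CLASS DebitAccount;
public class DebitAccount extends VoltProcedure {

    // SQL is declared as final fields so VoltDB can plan it when the
    // procedure is loaded, not when it runs.
    public final SQLStmt getBalance = new SQLStmt(
            "SELECT balance FROM accounts WHERE account_id = ?;");
    public final SQLStmt debit = new SQLStmt(
            "UPDATE accounts SET balance = balance - ? WHERE account_id = ?;");

    // Executes as a single ACID transaction on the partition that owns
    // the given account_id.
    public VoltTable[] run(long accountId, long amount) {
        voltQueueSQL(getBalance, accountId);
        VoltTable balance = voltExecuteSQL()[0];
        if (balance.getRowCount() == 0
                || balance.fetchRow(0).getLong(0) < amount) {
            throw new VoltAbortException("insufficient funds"); // rolls back
        }
        voltQueueSQL(debit, amount, accountId);
        return voltExecuteSQL(true); // true marks the final batch
    }
}
```

Because the partitioning column (account_id) is supplied as a parameter, the entire transaction can be routed to the single partition that owns the row, with no locking or cross-node coordination.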
Licensing and Deployment Options

Volt Active Data is distributed under a dual licensing model, offering both an open-source Community Edition licensed under the GNU Affero General Public License version 3 (AGPLv3) and a proprietary commercial Enterprise Edition for production use. The Community Edition provides core database functionality, including support for multi-node clusters, while the Enterprise Edition includes advanced features such as enhanced durability options, export connectors, and professional support. Client libraries for programmatic access are separately licensed under the MIT License. In January 2025, Volt Active Data introduced the Developer Edition, a free option allowing developers and architects to evaluate the product suite easily using Docker Desktop without manual cluster configuration.[13][14]

The software is primarily written in Java for stored procedures and client interfaces, with core components implemented in C++ for performance optimization.[15] As of Volt Active Data version 15.0 (September 2025), it runs on 64-bit POSIX-compliant operating systems, with official support for Linux distributions including Red Hat Enterprise Linux (RHEL) and Rocky Linux 8.8 and later (including 9.0 and 10.0), Ubuntu 20.04, 22.04, and 24.04, and Debian 12.1 and later; macOS 13.0 (Ventura) and later is supported for development and testing only.[16] The Community Edition is suitable for single-node development and basic multi-node setups, whereas the Enterprise Edition enables scalable production clusters with additional high-availability features.

Deployment options for Volt Active Data include on-premises installations on physical or virtual servers, as well as cloud environments such as Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP).[17] For containerized deployments, Volt Active Data supports Kubernetes through the Volt Operator, a custom resource that automates cluster management, introduced in version 10.0 in August 2020, along with Helm charts for streamlined installation.[18] Installation typically involves downloading a platform-specific distribution package (ZIP for Windows or tar.gz for Unix-like systems), which includes the necessary JAR files and binaries.[19] To set up a database, users initialize the root directory using the voltdb init command with a deployment configuration file specifying cluster size, partitioning, and other parameters, followed by starting the database with voltdb start (optionally in admin mode for schema loading).[20][21] For Kubernetes deployments, the Volt Operator handles initialization and scaling via custom resources and YAML manifests.[22]
History and Development
Origins and Founding
VoltDB originated from the H-Store research project, initiated in 2007 at MIT by a team led by Michael Stonebraker, along with collaborators from Brown University, Yale University, and Carnegie Mellon University, including Sam Madden and Daniel Abadi.[23][5] H-Store was developed as an experimental prototype for a main-memory online transaction processing (OLTP) system, aiming to overcome key limitations of traditional disk-based relational database management systems (RDBMS), such as row-level locking, buffer management overhead, and multi-threaded contention that hindered scalability on multi-core hardware.[23] The project emphasized a shared-nothing architecture in which data is partitioned across nodes, allowing single-threaded execution of transactions on individual partitions to maximize performance without locking or latching.[23]

In response to the emerging NoSQL movement of roughly 2006–2009, which prioritized scalability and availability over strict ACID compliance, often at the cost of consistency and query expressiveness, Stonebraker and his team sought to retain full SQL support and ACID guarantees while achieving comparably high throughput through in-memory processing.[24] H-Store's design specifically targeted single-partition transactions to avoid the overhead of distributed consensus protocols like two-phase commit, enabling sub-millisecond latencies for the read-write workloads typical of OLTP applications.[23]

VoltDB Inc. was established in 2009 as a commercial spin-off to productize and extend the H-Store technology, with Michael Stonebraker and Scott Jarr as co-founders and Stonebraker serving as chief technology officer.[25][26] The company received its initial Series A funding of $5 million in September 2010 from investors including Sigma Partners, laying the groundwork for developing a robust, enterprise-grade in-memory database that preserved the research prototype's core innovations.[27] Early efforts focused on refining the elimination of distributed consensus for single-partition operations, positioning VoltDB as a "NewSQL" solution that bridged the performance gap between traditional RDBMS and NoSQL systems without compromising transactional integrity.[24]

Major Releases and Evolution
VoltDB's initial release, version 1.0, occurred on May 25, 2010, introducing it as an open-source in-memory relational database management system (RDBMS) designed for high-velocity online transaction processing (OLTP). This version laid the foundation for its NewSQL architecture, emphasizing single-threaded execution and in-memory storage to achieve low-latency performance.[28]

Subsequent major releases introduced key enhancements to monitoring, data handling, security, and deployment flexibility. Version 5.0, released on January 28, 2015, added the VoltDB Management Center for real-time monitoring and diagnostics, alongside expanded SQL support and integrations with Hadoop ecosystem tools like HDFS and Kafka exporters. Version 6.0, launched in January 2016, incorporated geospatial data types and functions, enabling native storage and querying of geographic locations and regions for applications like location-based services.[29] Version 7.1, released in March 2017, implemented TLS encryption for client and internal network communications, enhancing security for distributed deployments. Version 10.0, in August 2020, brought native Kubernetes support via the Volt Operator and Helm charts, simplifying cloud-native orchestration and scaling.[30] Version 11.0, issued on April 21, 2022, added support for the Java 17 runtime and integration with Datadog for advanced observability and metrics collection. Version 13, first released in 2023, further improved Kubernetes integration, performance optimizations, and operational tools for large-scale deployments.[31] In February 2022, amid these developments, the product was rebranded to Volt Active Data to reflect its expanded role in real-time data processing beyond traditional OLTP.[1]

More recent updates have focused on stability, configuration, and operational efficiency. Version 12.3.15, released on August 29, 2025, primarily addressed bug fixes and minor improvements to existing features.[32] Version 14.3.0, dated June 30, 2025, enhanced database configuration processes, including better support for initialization and schema management.[33] The latest major release, version 15, arrived on September 29, 2025; it streamlined setup procedures, improved streaming import/export capabilities, and introduced a new DATE SQL data type for precise temporal handling.[34]

Over its evolution, Volt Active Data has shifted from a pure OLTP-focused system to a translytical platform, integrating real-time analytics with transactional processing to support hybrid workloads.[35] This progression includes the addition of streaming pipelines for importing data from sources like Kafka and exporting to analytics systems, enabling continuous data ingestion and ad-hoc querying without separate batch processing.[36]

Technical Architecture
Design Principles and Data Model
VoltDB's design is fundamentally oriented toward high-velocity transactional workloads, leveraging in-memory storage to achieve low-latency performance by eliminating disk I/O for transaction processing.[37] The system assumes access to modern, reliable hardware with ample main memory and multi-core processors, enabling a shared-nothing architecture that scales horizontally across clusters.[38] A key principle is single-threaded execution per partition, which avoids the overhead of locks, latches, and multi-version concurrency control (MVCC) by processing transactions sequentially within each partition.[39] This approach prioritizes throughput over complex concurrency, ensuring predictable performance for short-lived transactions under high contention.[37]

The data model in VoltDB is relational, supporting standard SQL tables defined via DDL statements, with constraints such as primary keys, unique constraints, and NOT NULL columns.[40] Tables are partitioned horizontally on a declared partitioning column (which, if the table has a primary key, must be part of that key), distributing rows across partitions to enable parallel processing, while small, read-mostly tables can be replicated across all nodes for global access.[40] VoltDB provides materialized views, maintained automatically as base tables change, to simplify query logic, and supports tree indexes (the default), which handle ordered access and range queries, as well as hash indexes for equality lookups.[40] Single-partition transactions cannot join data across partitions; rows that must be accessed together should be co-located in the same partition, with multi-partition operations handled through serialized coordination.[39]

Partitioning forms the core of VoltDB's scalability strategy, dividing the database into independent shards, each managed by a dedicated thread on a multi-core server or across cluster nodes, to process transactions in isolation.[37] Single-partition transactions execute atomically and in parallel across partitions, maximizing throughput, while multi-partition transactions are serialized at a coordinator to ensure consistency without distributed locking.[39] This design allows linear scaling by adding partitions or nodes, as each handles a subset of the data and workload without interference.[38]

In contrast to traditional disk-oriented RDBMS, VoltDB eliminates MVCC and latching mechanisms, relying instead on deterministic serial execution to provide ACID guarantees with minimal overhead.[37] Durability is achieved through command logging, which records stored procedure invocations between periodic snapshots, rather than traditional write-ahead logging (WAL), enabling fast recovery via replay without persistent redo logs.[37] Stored procedures define transactions in a single round-trip, further reducing latency compared to ad-hoc SQL in conventional systems.[38]
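As a rough sketch of this data model, the DDL below defines one partitioned table and one replicated table and loads it through the @AdHoc system procedure from a Java client; all table names are hypothetical, and in practice schemas are more often loaded with the sqlcmd utility.

```java
import org.voltdb.client.Client;
import org.voltdb.client.ClientFactory;

// Illustrative schema: a partitioned fact table and a replicated
// lookup table. Names are invented for this example.
public class LoadSchema {
    public static void main(String[] args) throws Exception {
        Client client = ClientFactory.createClient();
        client.createConnection("localhost");

        client.callProcedure("@AdHoc",
            "CREATE TABLE accounts (" +
            "  account_id BIGINT NOT NULL," +
            "  balance    BIGINT NOT NULL," +
            "  PRIMARY KEY (account_id));");

        // Rows are hashed on account_id and spread across partitions.
        client.callProcedure("@AdHoc",
            "PARTITION TABLE accounts ON COLUMN account_id;");

        // Small, read-mostly table: left unpartitioned, so a full copy
        // is available to every partition for local joins.
        client.callProcedure("@AdHoc",
            "CREATE TABLE currencies (" +
            "  code VARCHAR(3) NOT NULL PRIMARY KEY," +
            "  name VARCHAR(64));");

        client.close();
    }
}
```

Note that the partitioning column (account_id) is part of the primary key, satisfying the constraint described above.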
Storage and Query Processing

VoltDB employs a fully in-memory storage model, where all data is held in RAM to eliminate disk access latencies and maximize throughput.[41] Tables are stored in a row-oriented format, with tuples organized into blocks for efficient access and modification within each partition.[42] This approach avoids traditional buffer pool management, as the entire dataset resides in memory, allowing direct manipulation of data structures.[39]

Memory management in VoltDB is automatic and aggressive, focusing on compaction to reclaim unused space. When updates or deletes create gaps in storage blocks, the system performs incremental compaction during single-partition transactions, moving live tuples to consolidate space and free entire blocks for reuse or release back to the operating system.[42] For variable-length data such as strings or binary values exceeding 63 bytes, VoltDB uses dedicated memory pools to minimize allocation overhead, retaining freed space for future use rather than immediately returning it to the OS.[42] This ensures stable memory usage under varying workloads, with overall consumption scaling primarily with the active dataset size.

Query processing in VoltDB centers on stored procedures written in Java, which encapsulate a transaction's logic together with its SQL statements; the embedded SQL is analyzed and planned when the procedure is loaded rather than interpreted at runtime.[41] Because execution plans are produced ahead of time, there is no runtime query optimization; the precomputed plans exploit partitioning for efficiency.[43] Single-partition queries, which access data within one partition, execute serially in microseconds thanks to the precompiled plans and the absence of locking overhead.[41] Multi-partition queries are coordinated by a designated initiator, which orchestrates a two-phase commit across the involved sites to ensure atomicity.[44]

The execution model enforces deterministic serializability, processing transactions in a single-threaded manner per partition to achieve full ACID compliance without locks, latches, or traditional concurrency controls.[44] Each partition runs commands sequentially, guaranteeing serializable isolation by design; non-deterministic elements, such as random functions, are either restricted or made deterministic through controlled inputs like procedure-specific seeds.[44] For ad-hoc SQL queries, VoltDB supports execution via the @AdHoc system procedure or tools like sqlcmd, but these receive limited optimization compared to stored procedures, lacking precompilation and full partitioning awareness, which makes them suitable only for infrequent or exploratory use.[43]

Durability is provided through synchronous command logging, which records invocations of stored procedures to disk in real time, enabling transaction-level recovery without logging individual data changes.[45] This log captures the exact commands executed, allowing replay for crash recovery while minimizing I/O overhead.[41] Complementing this, periodic snapshots capture the full database state to disk at configurable intervals, serving as checkpoints for faster restoration and protection against irrecoverable failures.[45] Together, these mechanisms ensure data persistence while maintaining high performance, with command logs providing fine-grained durability and snapshots offering efficient bulk recovery.[46]
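The two query paths can be contrasted from a Java client, as in the sketch below, which reuses the hypothetical DebitAccount procedure and accounts table from the earlier examples; the @AdHoc path is planned at runtime and is intended for occasional use.

```java
import org.voltdb.VoltTable;
import org.voltdb.client.Client;
import org.voltdb.client.ClientFactory;
import org.voltdb.client.ClientResponse;

// Contrasting a precompiled stored procedure call with an ad hoc query.
public class QueryPaths {
    public static void main(String[] args) throws Exception {
        Client client = ClientFactory.createClient();
        client.createConnection("localhost");

        // Compiled path: single-partition procedure, pre-planned SQL,
        // routed directly to the partition owning account_id 42.
        ClientResponse r = client.callProcedure("DebitAccount", 42L, 100L);
        if (r.getStatus() == ClientResponse.SUCCESS) {
            VoltTable t = r.getResults()[0];
            System.out.println("rows updated: " + t.asScalarLong());
        }

        // Ad hoc path: planned on the fly, fine for exploration but
        // slower than a precompiled procedure.
        VoltTable top = client.callProcedure("@AdHoc",
                "SELECT account_id, balance FROM accounts " +
                "ORDER BY balance DESC LIMIT 10;").getResults()[0];
        while (top.advanceRow()) {
            System.out.printf("%d: %d%n", top.getLong(0), top.getLong(1));
        }

        client.close();
    }
}
```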
Distribution and Replication Mechanisms

VoltDB operates on a shared-nothing architecture, partitioning data and execution across nodes in a cluster to enable horizontal scalability without shared resources like locks or disks.[37] Tables are automatically partitioned using a hash of the partitioning column's values, distributing partitions evenly across available nodes during initialization.[37] This design ensures that single-partition transactions execute locally on one node, avoiding inter-node communication for the majority of workloads.[39] Elastic scaling supports dynamic cluster resizing by adding or removing servers without downtime; new nodes join the cluster, and partitions are automatically rebalanced to maintain even distribution.[47] The system supports linear throughput scaling for single-partition operations up to hundreds of nodes, though multi-partition transactions introduce coordination overhead that can create bottlenecks at scale.[37] Single-partition transactions require no two-phase commit, relying instead on the single-threaded execution model per partition for atomicity.[48]

Intra-cluster replication achieves fault tolerance through k-safety, where each partition maintains k+1 copies (typically k=1 or k=2) across distinct nodes, allowing the cluster to tolerate up to k node failures without interruption.[49] All replicas are active peers, synchronously receiving and executing updates to ensure consistency; a recovered node rejoins the cluster by copying current partition contents from a surviving replica before resuming normal execution.[49] For cross-site reliability, asynchronous database replication (DR) streams binary logs of committed transactions in parallel per partition to remote clusters, enabling disaster recovery and global distribution with minimal latency impact on the primary site.[50]

Node failures are detected by the cluster through network monitoring, triggering automatic failover to surviving replicas and subsequent partition rebalancing to restore full k-safety.[51] This process ensures high availability, with the database continuing operations on remaining nodes while logging the failure for administrative recovery.[51]

Key Features
Performance Optimizations
VoltDB employs compiled stored procedures to achieve zero-interpretation overhead during execution. Stored procedures are written in Java, compiled to bytecode, and packaged in the application catalog, allowing VoltDB to pre-optimize execution plans for embedded SQL statements based on the schema and query patterns. This approach ensures deterministic, high-speed transaction processing without runtime parsing or interpretation costs.[52]

For handling bulk data operations, VoltDB supports batching mechanisms in its import and export connectors to minimize latency and overhead. During exports, connectors such as JDBC batch multiple INSERT statements, typically in approximately two-megabyte chunks, to external databases, reducing the number of individual network calls. Similarly, the Kafka importer retrieves records in batches from topics before invoking stored procedures, enabling efficient ingestion of large volumes without per-record overhead.[53]

Tunable durability settings balance performance and data persistence in high-throughput environments. Administrators can configure command logging as synchronous for full ACID compliance or asynchronous to prioritize speed, with adjustable queue sizes and snapshot frequencies to control recovery point objectives without compromising query latency.[54] This flexibility allows applications to scale ingestion rates while maintaining configurable levels of fault tolerance.

VoltDB's indexing strategies optimize access patterns for in-memory storage. Hash indexes excel at equality-based lookups, offering constant-time retrieval for primary keys or unique constraints by mapping values directly to locations. Sorted (tree) indexes, the default type, support range queries and ordered scans efficiently through balanced tree structures, making them suitable for analytical filters or inequality operations. Partial indexing further reduces memory usage by applying indexes only to rows matching specific predicates, such as non-null values or status flags, thereby minimizing footprint in sparse datasets.

Built-in streaming integrations facilitate low-latency data pipelines via export connectors to targets like Kafka and JDBC. The Kafka connector serializes VoltDB streams into producer messages, enabling real-time distribution to topics for downstream processing with minimal buffering delays. JDBC exports stream changes directly to relational databases, supporting asynchronous or synchronous modes to integrate VoltDB outputs into hybrid ecosystems without custom middleware.[55]

Performance tuning in VoltDB includes per-partition thread pooling and JVM configuration to sustain high throughput. Each database partition executes transactions serially on a dedicated thread, avoiding locks while leveraging thread pools for I/O operations like network exports to prevent bottlenecks. JVM optimizations, such as heap sizing and garbage collection tuning (e.g., using G1GC with low-pause targets), minimize stalls in memory-intensive workloads by allocating sufficient off-heap storage for indexes and data.
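On the client side, throughput-oriented applications typically pair these server-side optimizations with asynchronous procedure invocation, so that many requests stay in flight at once rather than waiting on each round trip. The sketch below uses the classic Java client's callback interface; the RecordEvent procedure is hypothetical.

```java
import org.voltdb.client.Client;
import org.voltdb.client.ClientFactory;
import org.voltdb.client.ClientResponse;
import org.voltdb.client.ProcedureCallback;

// Illustrative asynchronous ingest loop: calls are pipelined by the
// client, and the server's per-partition threads execute them serially.
public class AsyncIngest {
    public static void main(String[] args) throws Exception {
        Client client = ClientFactory.createClient();
        client.createConnection("localhost");

        ProcedureCallback callback = new ProcedureCallback() {
            @Override
            public void clientCallback(ClientResponse response) {
                if (response.getStatus() != ClientResponse.SUCCESS) {
                    System.err.println(response.getStatusString());
                }
            }
        };

        // Fire-and-callback: no blocking between invocations.
        for (long i = 0; i < 100_000; i++) {
            client.callProcedure(callback, "RecordEvent",
                    i, System.currentTimeMillis());
        }

        client.drain(); // wait for all outstanding responses
        client.close();
    }
}
```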
Durability and Fault Tolerance

VoltDB ensures data durability primarily through command logging and periodic snapshots, which together provide a robust mechanism for persisting in-memory data to disk without fully sacrificing its high-performance design.[45] Command logging records every transaction invocation as a binary log entry appended to disk, capturing the stored procedure calls rather than their full outcomes to minimize storage overhead and I/O impact.[45] This logging can operate in synchronous mode, where each transaction is written to disk before completion to guarantee no data loss, or asynchronous mode, where logs are buffered in memory and flushed at configurable intervals (such as every 1-4 milliseconds or after a set number of transactions) for better throughput at the risk of minor data loss in crashes.[56] Complementing command logging, binary snapshots create full, transactionally consistent backups of the database state, typically configured to occur automatically every 5 to 60 minutes, serving as checkpoints that truncate preceding logs to manage storage.[46]

For fault tolerance, VoltDB employs K-safety, which replicates each data partition across K+1 nodes in the cluster, ensuring that the system can withstand the failure of up to K nodes without losing availability or data.[49] For instance, with K=1, every partition has two copies on distinct nodes, allowing the cluster to continue operations using the surviving replica if one node fails, followed by automatic promotion and rebalancing of partitions to restore full redundancy across the remaining nodes.[49] Point-in-time recovery is achieved by restoring the most recent snapshot and then replaying the command logs to reconstruct all subsequent transactions up to the failure point, enabling precise data restoration during cluster restarts.[45]

Despite its in-memory nature, which introduces volatility on node crashes, VoltDB mitigates this through the aforementioned logging and snapshots, though full synchronous durability incurs disk I/O overhead that can reduce transaction throughput.[45] For scenarios requiring asynchronous persistence to external systems, export tables stream data out of VoltDB in a non-blocking manner to targets like Kafka or file systems, decoupling durability from core database performance.[36] The recovery process involves halting the cluster, reinitializing from the latest snapshot via the voltadmin restore command, and replaying logs to catch up, ensuring minimal downtime.[57]
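Snapshots are usually scheduled in the deployment configuration, but they can also be requested manually through the @SnapshotSave system procedure. The following sketch assumes the positional form of that call (target directory, unique identifier, blocking flag); the path and nonce are examples only.

```java
import org.voltdb.client.Client;
import org.voltdb.client.ClientFactory;

// Requesting a manual snapshot. Automated snapshots configured in the
// deployment file are the more common approach in production.
public class TakeSnapshot {
    public static void main(String[] args) throws Exception {
        Client client = ClientFactory.createClient();
        client.createConnection("localhost");
        // Arguments: directory to write to, unique snapshot ID, and a
        // blocking flag (1 = pause transactions until the snapshot
        // completes, 0 = snapshot in the background).
        client.callProcedure("@SnapshotSave",
                "/var/voltdb/snapshots", "backup1", 1);
        client.close();
    }
}
```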
To address geo-redundancy and disaster recovery, VoltDB supports Active(N) database replication (DR), which asynchronously mirrors selected tables across multiple clusters in different data centers using binary logs per partition.[50] This setup allows independent operation of each cluster with eventual consistency, incorporating conflict resolution policies—such as last-writer-wins or custom stored procedures—to handle simultaneous updates on the same records during failover or synchronization.[50] DR integrates seamlessly with command logging and snapshots, permitting individual clusters to recover locally without disrupting replication.[50]
Integration and Extensibility
VoltDB provides multiple client interfaces for connecting applications to the database, enabling seamless integration across various programming languages and environments. The JDBC driver allows Java applications to interact with VoltDB using standard database connectivity methods, supporting ad hoc queries, prepared statements, stored procedure invocations, and metadata examination. Connections are established via URLs like jdbc:voltdb://server:port, with optional parameters for security and topology awareness, requiring the VoltDB JAR and Guava library in the classpath.[58] Additionally, the JSON over HTTP interface facilitates access from non-Java clients by sending HTTP requests to port 8080, where stored procedure parameters are encoded as JSON arrays and responses return results in JSON format, supporting languages like PHP, Python, and Perl.[59]
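A minimal JDBC session might look like the following sketch; the table and procedure names are hypothetical, while the driver class and default client port (21212) follow the documented conventions.

```java
import java.sql.CallableStatement;
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

// Standard JDBC access to VoltDB: ad hoc SQL plus a procedure call.
public class JdbcExample {
    public static void main(String[] args) throws Exception {
        Class.forName("org.voltdb.jdbc.Driver");
        try (Connection conn =
                 DriverManager.getConnection("jdbc:voltdb://localhost:21212")) {
            // Ad hoc query through the standard Statement interface.
            try (Statement stmt = conn.createStatement();
                 ResultSet rs = stmt.executeQuery("SELECT COUNT(*) FROM accounts")) {
                rs.next();
                System.out.println("accounts: " + rs.getLong(1));
            }
            // Stored procedure invocation through CallableStatement.
            try (CallableStatement cs = conn.prepareCall("{call DebitAccount(?, ?)}")) {
                cs.setLong(1, 42L);
                cs.setLong(2, 100L);
                cs.execute();
            }
        }
    }
}
```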
For native performance, VoltDB offers dedicated Java and C++ client libraries. The Java Client2 API, the modern and recommended interface included in the distribution as of 2025, supports synchronous and asynchronous calls to stored procedures, handling connections to single or multiple cluster nodes. The C++ client, available as a pre-compiled kit or from source, implements the VoltDB wire protocol for invoking procedures synchronously via invoke() or asynchronously with callbacks, though it is single-threaded and not thread-safe. Stored procedures are invoked remotely via RPC mechanisms in these clients, where the procedure name and parameters are passed to the cluster, which routes the request to the appropriate partition for execution.[15][60]
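A hedged sketch of the Client2 API follows. The method names used here (connectSync, callProcedureSync, callProcedureAsync) reflect one reading of the client documentation and should be verified against the installed version; the procedure name is hypothetical.

```java
import java.util.concurrent.CompletableFuture;
import org.voltdb.client.Client2;
import org.voltdb.client.Client2Config;
import org.voltdb.client.ClientFactory;
import org.voltdb.client.ClientResponse;

// Sketch of synchronous and asynchronous calls with the Client2 API.
public class Client2Example {
    public static void main(String[] args) throws Exception {
        Client2Config config = new Client2Config();
        Client2 client = ClientFactory.createClient(config);
        client.connectSync("localhost");

        // Synchronous call: blocks until the response arrives.
        ClientResponse r = client.callProcedureSync("DebitAccount", 42L, 100L);
        System.out.println(r.getStatusString());

        // Asynchronous call: returns a CompletableFuture rather than
        // requiring a callback object as the older client does.
        CompletableFuture<ClientResponse> f =
                client.callProcedureAsync("DebitAccount", 43L, 50L);
        f.thenAccept(resp -> System.out.println(resp.getStatusString())).join();

        client.close();
    }
}
```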
Extensibility in VoltDB centers on customizable code execution within the database. Developers write custom stored procedures as Java classes extending the VoltProcedure abstract class, implementing the run() method to define transactional logic using SQLStmt for parameterized queries queued via voltQueueSQL() and executed with voltExecuteSQL(). These procedures encapsulate complex business logic and are compiled into the database schema for atomic execution. User-defined functions (UDFs) further enhance flexibility, allowing scalar or aggregate functions to be defined in Java and declared via CREATE FUNCTION or CREATE AGGREGATE FUNCTION statements, integrating custom computations directly into SQL queries for tasks like data conversion or machine learning model evaluation.[61][62]
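A scalar UDF is an ordinary Java class with a public method, as in the hypothetical sketch below; the class, package, and function names are invented for illustration.

```java
package example;

// A scalar user-defined function implemented as a plain Java method.
// Parameter and return types map to VoltDB SQL types (FLOAT here).
public class DistanceFunctions {
    // Toy computation: squared Euclidean distance between two points.
    public double distanceSquared(double x1, double y1, double x2, double y2) {
        double dx = x2 - x1;
        double dy = y2 - y1;
        return dx * dx + dy * dy;
    }
}
```

After compiling the class and loading it into the database (for example with sqlcmd's LOAD CLASSES directive), it would be declared with a statement such as CREATE FUNCTION distance_squared FROM METHOD example.DistanceFunctions.distanceSquared, after which distance_squared can appear in SQL expressions like any built-in function.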
Integration with external systems is supported through streaming import/export connectors and specialized tools. VoltDB integrates with Apache Kafka for real-time data ingestion and egress, using the Kafka importer to subscribe to topics and insert records into database tables, and the Kafka export connector to publish serialized data from export streams to Kafka topics, using protocols compatible with Kafka version 0.8.2 and later. For broader real-time pipelines, Volt Active(SP), VoltDB's stream processing framework, enables building cloud-native data flows that combine Kafka sources with stateful or stateless operations, leveraging the database for reference data access to support low-latency decisions. As of 2025, Volt Active(SP) supports stateless stream processing integrated with OLTP, enabling the handling of petabytes of data across multiple real-time streams and data stores.[63][36][64]
Compatibility with business intelligence (BI) tools is achieved via export connectors like JDBC, HTTP, and file-based targets, which stream transactional data asynchronously to external systems without impacting database performance, allowing real-time analytics and dashboard updates.[36]
Security features ensure controlled and protected access to VoltDB clusters. Role-based access control (RBAC) manages permissions by assigning users to roles defined in the schema; security is enabled through the security setting in the deployment configuration, and privileges are checked when a stored procedure is invoked. Communication is secured using Transport Layer Security (TLS/SSL), which encrypts data between clients and servers, or Kerberos for ticket-based authentication in networked environments. With the built-in hash-based authentication provider, clients supply credentials that are validated against SHA-256 hashes. Together, TLS for encryption and Kerberos for integrated authentication protect against unauthorized access and data interception when enabled in the deployment configuration.[65]
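The pieces fit together roughly as in the sketch below: role DDL on the schema side, a security setting in the deployment file, and credentials supplied through the client configuration. All role, user, and procedure names are hypothetical.

```java
import org.voltdb.client.Client;
import org.voltdb.client.ClientConfig;
import org.voltdb.client.ClientFactory;

// Connecting with credentials once security is enabled.
public class SecureConnect {
    public static void main(String[] args) throws Exception {
        // Schema side (loaded by an administrator, e.g. via sqlcmd):
        //   CREATE ROLE appuser WITH defaultproc;
        //   CREATE PROCEDURE ALLOW appuser FROM CLASS example.DebitAccount;
        // Deployment side: the security element is enabled and users
        // with roles are listed in the deployment configuration file.

        // Client side: credentials are supplied through ClientConfig
        // and validated by the server before any procedure runs.
        ClientConfig config = new ClientConfig("operator", "examplePassword");
        Client client = ClientFactory.createClient(config);
        client.createConnection("localhost");
        client.callProcedure("DebitAccount", 42L, 100L);
        client.close();
    }
}
```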