ArangoDB
ArangoDB is a native multi-model, open-source NoSQL database system designed to handle graph, document, key-value, and full-text search data models within a unified core, enabling flexible data storage and querying without the need for multiple specialized databases.[1] It is developed by ArangoDB Inc., founded in 2014 by Claudius Weinberger and Frank Celler in Cologne, Germany. The database employs the ArangoDB Query Language (AQL), a declarative, SQL-like language that allows complex traversals and joins across all supported data models in a single query.[1] As of November 2025, the latest stable version is 3.12.6.1, available in community and enterprise editions; the community edition, licensed under the Business Source License (BSL), provides full access to features without time limits for non-commercial use and for internal commercial use up to a 100 GiB dataset size.[2][3] Key features of ArangoDB include horizontal scalability through sharding and replication, support for ACID transactions, and integration with machine learning workflows via graph analytics engines for algorithms such as PageRank and connected components.[1] It also offers advanced capabilities such as full-text search with ArangoSearch and vector search for AI applications, which the vendor claims can reduce infrastructure costs by up to 70% compared to siloed systems.[4] The system is particularly suited to connected data in scenarios requiring real-time analytics, such as recommendation engines, fraud detection, supply chain optimization, and generative AI platforms like chatbots and copilots.[5] Notable adopters include enterprises in the finance, healthcare, and technology sectors, such as Deloitte, Cloudera, and NVIDIA, which use its performance for scalable AI and graph-based workloads.[6]
Introduction
Overview
ArangoDB is an open-source, native multi-model NoSQL database that supports graph, document, key-value, vector, and search data models within a single core, allowing seamless integration of diverse data structures without the need for multiple specialized databases.[5] This architecture enables developers to handle complex, interconnected data workloads efficiently in one unified system. The primary purpose of ArangoDB is to unify data management for applications requiring flexible querying across models, facilitating use cases such as AI-driven contextual analytics, real-time recommendations, and knowledge graphs.[5] By combining these capabilities, it addresses the challenges of siloed data systems, promoting faster development and more agile data processing in modern applications.[5] As a foundation for AI data platforms, ArangoDB is marketed as reducing integration costs by up to 70% through its native support for multiple paradigms, enabling enterprises to build scalable solutions for generative AI, fraud detection, and personalized services without extensive custom engineering.[5]
Key Characteristics
ArangoDB is distinguished by its native multi-model architecture, which integrates support for graph, document, key-value, vector, and search data models within a single database engine, enabling developers to perform seamless operations across these models without requiring data duplication or complex external joins. This unified approach allows diverse data types to be queried with a single declarative language, reducing the need for multiple specialized databases and minimizing integration overhead by up to 70%.[7]

At its core, ArangoDB stores data in JSON format, represented internally in the efficient VelocyPack binary format, providing schema flexibility that accommodates evolving application requirements without predefined structures. This design supports full ACID-compliant transactions across all supported models in single-server deployments, ensuring atomicity, consistency, isolation, and durability for multi-document and multi-collection operations; in distributed setups, it maintains ACID properties for operations within the same shard.[8]

For high-performance workloads, ArangoDB incorporates GPU acceleration, particularly through integration with NVIDIA's cuGraph for graph analytics, enabling faster processing of complex computations such as pattern detection and centrality measures. It also offers both horizontal scaling via distributed clustering and auto-sharding, and vertical scaling, to handle varying loads efficiently, making it suitable for enterprise-scale applications.[9][10]

Developer-friendly aspects are further enhanced by ArangoDB's schema-free nature, which promotes agile development, and by its native support for vector embeddings and search, facilitating integration with modern AI tools such as large language models (LLMs) for applications like GraphRAG and contextual intelligence systems that ground AI outputs in trusted enterprise data.[8][11]
History and Development
Founding and Early Development
ArangoDB originated in 2011 in Cologne, Germany, when developers Claudius Weinberger, Frank Celler, and Lucas Dohmen began working on a new database project named AvocadoDB. The initiative aimed to develop a flexible NoSQL database capable of handling multiple data models, including key-value, document, and graph structures, to address limitations in existing systems that often required separate databases for different data types.[12] In May 2012, the project was renamed ArangoDB to avoid potential legal conflicts associated with the original name, while retaining the avocado-inspired logo as a nod to its versatile design. Shortly thereafter, in spring 2012, the first version of ArangoDB was released as an open-source project under the Apache 2.0 license, reflecting its early focus on integrating document and graph storage for more efficient data management.[13][14][15] The project's growth led to the formal establishment of ArangoDB GmbH in May 2014 by Weinberger, Celler, and Dohmen, marking the transition from a personal development effort to a commercial entity dedicated to further developing, maintaining, and supporting the database. This company formation in Cologne laid the groundwork for professionalizing the open-source project while continuing to foster community contributions.[16]
Funding and Growth
ArangoDB received its first external funding in February 2015 with a €1.85 million seed round led by Machao Holdings AG and triAGENS.[16] In June 2017, ArangoDB secured €4.2 million in seed funding led by Target Partners, with participation from CP Ventures and others, to accelerate its international expansion, particularly strengthening its presence in the US market.[17] This investment supported the company's efforts to build on its multi-model database foundation, originally developed from the open-source AvocadoDB project started in 2011.[18]

Building on this momentum, ArangoDB raised $10 million in a Series A round in March 2019, led by Bow Capital with involvement from Target Partners and existing investors.[19] The funds were allocated toward global expansion, including hiring additional engineering and sales personnel to meet rising demand for its native multi-model database and to drive product development.[20] This round coincided with the relocation of its headquarters to San Francisco, California, marking a key step in establishing a stronger foothold in the North American market while maintaining operations in Cologne, Germany.[21]

In October 2021, ArangoDB announced a $27.8 million Series B funding round led by Iris Capital, with participation from Bow Capital, Target Partners, and New Forge, bringing total financing to approximately $47 million.[22] The investment aimed to advance graph machine learning capabilities, enhance analytics and AI integrations, and support cloud-native services for enterprise-scale deployments.[23]

These funding rounds fueled significant organizational growth, including the expansion of the workforce to over 100 employees across three continents by 2023.[24] The company maintained its engineering hub in Cologne, Germany, while the San Francisco office served as the primary headquarters, enabling a distributed team to serve a global customer base in industries such as finance, healthcare, and technology.[25]
Major Releases and Milestones
ArangoDB's major releases have progressively enhanced its multi-model capabilities, performance, and integration with emerging technologies. Version 3.0, released in June 2016, marked a significant milestone by unifying document, graph, and key-value models into a single, cohesive architecture, enabling seamless queries across data types.[26] This release laid the foundation for ArangoDB's native multi-model support, allowing developers to mix and match data models without application-level sharding.[27]

Subsequent versions focused on scalability and advanced analytics. ArangoDB 3.8, generally available on July 29, 2021, introduced new graph algorithms, including support for weighted traversals and k-shortest paths, improving analytics at scale for complex networks.[28] In September 2022, version 3.10 added native ARM architecture support, broadening deployment options for edge and cloud environments, alongside computed values and automated graph sharding.[29] Version 3.11, released on May 30, 2023, optimized search and graph query performance with features like improved AQL execution and enhanced view management, boosting usability for large-scale data operations.[30] The 3.12 series, starting with its general availability on March 27, 2024, integrated vector search capabilities and AI-focused optimizations, such as improved memory accounting and parallel AQL execution, to support generative AI workloads. As of November 2025, the latest stable release is 3.12.6.1 from November 8, 2025, which includes enhancements to the Kubernetes operator for better orchestration in containerized environments.[26][31]

A key product milestone was the launch of ArangoDB Oasis, the company's managed cloud service, on November 20, 2019, simplifying deployment and scaling for multi-model databases across AWS and Google Cloud.
By 2025, ArangoDB emphasized generative AI integrations through the Arango AI Suite, featuring tools for multimodal data ingestion, LLM connectivity, and graph-powered RAG systems to enable contextual AI applications.[32] In October 2023, ArangoDB announced a shift in its licensing model to promote sustainability. Starting with version 3.12, the source code adopted the Business Source License (BSL) 1.1, while binaries fell under the ArangoDB Community License, which limits commercial use of the Community Edition to datasets under 100 GiB per cluster. This change drew criticism from parts of the open-source community for restricting commercial applications compared to the previous Apache 2.0 license.[33]

| Version | Release Date | Key Milestones |
|---|---|---|
| 3.0 | June 2016 | Unified multi-model architecture |
| 3.8 | July 2021 | Weighted graph traversals and analytics |
| 3.10 | September 2022 | ARM support and automated sharding |
| 3.11 | May 2023 | Search and graph performance enhancements |
| 3.12 | March 2024 | Vector search and AI optimizations |
Technical Architecture
Core Components
ArangoDB's storage engine is built on RocksDB, a persistent key-value store optimized for handling large datasets with fast read and write operations. It persists documents on disk while maintaining hot data in memory, using a log-structured merge-tree design to ensure efficient storage and recovery. The engine supports native handling of JSON documents in a schema-optional manner, allowing flexible, semi-structured data storage without rigid schema enforcement. Write-ahead logging (WAL) is employed for durability and replication, with WAL files typically sized around 64 MiB and configurable via options like --rocksdb.write-buffer-size. Compression using the LZ4 algorithm is enabled by default starting from level 2 of the storage hierarchy to optimize disk usage.[34]
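The write path described above (append to the WAL first, then update in-memory structures, with periodic flushes to disk) can be sketched in a few lines. This is a toy illustration of the general log-structured pattern, not ArangoDB's or RocksDB's actual internals; the class and method names are hypothetical.

```python
import json

class SketchStorageEngine:
    """Toy WAL + memtable write path, loosely mirroring the
    log-structured design described above. Hypothetical code."""

    def __init__(self):
        self.wal = []        # append-only log; lives on disk in a real engine
        self.memtable = {}   # hot data held in memory
        self.sstable = {}    # flushed, persistent key-value data

    def put(self, key, doc):
        # 1. Append to the write-ahead log first, for durability/recovery.
        self.wal.append(json.dumps({"key": key, "doc": doc}))
        # 2. Then update the in-memory memtable.
        self.memtable[key] = doc

    def flush(self):
        # Periodically merge the memtable down into persistent storage
        # (an LSM tree does this via compaction into sorted files).
        self.sstable.update(self.memtable)
        self.memtable.clear()

    def recover(self):
        # After a crash, replay the WAL to rebuild unflushed state.
        for entry in self.wal:
            record = json.loads(entry)
            self.memtable[record["key"]] = record["doc"]

engine = SketchStorageEngine()
engine.put("users/1", {"name": "Ada"})
engine.flush()                         # users/1 now in persistent storage
engine.put("users/2", {"name": "Grace"})  # still only in WAL + memtable
```

Because every write hits the log before the memtable, a crash between `put` and `flush` loses nothing: `recover` replays the log.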
The execution engine processes AQL (ArangoDB Query Language) queries by generating and optimizing execution plans through a cost-based optimizer. This optimizer creates multiple potential plans for a query, evaluates their estimated costs, and selects the one with the lowest cost to ensure efficient execution while preserving query semantics. Key optimization rules include index usage, filter removal when covered by indexes, and asynchronous prefetching to improve performance. Parallel execution is supported, particularly in distributed environments, using nodes like ScatterNode and GatherNode to distribute and collect data across shards, though core plan optimization occurs even in standalone setups. The engine represents queries as pipelines of execution nodes, such as IndexNode for index scans and ReturnNode for result output, enabling targeted optimizations like index-only or scan-only paths.[35]
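The cost-based selection step can be sketched as follows. The plan node names echo those mentioned above, but the candidate plans and cost figures are invented for illustration; the real optimizer estimates costs from statistics and applies many more rewrite rules.

```python
# Minimal sketch of cost-based plan selection: generate candidate
# execution plans, estimate a cost for each, keep the cheapest.
# Costs here are made-up numbers, not ArangoDB's estimates.

def candidate_plans(has_index):
    plans = [{"nodes": ["EnumerateCollectionNode", "FilterNode", "ReturnNode"],
              "cost": 1000.0}]  # baseline: full collection scan + filter
    if has_index:
        # An index allows replacing the scan+filter pair with an IndexNode
        # (the "filter removal when covered by indexes" rule above).
        plans.append({"nodes": ["IndexNode", "ReturnNode"], "cost": 40.0})
    return plans

def pick_plan(plans):
    # The optimizer keeps the plan with the lowest estimated cost.
    return min(plans, key=lambda p: p["cost"])

best = pick_plan(candidate_plans(has_index=True))
print(best["nodes"])  # ['IndexNode', 'ReturnNode']
```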
ArangoDB provides several index types to accelerate data retrieval, all integrated with the RocksDB storage engine for persistence. Persistent indexes serve as the primary type for equality matches, range queries, and sorting, offering logarithmic time complexity and supporting options like sparsity control and caching; hash and skiplist indexes are legacy aliases for this type and are no longer recommended for new implementations. Full-text indexes enable word-based searches on attributes, supporting prefix and exact word matching, though they are deprecated since version 3.10 in favor of the more advanced ArangoSearch views. Geo-spatial indexes facilitate location-based queries, such as radius searches or nearest-neighbor lookups, using 2D coordinates or GeoJSON objects, and are invoked via specific AQL functions or automatic optimization. All these indexes are stored on disk with in-memory caches configurable via parameters like --cache.size and --rocksdb.block-cache-size to balance performance and resource usage.[36]
The transaction manager in ArangoDB ensures ACID compliance for operations spanning multiple collections and graphs by leveraging RocksDB's built-in transaction capabilities. For standalone AQL queries, it implements atomicity, consistency, isolation, and durability, where changes are isolated until commit and persisted via WAL for recovery. Transactions can involve multiple document collections, treating graphs as interconnected collections to maintain integrity across edges and vertices. Stream transactions allow explicit begin/commit/abort control for multi-document operations, while JavaScript transactions (deprecated in version 3.12) provide a programmatic interface with automatic commit handling. Durability is configurable, but committed changes are guaranteed to survive server restarts.[34][37]
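The all-or-nothing behaviour of a stream transaction (begin, stage writes in isolation, then commit or abort) can be sketched like this. The class and the plain-dict store are hypothetical stand-ins for illustration, not the ArangoDB transaction API.

```python
class SketchTransaction:
    """Toy begin/stage/commit/abort flow mirroring the stream-transaction
    semantics described above. Hypothetical code."""

    def __init__(self, store):
        self.store = store
        self.staged = {}   # writes isolated from the store until commit

    def insert(self, key, doc):
        self.staged[key] = doc          # visible only inside this transaction

    def commit(self):
        self.store.update(self.staged)  # publish all staged writes at once
        self.staged = {}

    def abort(self):
        self.staged = {}                # discard everything; store untouched

db = {}
txn = SketchTransaction(db)
# A graph update touches both a vertex collection and an edge collection;
# atomicity means both documents appear, or neither does.
txn.insert("vertices/a", {"v": 1})
txn.insert("edges/a-b", {"_from": "vertices/a", "_to": "vertices/b"})
txn.commit()
```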
Clustering and Scaling
ArangoDB achieves distributed deployment through its Cluster mode, which distributes data across multiple nodes using automatic sharding and synchronous leader-follower replication to ensure high availability and fault tolerance.[38] In this setup, collections are partitioned into shards based on a configurable shard key, typically the document's _key field via consistent hashing, allowing data to be evenly spread across DB-Server nodes without manual intervention.[39] Each shard maintains one leader replica responsible for handling writes, with one or more follower replicas that synchronously replicate changes to maintain consistency; the replication factor, set per collection, determines the total number of copies (e.g., 3 for one leader and two followers).[40]
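Hash-based shard assignment of the kind described can be sketched as follows, assuming a simple hash-modulo scheme; ArangoDB's actual hash function and shard bookkeeping differ, but the routing property is the same.

```python
import hashlib

def shard_for(key, num_shards):
    """Assign a document to a shard by hashing its shard key (by default
    the _key attribute). Illustrative MD5-mod scheme, not ArangoDB's
    actual implementation."""
    digest = hashlib.md5(key.encode()).hexdigest()
    return int(digest, 16) % num_shards

# The same key always maps to the same shard, so a Coordinator can
# route a lookup by _key directly to the responsible DB-Server.
assert shard_for("users/12345", 4) == shard_for("users/12345", 4)

# Hashing spreads many keys roughly evenly across the shards.
shards = {shard_for(f"users/{i}", 4) for i in range(1000)}
print(sorted(shards))
```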
The system supports both active-passive and active-active configurations for resilience. In active-passive setups, such as the deprecated Active Failover mode for single-server instances, one active leader handles operations while passive followers asynchronously replicate data for automatic failover.[41] For active-active clustering in distributed environments, particularly in the Enterprise Edition, datacenter-to-datacenter replication enables bidirectional synchronization across geographically separated clusters, allowing read and write operations from multiple active sites.[42] Leader election occurs automatically if a leader fails, with configurable timeouts (e.g., 15 seconds), ensuring minimal downtime through the resilient Agency component that coordinates the cluster using Raft consensus.[38]
Horizontal scaling in ArangoDB is achieved by dynamically adding DB-Server nodes, which triggers shard rebalancing to distribute load evenly and increase overall throughput linearly with the number of nodes; the architecture has no inherent limits on scalability, supporting hundreds of DB-Servers and Coordinators constrained only by hardware resources like CPU, memory, and network bandwidth.[43] This enables handling large-scale workloads, such as terabyte-sized datasets or high query volumes, by scaling out across commodity hardware while maintaining performance through the stateless Coordinator nodes that route client requests.[44]
To address challenges in geo-distributed data access, ArangoDB introduces satellite collections, which replicate an entire collection synchronously to every DB-Server node in the cluster, allowing joins with sharded data to execute locally on each node and minimizing cross-node network traffic—ideal for scenarios requiring low-latency operations across distributed locations.[45] Complementing this, SmartJoins optimize cross-shard queries by enforcing identical sharding on related collections (via the distributeShardsLike property), enabling the query optimizer to perform co-located joins without routing data through the Coordinator, thus reducing latency and inter-node communication for complex operations like graph traversals or analytical joins.[46]
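The co-location idea behind SmartJoins can be illustrated in miniature: when two collections are sharded by the same key function, as distributeShardsLike enforces, a document and all documents related to it land on the same shard, so the join can execute locally. The collection names, keys, and hash scheme below are illustrative.

```python
import hashlib

def shard_for(key, num_shards=4):
    # Toy hash-modulo sharding, as a stand-in for the real scheme.
    return int(hashlib.md5(key.encode()).hexdigest(), 16) % num_shards

# Imagine 'orders' sharded by its customer key, mirroring how
# 'customers' is sharded by _key (distributeShardsLike in miniature).
customer_key = "c42"
order_shard_keys = [customer_key, customer_key, customer_key]

# Every order shares a shard with its customer, so joining
# customers-to-orders needs no cross-shard network traffic.
same_shard = all(
    shard_for(k) == shard_for(customer_key) for k in order_shard_keys
)
print(same_shard)  # True
```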
Deployment and management of scaled clusters are streamlined in containerized environments via the ArangoDB Kubernetes Operator (kube-arangodb), which received enhancements in version 3.12 and automates provisioning, scaling, backups, and failover handling within Kubernetes clusters, supporting elastic resource allocation and seamless integration with cloud-native infrastructures.[47]
Features
Data Models Supported
ArangoDB supports multiple native data models, allowing users to store and query data in key-value, document, graph, and vector formats within the same database instance. This multi-model approach enables seamless integration across models without data duplication or complex ETL processes.[48] The document model in ArangoDB is based on JSON objects stored in collections, supporting nested structures and flexible schemas without rigid upfront definitions. Documents can contain structured or semi-structured data, with each document being self-contained and capable of having unique attributes. This model facilitates granular queries on individual attributes, aggregation operations, and the use of secondary indexes for efficient retrieval. For example, a document might represent a user profile with embedded arrays for preferences, allowing direct access to nested elements.[48][49] The key-value model serves as a foundational subset of the document model, providing simple persistent storage where each entry is identified by an immutable string key (_key). It leverages a primary index on the key for fast lookups and includes a unique identifier (_id) in the format <collection>/<key>. This model is particularly suited for caching scenarios, with support for time-to-live (TTL) settings to automatically expire entries after a specified duration. Users can store arbitrary JSON values associated with keys, enabling straightforward get, set, and delete operations.[48][50]
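The key-value usage pattern with TTL expiry can be sketched as follows; the store class is a hypothetical stand-in (ArangoDB implements expiry server-side via TTL indexes rather than per-read checks), but the observable behaviour is similar.

```python
import time

class SketchKVStore:
    """Toy key-value store with per-entry TTL expiry, mirroring the
    caching pattern described above. Hypothetical code."""

    def __init__(self):
        self._data = {}   # key -> (value, expiry timestamp or None)

    def set(self, key, value, ttl=None):
        expires = time.time() + ttl if ttl is not None else None
        self._data[key] = (value, expires)

    def get(self, key):
        value, expires = self._data.get(key, (None, None))
        if expires is not None and time.time() >= expires:
            del self._data[key]   # lazily expire on access
            return None
        return value

store = SketchKVStore()
store.set("session/abc", {"user": "ada"}, ttl=3600)  # expires in an hour
store.set("permanent", "kept")                       # no expiry
```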
ArangoDB's graph model employs a property graph structure, consisting of vertices (nodes as documents) and edges (documents with _from and _to attributes linking vertices). Edges are directed, supporting traversals in outbound, inbound, or bidirectional directions. Native graph algorithms, such as shortest path and neighborhood queries, are built-in for efficient pattern matching and relationship analysis. For instance, in a social network graph, vertices could represent users, and edges could denote friendships, allowing queries to traverse multi-hop connections. These models can be queried across boundaries using AQL.[48][51]
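A breadth-first OUTBOUND traversal over edge documents of the shape described (edges carrying _from and _to vertex identifiers) can be sketched in plain Python; the vertices and edges are illustrative, and a real traversal would use edge indexes rather than scanning a list.

```python
from collections import deque

# Edge documents in ArangoDB style: _from and _to hold vertex _id values.
edges = [
    {"_from": "users/alice", "_to": "users/bob"},
    {"_from": "users/bob", "_to": "users/carol"},
    {"_from": "users/carol", "_to": "users/dave"},
]

def outbound_neighbors(vertex):
    return [e["_to"] for e in edges if e["_from"] == vertex]

def traverse(start, max_depth):
    """Breadth-first OUTBOUND traversal up to max_depth hops,
    sketching what an AQL graph traversal computes."""
    seen, result = {start}, []
    frontier = deque([(start, 0)])
    while frontier:
        vertex, depth = frontier.popleft()
        if depth == max_depth:
            continue
        for nxt in outbound_neighbors(vertex):
            if nxt not in seen:
                seen.add(nxt)
                result.append(nxt)
                frontier.append((nxt, depth + 1))
    return result

print(traverse("users/alice", 2))  # ['users/bob', 'users/carol']
```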
Introduced in version 3.12.4, the vector model enables storage of embeddings—arrays of numerical vectors generated by machine learning models to capture semantic meanings—as attributes within documents. These embeddings support similarity searches using indexes powered by the Faiss library, with configurable distance metrics like cosine similarity, inner product, or L2 distance. Vector indexes must be created on pre-populated data, and new embeddings are dynamically assigned to clusters for ongoing searches. This model integrates natively with graph and document structures, allowing hybrid queries that combine semantic similarity with relational traversals, such as retrieving similar documents connected via graph edges in AI-driven applications.[52][53]
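Cosine similarity, one of the metrics listed above, can be computed directly to show what a vector ranking does. The embeddings below are tiny toy vectors, and the real index delegates nearest-neighbor search to Faiss rather than scanning every document.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy 3-dimensional embeddings stored as document attributes
# (document keys and values are illustrative).
docs = {
    "articles/graphs": [0.9, 0.1, 0.0],
    "articles/cooking": [0.0, 0.2, 0.9],
    "articles/networks": [0.8, 0.3, 0.1],
}
query = [1.0, 0.0, 0.0]

# Rank documents by similarity to the query embedding, best first.
ranked = sorted(docs, key=lambda k: cosine_similarity(query, docs[k]),
                reverse=True)
print(ranked[0])  # articles/graphs
```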
Query Language and Processing
ArangoDB's primary query interface is the ArangoDB Query Language (AQL), a declarative language designed for manipulating data across document, graph, and key-value models within a unified syntax.[54] AQL allows users to express desired results using SQL-like constructs, including operations for reading, writing, and modifying data without specifying the underlying execution details. It supports joins to combine data from multiple collections, subqueries for nested logic, and graph traversals to navigate relationships, enabling complex queries like finding connected components or shortest paths in a single statement.[55] For example, a traversal query might use the FOR ... IN ... GRAPH syntax to explore edges from a starting vertex, applying filters and options for direction, depth, and uniqueness.[55]
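A traversal query of the shape just described might look like the following string, written as a client driver would submit it to the server. The graph name 'social', the filter attribute, and the bind parameter value are all illustrative.

```python
# An AQL graph traversal embedded as a Python string, as a driver
# would send it. Graph name and filter are hypothetical examples.
aql = """
FOR vertex, edge, path
  IN 1..3 OUTBOUND @start GRAPH 'social'
  OPTIONS { uniqueVertices: 'path' }
  FILTER vertex.active == true
  RETURN vertex._key
"""
# Bind parameters (see below) keep user input out of the query text.
bind_vars = {"start": "users/alice"}
print(aql.strip().startswith("FOR"))  # True
```

The `1..3` range bounds the traversal depth, `OUTBOUND` fixes the direction, and `OPTIONS` controls uniqueness, matching the direction/depth/uniqueness options mentioned above.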
Query processing in ArangoDB begins with parsing the AQL statement on the server, followed by optimization to generate an efficient execution plan. The optimizer employs cost-based planning, evaluating multiple potential plans and selecting the one with the lowest estimated cost based on heuristics such as data access patterns and index usage.[35] Early pruning is achieved through rules that reposition filters closer to data sources, reducing the volume of intermediate results; for instance, the move-filters-up rule shifts conditions before joins or traversals.[35] Parallel execution is facilitated in clustered environments via rules like async-prefetch, which enables asynchronous loading of data batches, and parallelize-gather, which distributes computation across shards for scalable performance across data models.[35]
To handle large datasets securely and efficiently, AQL incorporates bind parameters for injecting values into queries, preventing injection attacks while allowing parameterized reuse.[56] Parameters are denoted with @ for values (e.g., FOR doc IN collection FILTER doc.age > @minAge RETURN doc) or @@ for collection names, passed separately via APIs like bindVars.[56] Results are streamed using cursor-based interfaces, which return data in configurable batches (via batchSize) rather than loading everything into memory at once.[57] This streaming mode, enabled with the stream: true option, processes results lazily on the server, minimizing memory overhead for voluminous outputs and supporting iterative client-side consumption through subsequent cursor requests.[57]
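The batched cursor behaviour can be simulated in a few lines: the server hands back fixed-size batches and the client consumes them one at a time, so the full result set never has to sit in client memory at once. This is a pure-Python sketch of the mechanism, not a real driver call.

```python
def server_cursor(results, batch_size):
    """Yield successive batches of results, as an AQL cursor endpoint
    would return them for a given batchSize. Illustrative only."""
    for i in range(0, len(results), batch_size):
        yield results[i:i + batch_size]

# Ten result documents, fetched in batches of four: the client makes
# an initial request plus follow-up cursor requests for later batches.
results = [{"_key": str(i)} for i in range(10)]
batches = list(server_cursor(results, batch_size=4))
print([len(b) for b in batches])  # [4, 4, 2]
```

With stream mode, the server additionally computes each batch lazily instead of materializing the whole result first; the client-side iteration pattern stays the same.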