Performance Tuning

This guide covers common performance optimizations for ArcadeDB in production.

Memory Configuration

ArcadeDB uses both JVM heap and off-heap memory. Key settings:

# JVM heap (set via JAVA_OPTS or bin/server.sh)
JAVA_OPTS="-Xms2g -Xmx4g"

# Page cache size (default: dynamic based on available RAM)
arcadedb.pageCacheSize=1073741824

Set Xmx to no more than 50% of available RAM, leaving room for OS file cache and off-heap structures.

Index Selection

ArcadeDB supports multiple index types. Choose based on your access pattern:

Index Type Best For Trade-off

Index Type	Best For	Trade-off
LSM-Tree (default)	Write-heavy workloads, range queries	Slower point lookups than Hash
Hash	Point lookups (`WHERE key = value`)	No range queries, slower writes
LSM_VECTOR	Vector similarity search	Specialized for ANN queries
FULL_TEXT	Text search (`SEARCH_INDEX()`)	Specialized for keyword search
GEOSPATIAL	Spatial queries (`geo.*()`)	Specialized for geometry

LSM-Tree (default)

Write-heavy workloads, range queries

Slower point lookups than Hash

Hash

Point lookups (WHERE key = value)

No range queries, slower writes

LSM_VECTOR

Vector similarity search

Specialized for ANN queries

FULL_TEXT

Text search (SEARCH_INDEX())

Specialized for keyword search

GEOSPATIAL

Spatial queries (geo.*())

Specialized for geometry

Create appropriate indexes for your query patterns:

-- LSM-Tree for range queries (default)
CREATE INDEX ON User (email) UNIQUE

-- Hash for exact lookups
CREATE INDEX ON Session (token) UNIQUE HASH

-- Composite index for multi-column queries
CREATE INDEX ON Order (customerId, orderDate)

Query Optimization

Use EXPLAIN and PROFILE to understand query execution:

-- Show the query plan without executing
EXPLAIN SELECT FROM User WHERE email = '[email protected]'

-- Execute and show actual timings
PROFILE SELECT FROM User WHERE email = '[email protected]'

Common optimizations:

Add indexes for WHERE clause predicates — check EXPLAIN output for full scans
Limit result sets — Use LIMIT to avoid fetching unnecessary records
Project only needed fields — SELECT name, email FROM User is faster than SELECT * FROM User
Use parameterized queries — Avoid SQL parsing overhead for repeated queries

Bucket Configuration

ArcadeDB distributes records across buckets for parallel access. The default number of buckets per type equals the CPU core count.

For write-heavy types, increase the bucket count:

ALTER TYPE HighWriteType BUCKET +4

For read-heavy types with few writes, fewer buckets reduce overhead:

ALTER TYPE LookupTable BUCKET -2

Page Size Tuning

The default page size (64KB) works well for most workloads. Adjust for specific patterns:

# Larger pages for sequential scan workloads (analytics)
arcadedb.pageSize=131072

# Smaller pages for random access workloads
arcadedb.pageSize=32768

Connection Pool Settings

For remote applications, tune connection pool size based on concurrency:

# Maximum concurrent connections
arcadedb.serverHttpIncomingConnections=256

# Async queue size
arcadedb.asyncOperationsQueueSize=1024

Write Optimization

Batch transactions — Group related writes in a single transaction to amortize commit overhead
Async operations — Use the async API for non-blocking writes (Java Async API)
Bulk loading — Use BulkInsert via gRPC or the Java embedded API for initial data loads
WAL flush tuning — The arcadedb.txWalFlush setting controls whether each commit triggers an fsync. In production mode it defaults to 1 (fsync without metadata). For bulk imports or when your storage has battery-backed write cache, you can set it to 0 for maximum throughput. See concepts/transactions.adoc#wal-flush-durability

Read Optimization

Page cache — Increase arcadedb.pageCacheSize if cache hit rate is low
Index coverage — Ensure queries hit indexes (check with EXPLAIN)
Projections — Select only the fields you need
Materialized views — Pre-compute expensive aggregations

Performance Tuning

Memory Configuration

Index Selection

Query Optimization

Bucket Configuration

Page Size Tuning

Connection Pool Settings

Write Optimization

Read Optimization

Further Reading