Performance Tuning

Connection Pool

config = {
    "hosts": [("node1", 3000), ("node2", 3000)],
    "max_conns_per_node": 300,   # default: 256
    "min_conns_per_node": 10,    # pre-warm
    "idle_timeout": 55,          # below server proto-fd-idle-ms (60s)
}
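
As a rough sizing check, the pool limit above can be multiplied out to see how many sockets a deployment might open against a single server node. The sketch below is back-of-envelope arithmetic only: the client process count is hypothetical, and 15,000 is the Aerospike server's default `proto-fd-max`, which your cluster may override.

```python
def peak_conns_per_server_node(client_processes: int, max_conns_per_node: int) -> int:
    """Upper bound on sockets one server node may see from all client processes."""
    # Each client process keeps its own pool of up to max_conns_per_node per node.
    return client_processes * max_conns_per_node


PROTO_FD_MAX = 15_000  # assumed: server default, check your aerospike.conf

# Hypothetical deployment: 40 client processes using the config shown above.
peak = peak_conns_per_server_node(client_processes=40, max_conns_per_node=300)
fits = peak <= PROTO_FD_MAX
print(peak, fits)
```

If the bound exceeds the server limit, lowering max_conns_per_node or the number of client processes is usually simpler than raising the server setting.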

Read Optimization

Select Specific Bins

# Reads ALL bins from server
record = client.get(key)

# Reads only what you need (less network I/O)
record = client.select(key, ["name", "age"])

Use Batch Reads

# N sequential round-trips
results = [client.get(k) for k in keys]

# Single round-trip
batch = client.batch_read(keys, bins=["name", "age"])
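
To see why the round-trip count dominates, a simple latency model helps. The sketch below is an illustration under stated assumptions, not a measurement: the round-trip time and per-record server cost are made-up values in microseconds, and the model ignores parallel fan-out across nodes.

```python
def sequential_us(n_keys: int, rtt_us: int) -> int:
    """One network round-trip per key, issued one after another."""
    return n_keys * rtt_us


def batched_us(n_keys: int, rtt_us: int, per_record_us: int) -> int:
    """One round-trip for the whole batch plus a small per-record cost."""
    return rtt_us + n_keys * per_record_us


# Hypothetical numbers: 500 keys, 500 us round-trip, 2 us per record.
seq = sequential_us(500, 500)
bat = batched_us(500, 500, 2)
print(seq, bat)
```

Under these assumptions the sequential loop spends almost all of its time waiting on the network, which is the cost the batch call avoids.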

NumPy Batch Reads

For numeric workloads, skip Python dict overhead entirely:

import numpy as np

dtype = np.dtype([("score", "i8"), ("rating", "f8")])
batch = client.batch_read(keys, bins=["score", "rating"], _dtype=dtype)
# batch.batch_records is a numpy structured array

See NumPy Batch Guide.
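
Once the results are in a structured array, filtering and aggregation can stay in NumPy. The sketch below uses a hand-built array as a stand-in for batch.batch_records (the values are made up), so it shows only the NumPy side, not the client call.

```python
import numpy as np

dtype = np.dtype([("score", "i8"), ("rating", "f8")])

# Stand-in for batch.batch_records; real data would come from batch_read.
records = np.array([(10, 4.5), (25, 3.0), (40, 5.0)], dtype=dtype)

# Vectorized filter: keep rows whose rating is at least 4.0.
high = records[records["rating"] >= 4.0]

# Column-wise aggregation without building Python dicts.
total_score = int(high["score"].sum())
mean_rating = float(records["rating"].mean())
print(len(high), total_score, mean_rating)
```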

Write Optimization

Combine Operations

import time

now = int(time.time())  # example value for the updated_at bin

# Two round-trips
client.put(key, {"counter": 1})
client.put(key, {"updated_at": now})

# Single round-trip
ops = [
    {"op": aerospike.OPERATOR_WRITE, "bin": "counter", "val": 1},
    {"op": aerospike.OPERATOR_WRITE, "bin": "updated_at", "val": now},
]
client.operate(key, ops)

TTL Strategy

client.put(key, bins, meta={"ttl": aerospike.TTL_NEVER_EXPIRE})     # never expire
client.put(key, bins, meta={"ttl": aerospike.TTL_DONT_UPDATE})      # keep existing TTL
client.put(key, bins, meta={"ttl": aerospike.TTL_NAMESPACE_DEFAULT}) # use namespace default

Backpressure (High Concurrency)

When firing many concurrent operations (e.g., asyncio.gather with 100+ tasks), the upstream connection pool can run out of connections. Enable backpressure to queue excess operations instead of failing:

config = {
    "hosts": [("127.0.0.1", 3000)],
    "max_concurrent_operations": 64,       # at most 64 in-flight ops
    "operation_queue_timeout_ms": 5000,    # wait up to 5s for a slot
}

Disabled by default (zero overhead). When enabled:

Operations beyond the limit wait for a free slot instead of getting errors
Waiting operations resume as soon as a previous operation completes
BackpressureError is raised only if the timeout expires

Choosing max_concurrent_operations: Set this close to, but not above, max_conns_per_node (default 256), since in-flight operations draw connections from the pool. For a 3-node cluster, 64 is a conservative starting point that prevents pool exhaustion while maintaining high throughput.
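
The queue-then-timeout behavior described above can be pictured with a plain asyncio semaphore. This is a minimal sketch of the general technique, not aerospike-py's implementation: the Limiter class, SlotTimeout exception, and fake_op coroutine are hypothetical names used only for illustration.

```python
import asyncio


class SlotTimeout(Exception):
    """Hypothetical stand-in for BackpressureError."""


class Limiter:
    def __init__(self, max_concurrent: int, queue_timeout_s: float) -> None:
        self._sem = asyncio.Semaphore(max_concurrent)
        self._timeout = queue_timeout_s
        self.in_flight = 0
        self.peak = 0  # highest number of operations seen running at once

    async def run(self, op):
        # Wait for a free slot; give up if none frees within the timeout.
        try:
            await asyncio.wait_for(self._sem.acquire(), self._timeout)
        except asyncio.TimeoutError:
            raise SlotTimeout("no free slot before timeout") from None
        self.in_flight += 1
        self.peak = max(self.peak, self.in_flight)
        try:
            return await op()
        finally:
            self.in_flight -= 1
            self._sem.release()  # lets a waiting operation proceed


async def fake_op() -> str:
    await asyncio.sleep(0.01)  # simulated database call
    return "ok"


async def demo():
    limiter = Limiter(max_concurrent=4, queue_timeout_s=1.0)
    results = await asyncio.gather(*(limiter.run(fake_op) for _ in range(20)))
    return limiter.peak, results


peak, results = asyncio.run(demo())
print(peak, len(results))
```

All 20 simulated operations complete, but no more than 4 run at once; the rest wait for a slot rather than failing.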

Async Client

For high-concurrency workloads (web servers, fan-out reads):

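import asyncio

import aerospike_py as aerospike  # alias assumed from the aerospike.* names used in this guide

async def main() -> None:
    client = aerospike.AsyncClient({
        "hosts": [("127.0.0.1", 3000)],
        "max_concurrent_operations": 64,  # prevent pool exhaustion
    })
    await client.connect()

    try:
        keys = [("test", "demo", f"key{i}") for i in range(1000)]
        # Fan out 1000 reads; backpressure keeps at most 64 in flight
        results = await asyncio.gather(*(client.get(k) for k in keys))
    finally:
        # Close the client even if one of the reads raises
        await client.close()

asyncio.run(main())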

Expression Filters

Push filtering to the server to reduce network transfer:

from aerospike_py import exp

# Without filter: transfers ALL records, filters in Python
results = client.query("test", "demo").results()
active = [r for r in results if r.bins.get("active")]

# With filter: server returns only matching records
expr = exp.eq(exp.bool_bin("active"), exp.bool_val(True))
results = client.query("test", "demo").results(policy={"filter_expression": expr})

Tokio Runtime Workers

aerospike-py uses an internal Tokio async runtime with 2 worker threads by default. This is sufficient for I/O-bound database operations and minimizes CPU overhead when colocated with CPU-intensive workloads (e.g. PyTorch inference).

# Override if you need more parallelism for heavy batch operations
export AEROSPIKE_RUNTIME_WORKERS=4

Workers	Use Case
2 (default)	Most applications, ML serving, web servers
4	Heavy batch operations, high-throughput pipelines
8+	Rarely needed; profile first
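
If exporting the variable in the shell is inconvenient, it can also be set from Python. This is a sketch under an assumption: it presumes the runtime reads AEROSPIKE_RUNTIME_WORKERS when it starts, so the variable is set before the client library is imported. Verify this ordering against your installed version.

```python
import os

# Keep any value already exported in the shell; otherwise request 4 workers.
os.environ.setdefault("AEROSPIKE_RUNTIME_WORKERS", "4")

# Assumed ordering: import the client only after the variable is set.
# import aerospike_py

workers = os.environ["AEROSPIKE_RUNTIME_WORKERS"]
print(workers)
```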

Timeout Guidelines

Setting	Recommendation
`socket_timeout`	1-5s. Catches hung connections.
`total_timeout`	Set based on SLA. Includes retries.
`max_retries`	2-3 for reads; 0 for writes unless they are idempotent, since a retried write can be applied twice.
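
The recommendations above might translate into policy dictionaries like the following. This is a sketch under assumptions: the values are illustrative, the units are assumed to be milliseconds as in other Aerospike clients, and worst_case_ms is a hypothetical helper that ignores any sleep between retries.

```python
# Assumed units: milliseconds. Values are examples, not recommendations for every SLA.
read_policy = {"socket_timeout": 2_000, "total_timeout": 5_000, "max_retries": 2}
write_policy = {"socket_timeout": 2_000, "total_timeout": 5_000, "max_retries": 0}


def worst_case_ms(policy: dict) -> int:
    """Rough upper bound on how long one call can block."""
    attempts = policy["max_retries"] + 1  # first try plus retries
    # total_timeout caps the whole call, retries included.
    return min(policy["total_timeout"], attempts * policy["socket_timeout"])


read_bound = worst_case_ms(read_policy)
write_bound = worst_case_ms(write_policy)
print(read_bound, write_bound)
```

Checking this bound against the SLA is a quick way to confirm that the retry count and socket timeout fit inside total_timeout.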
