Your managed OpenSearch cluster shouldn't be your #1 fire drill.

KalDB replaces brittle managed OpenSearch setups with a low-cost, high-throughput log analytics platform built for S3. Real engineers report memory leaks, crashes, slow queries, and data loss.

Seven Core OpenSearch Managed Problems

Real issues reported by engineering teams running managed OpenSearch at scale

1. Memory Pressure & GC Crashes

Nodes die, indexing stops, clusters need frequent restarts. Memory leak behavior persists across versions.

2. CPU Spikes During Aggregations

Expensive queries bring clusters to their knees. High-cardinality aggregations cause cascading failures.

3. Cluster Instability

Connection blips and unhealthy node states, especially on small clusters or poorly tuned deployments.

4. Security Vulnerabilities

CVEs, misconfigured permissions, difficult-to-audit role-based access. Security adds operational burden.

5. Data Loss During Outages

Unable to recover data after outages. Backups and recovery are painful and often incomplete.

6. Operational Complexity

Teams spend weeks tuning shards, heap, and ingestion. Capacity planning is a full-time job.

7. Troubleshooting & Visibility Gaps

Confusing error messages, lack of actionable diagnostics. Average GitHub issue resolution time: 79+ days.

Additional Reported Issues:
Slow cold starts Mapping conflicts Scroll context timeouts Index corruption Replication lag Plugin compatibility Slow bulk indexing Template confusion Circuit breaker errors Snapshot failures Version incompatibilities ILM policy failures

Real Community Voices

From OpenSearch forums and community discussions

"After ~19,400 indexed documents, the process gets killed because it uses too much memory... increasing heap doesn't help."

— OpenSearch Forum

"90% memory usage, 80% CPU spikes — our cluster was on the brink of collapse during peak hours."

— Community Case Study

"OpenSearch restarted and we lost our data. We had no way to recover weeks of logs."

— Reddit Discussion

"We spend more time managing OpenSearch than actually using the data. It's become a full-time job."

— Engineering Team Lead

Root Cause Analysis

Why OpenSearch struggles with modern log workloads

Lucene/Java Heap Limits

JVM pointer compression causes practical heap limits. GC sensitivity causes catastrophic failures beyond 32GB.

Non-Linear Query Costs

Aggregations and heavy queries blow up memory/CPU without careful index design.

Operational Complexity

Shards, replication, disk growth, and upgrades add combinatorial complexity.

Platform Brittleness

UI changes and minor version bumps risk behavior changes. Upgrades are risky.

Knowledge Drain

Original architects have departed. Institutional knowledge is declining in the community.

Slow Issue Resolution

Average GitHub issue resolution time: 79.6 days. Critical bugs linger for months.

KalDB Solutions

How KalDB addresses each OpenSearch limitation

OpenSearch ProblemKalDB Solution
Memory leaks & GC crashesS3-backed storage; stateless compute nodes with no heap management
CPU spikes during aggregationsOn-demand indexing; query compute scales independently
Cluster instabilityStateless architecture; no cluster state to synchronize
Security vulnerabilitiesSimplified attack surface; data stays in your S3 bucket
Data loss & recovery issuesS3 provides 99.999999999% durability; automatic recovery
Operational complexityNo shards, no heap tuning, no capacity planning
Troubleshooting gapsSimple architecture with clear failure modes

Performance Comparison

90%
Lower Cost
99.99%
Uptime
< 1s
Query Latency
10x
Less Infrastructure

Why Teams Choose KalDB

Built for S3 from Day One

Not a bolt-on. S3 is the foundation, not an afterthought. True cloud-native architecture.

OpenSearch API Compatible

Migrate in hours, not months. Your existing tools and dashboards just work.

Production-Proven at Slack

Battle-tested at scale. Handling petabytes of logs for one of the world's largest collaboration platforms.

Apache 2.0 Open Source

No license games, no usage restrictions. Fork it, modify it, deploy it anywhere.

Ready to Make the Switch?

Try KalDB open source today or talk to us about production deployment

# Get started in minutes
git clone https://github.com/slackhq/kaldb
cd kaldb && docker-compose up
✓ Running on localhost:8080