Overlap Calculator

Calculate optimal chunk overlap for RAG

What is Chunk Overlap?

When splitting documents into chunks for RAG, overlap is the number of tokens shared between adjacent chunks. It ensures important context at chunk boundaries isn't lost during retrieval.

This calculator helps you understand the trade-offs between overlap percentage, storage overhead, and retrieval quality.

Key Concepts

Effective Stride

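The distance, in tokens, between the start of one chunk and the start of the next: Stride = Chunk Size - Overlap. For example, a 512-token chunk with a 64-token overlap advances 448 tokens per chunk.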
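The stride formula can be illustrated with a minimal Python sketch that lists where each chunk would start. The function name chunk_starts and its parameters are hypothetical, chosen for illustration only; it assumes the document is already tokenized and that the overlap is strictly smaller than the chunk size.

```python
def chunk_starts(total_tokens: int, chunk_size: int, overlap: int) -> list[int]:
    """Return the start index of each chunk (hypothetical helper)."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    if total_tokens <= 0:
        return []

    stride = chunk_size - overlap  # Stride = Chunk Size - Overlap
    starts = []
    pos = 0
    while True:
        starts.append(pos)
        # Stop once the current chunk reaches the end of the document.
        if pos + chunk_size >= total_tokens:
            break
        pos += stride
    return starts


print(chunk_starts(1000, 512, 64))  # [0, 448, 896]
```

With a stride of 448, a 1000-token document needs three chunks; the last one is shorter than 512 tokens because the document ends first.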

Token Overhead

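Extra tokens stored because each overlapping region appears in two adjacent chunks. More overlap means more chunks for the same document, and therefore higher storage and embedding costs.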
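As a rough sketch of how this overhead could be measured, the Python function below walks the chunk positions and counts how many tokens end up stored in total. The name token_overhead is hypothetical, and the sketch assumes a final partial chunk is kept rather than dropped or padded.

```python
def token_overhead(total_tokens: int, chunk_size: int, overlap: int) -> tuple[int, int]:
    """Return (stored_tokens, extra_tokens) for one document (hypothetical helper)."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    stride = chunk_size - overlap
    stored = 0
    pos = 0
    while pos < total_tokens:
        end = min(pos + chunk_size, total_tokens)  # last chunk may be shorter
        stored += end - pos
        if end == total_tokens:
            break
        pos += stride
    # Extra tokens are everything stored beyond the original document length.
    return stored, stored - total_tokens


stored, extra = token_overhead(1000, 512, 64)
print(stored, extra)  # 1128 128
```

For long documents the stored total approaches total_tokens * chunk_size / stride, so a 64-token overlap on 512-token chunks adds roughly 64 / 448, or about 14%, to storage and embedding volume.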

Recommended Settings

Use Case             Overlap
General text         10-15%
Complex reasoning    15-25%
Code / structured    5-10%
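To turn these percentages into token counts, a small lookup like the following Python sketch may help. It assumes the percentages are measured relative to chunk size; the dictionary keys and the function name overlap_token_range are hypothetical and not part of the calculator itself.

```python
# Recommended overlap ranges from the table above, as (low %, high %).
# Keys are illustrative labels, not identifiers used by the calculator.
RECOMMENDED_OVERLAP_PCT = {
    "general_text": (10, 15),
    "complex_reasoning": (15, 25),
    "code_structured": (5, 10),
}


def overlap_token_range(chunk_size: int, use_case: str) -> tuple[int, int]:
    """Return the (min, max) overlap in tokens, rounded down (hypothetical helper)."""
    low_pct, high_pct = RECOMMENDED_OVERLAP_PCT[use_case]
    # Integer arithmetic avoids floating-point rounding surprises.
    return chunk_size * low_pct // 100, chunk_size * high_pct // 100


print(overlap_token_range(512, "general_text"))  # (51, 76)
```

Rounding down is a design choice here; rounding to the nearest token would be equally reasonable at these sizes.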

FAQ

Can I have zero overlap?

Yes, but you risk splitting important context at chunk boundaries, potentially hurting retrieval quality.