
ADR-001: Adopt Recursive Language Model (RLM) Pattern

Status: Accepted

Large Language Models (LLMs) have finite context windows that limit the amount of information they can process in a single interaction. When working with large codebases, documents, or long-running conversations, this limitation becomes a significant constraint. Users need a way to efficiently manage and retrieve relevant context for LLM interactions without manually curating what fits in the context window.

The research paper arXiv:2512.24601 introduces the Recursive Language Model (RLM) pattern, which provides a systematic approach to context management through recursive summarization and intelligent retrieval.

Problems:

  1. Fixed context windows: LLMs cannot process arbitrarily large inputs, forcing users to manually select relevant portions
  2. Lost context: In long conversations or large documents, important earlier context gets dropped
  3. Inefficient retrieval: Without semantic understanding, keyword search often returns irrelevant results
  4. No state persistence: Each LLM interaction starts fresh without memory of previous sessions

Goals:

  1. Context efficiency: Enable processing of arbitrarily large documents within LLM context limits through intelligent chunking and retrieval
  2. Semantic relevance: Retrieve contextually relevant information rather than just keyword matches
  3. Research foundation: Build on peer-reviewed research (arXiv:2512.24601) rather than ad-hoc solutions

Quality attributes:

  1. Extensibility: Pattern should support multiple chunking strategies and embedding models
  2. Performance: Minimize latency in context retrieval operations
  3. Simplicity: Keep the core abstraction simple enough for CLI usage

Option 1: Recursive Language Model (RLM) Pattern

Description: Implement the RLM pattern with recursive summarization, semantic chunking, and hybrid search for context retrieval.

Technical Characteristics:

  • Sliding window context management
  • Semantic embeddings for similarity search
  • Recursive summarization for compression (see the sketch after this list)
  • State persistence across sessions
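
To make the compression step concrete, here is a minimal sketch of recursive summarization, assuming a placeholder `llm_summarize` that stands in for a real model call (it is not an rlm-rs or paper API): split the text into fixed windows, summarize each window, and recurse on the joined summaries until the result fits a budget.

```rust
/// Placeholder for a real LLM call; here it just truncates so the
/// example runs standalone.
fn llm_summarize(text: &str) -> String {
    text.chars().take(200).collect()
}

/// Recursively compress `text` until it fits within `budget` bytes.
fn recursive_summarize(text: &str, budget: usize, window: usize) -> String {
    if text.len() <= budget {
        return text.to_string();
    }
    // Summarize each fixed-size window independently.
    let summaries: Vec<String> = text
        .as_bytes()
        .chunks(window)
        .map(|w| llm_summarize(&String::from_utf8_lossy(w)))
        .collect();
    let reduced = summaries.join("\n");
    // Defensive stop if the summarizer failed to compress.
    if reduced.len() >= text.len() {
        return reduced.chars().take(budget).collect();
    }
    recursive_summarize(&reduced, budget, window)
}
```

A production version would operate on token windows rather than bytes and carry chunk metadata through each level, but the recursive contraction above is the essence of the pattern.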

Advantages:

  • Research-backed approach with theoretical foundation
  • Handles arbitrarily large inputs
  • Maintains semantic coherence in retrieved context
  • Supports incremental context building

Disadvantages:

  • Complexity in implementing recursive summarization
  • Requires embedding model infrastructure
  • Learning curve for users unfamiliar with the pattern

Risk Assessment:

  • Technical Risk: Medium. Novel pattern requires careful implementation
  • Schedule Risk: Medium. Research translation to production code takes time
  • Ecosystem Risk: Low. Uses standard ML/NLP components

Option 2: Simple Truncation with Keyword Search

Description: Truncate documents to fit the context window and rely on keyword-based retrieval.

Technical Characteristics:

  • Fixed-size chunking
  • BM25 or TF-IDF keyword search (illustrated in the sketch after this list)
  • No semantic understanding
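
As a contrast with Option 1, a hypothetical few-line sketch of this approach shows its blind spot: chunks are ranked by literal term overlap, so a query for "vehicle" can never surface a chunk that only mentions "car".

```rust
/// Score a chunk by how many query terms it literally contains.
fn keyword_score(chunk: &str, query: &str) -> usize {
    let chunk = chunk.to_lowercase();
    query
        .split_whitespace()
        .filter(|term| chunk.contains(&term.to_lowercase()))
        .count()
}

/// Fixed-size chunking plus keyword ranking: simple, but blind to
/// synonyms and paraphrase. (Chunks that split a multi-byte character
/// are dropped; acceptable for a sketch.)
fn best_chunk<'a>(doc: &'a str, query: &str, size: usize) -> Option<&'a str> {
    doc.as_bytes()
        .chunks(size)
        .map(|c| std::str::from_utf8(c).unwrap_or(""))
        .max_by_key(|c| keyword_score(c, query))
}
```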

Advantages:

  • Simple to implement
  • No ML dependencies
  • Fast retrieval

Disadvantages:

  • Loses semantic context
  • Poor handling of synonyms and related concepts
  • No intelligent summarization

Disqualifying Factor: Cannot maintain semantic coherence across large documents, defeating the purpose of intelligent context management.

Risk Assessment:

  • Technical Risk: Low. Well-understood approach
  • Schedule Risk: Low. Quick to implement
  • Ecosystem Risk: Low. No dependencies

Option 3: External RAG Service

Description: Integrate with external Retrieval-Augmented Generation (RAG) services.

Technical Characteristics:

  • API-based retrieval
  • Cloud-hosted embeddings and search
  • Managed infrastructure

Advantages:

  • No local ML infrastructure needed
  • Scalable
  • Managed updates

Disadvantages:

  • Network dependency
  • Privacy concerns with sending data to external services
  • Cost at scale
  • Vendor lock-in

Disqualifying Factor: Privacy concerns and network dependency conflict with CLI-first, local-first design goals.

Risk Assessment:

  • Technical Risk: Low. Mature services available
  • Schedule Risk: Low. Quick integration
  • Ecosystem Risk: High. Vendor dependency

Decision

Adopt the Recursive Language Model (RLM) pattern, as described in arXiv:2512.24601, as the foundational architecture for context management.

The implementation will use the following components; a rough storage-and-retrieval sketch follows the list:

  • Semantic chunking for intelligent document segmentation
  • Embedding vectors for semantic similarity search
  • SQLite storage for persistent state and chunk management
  • Hybrid search combining semantic and keyword approaches
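
As a rough illustration of how the last three components could fit together, the sketch below assumes the rusqlite crate; the table layout, the embedding serialization, and the scoring weight are invented for this example and are not the actual rlm-rs schema.

```rust
use rusqlite::{params, Connection, Result};

fn cosine(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if na == 0.0 || nb == 0.0 { 0.0 } else { dot / (na * nb) }
}

fn main() -> Result<()> {
    let conn = Connection::open_in_memory()?;
    conn.execute_batch(
        "CREATE TABLE chunks (
             id        INTEGER PRIMARY KEY,
             text      TEXT NOT NULL,
             embedding BLOB NOT NULL -- f32 vector, little-endian
         );",
    )?;

    // Store a chunk with its (made-up) embedding as raw bytes.
    let emb = [0.1f32, 0.9, 0.3];
    let blob: Vec<u8> = emb.iter().flat_map(|f| f.to_le_bytes()).collect();
    conn.execute(
        "INSERT INTO chunks (text, embedding) VALUES (?1, ?2)",
        params!["fn main() { /* ... */ }", blob],
    )?;

    // Hybrid retrieval: blend cosine similarity with keyword overlap.
    let query_emb = [0.2f32, 0.8, 0.1];
    let query_terms = ["main"];
    let alpha = 0.7f32; // weight on the semantic score

    let mut stmt = conn.prepare("SELECT text, embedding FROM chunks")?;
    let rows = stmt.query_map([], |row| {
        Ok((row.get::<_, String>(0)?, row.get::<_, Vec<u8>>(1)?))
    })?;

    let mut best: Option<(f32, String)> = None;
    for row in rows {
        let (text, bytes) = row?;
        let emb: Vec<f32> = bytes
            .chunks_exact(4)
            .map(|b| f32::from_le_bytes(b.try_into().unwrap()))
            .collect();
        let kw = query_terms.iter().filter(|t| text.contains(*t)).count() as f32
            / query_terms.len() as f32;
        let score = alpha * cosine(&query_emb, &emb) + (1.0 - alpha) * kw;
        if best.as_ref().map_or(true, |(s, _)| score > *s) {
            best = Some((score, text));
        }
    }
    println!("best chunk: {:?}", best);
    Ok(())
}
```

A real implementation would add indexes and batch embedding generation, but blending a semantic score with a keyword score, as in the final loop, is the core of hybrid search.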

Consequences

Positive:

  1. Unlimited input size: Users can load arbitrarily large documents; the system handles chunking and retrieval automatically
  2. Semantic understanding: Retrieved context is semantically relevant, not just keyword matches
  3. Session persistence: Context and summaries persist across CLI invocations
  4. Research foundation: Implementation backed by peer-reviewed research provides confidence in approach

Negative:

  1. Complexity: More complex than simple truncation, requiring embedding infrastructure
  2. Initial latency: First-time embedding generation adds startup cost
  3. Storage overhead: Embeddings and chunks require disk space

Neutral:

  1. Learning curve: Users must understand chunking and retrieval concepts, but the CLI abstracts most of the complexity

The RLM pattern provides the theoretical and practical foundation for rlm-rs. It enables the core value proposition: efficient context management for LLM interactions with large documents and codebases.

Mitigations:

  • Lazy model loading to reduce cold-start impact (sketched below)
  • Efficient SQLite schema for fast retrieval
  • Clear CLI interface abstracting implementation details
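
For the first mitigation, one idiomatic Rust shape is `std::sync::OnceLock`; the `EmbeddingModel` type below is hypothetical, not an actual rlm-rs identifier.

```rust
use std::sync::OnceLock;

/// Hypothetical handle for whatever embedding model rlm-rs loads.
struct EmbeddingModel;

impl EmbeddingModel {
    fn load() -> Self {
        // Expensive, one-time initialization would happen here.
        EmbeddingModel
    }
}

static MODEL: OnceLock<EmbeddingModel> = OnceLock::new();

/// Loaded on first use, so subcommands that never touch embeddings
/// pay no model-loading cost at startup.
fn model() -> &'static EmbeddingModel {
    MODEL.get_or_init(EmbeddingModel::load)
}
```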

Metadata:

  • Date: 2025-01-01
  • Source: arXiv:2512.24601 research paper
  • Related ADRs: ADR-003, ADR-006, ADR-007, ADR-008

Compliance Review

Status: Compliant

Findings:

| Finding | Files | Lines | Assessment |
| --- | --- | --- | --- |
| RLM pattern implemented in core library | src/lib.rs | L1-L50 | compliant |
| Chunking strategies available | src/chunking/ | all | compliant |
| Embedding infrastructure present | src/embedding/ | all | compliant |
| SQLite persistence implemented | src/storage/ | all | compliant |

Summary: The RLM pattern has been fully implemented as the foundational architecture.

Action Required: None