HOME/TECHNIQUE/Data & Context Engineering/Context & memory management

TECHNIQUE

Context & memory management

Data & Context Engineering

6APPLICATIONS

10OBSERVED OPERATORS

State of Practice

CROSS-VALIDATED — 12 OPERATORS

Across the teardown pool, context and memory management is implemented as explicit runtime state, retrieval, scoping, compression, and traceable memory infrastructure rather than as prompt text alone.

Observed Practices

Maintain runtime or session memory so later agent steps or turns can reuse prior context instead of re-sending raw strings manually.

8 of 12 evidence-pool operators show explicit runtime/session/state memory.

RipplingShopifyAirbnbLinkedInHalliburtonIberdrolaMetaUber

Retrieve external context from vector stores, knowledge bases, knowledge graphs, historical databases, or stored memories before generation.

9 of 12 evidence-pool operators show retrieval-backed context or stored memory retrieval.

RipplingAgodaLinkedInAirbnbHalliburtonNew ComputerIberdrolaMetaCleric

Scope or route context before invoking the model, using domain selection, query routing, intent routing, context loaders, agent selection, or gateways.

6 of 12 evidence-pool operators show pre-generation context scoping or routing.

RipplingLinkedInAirbnbHalliburtonIberdrolaUber

Reduce context size or improve retrieval precision with reranking, filters, chunking, semantic refinement, or snapshot-key indirection.

5 of 12 evidence-pool operators show explicit context compaction, filtering, chunking, or refinement.

RipplingAlibaba CloudHalliburtonNew ComputerLinkedIn

Keep memory at different scopes: same-turn or same-session context, per-agent session state, long-term user memory, cross-deployment generalized memory, or multi-round workflow state.

6 of 12 evidence-pool operators explicitly describe memory scope or persistence horizon.

LinkedInAirbnbIberdrolaNew ComputerClericMeta

Tie context and memory workflows to traces, feedback, evaluations, logging, or observability so teams can inspect what context was used and improve retrieval or behavior.

7 of 12 evidence-pool operators show trace, feedback, evaluation, logging, or observability around context/memory workflows.

RipplingClericNew ComputerAirbnbShopifyIberdrolaMeta

Where Operators Converge

Every observed operator externalizes some part of context or memory into an explicit mechanism: retrieved context, chat history, runtime state, vector or graph stores, key-value observation stores, workflow transcripts, tool context, or persistent memories.

Where Operators Diverge

Operators differ on what they persist as memory and for how long.

APPROACH 01

Runtime or session state is carried across turns or steps in the current workflow.

RipplingShopifyAirbnbLinkedInHalliburtonIberdrola

APPROACH 02

Longer-lived user, workflow, or cross-deployment memories are retained for future personalization or investigations.

New ComputerMetaCleric

APPROACH 03

Historical institutional records are retrieved as context, such as prior incidents, experiments, or asset relationships.

AgodaMetaLinkedIn

Operators use different controls for context overload.

APPROACH 01

Rank, filter, or refine retrieved context before it reaches the model.

RipplingNew ComputerLinkedIn

APPROACH 02

Store long observations behind snapshot keys and retrieve the full observation only when needed.

Alibaba Cloud

APPROACH 03

Chunk long source documents for retrieval.

Halliburton

Operators differ on who or what decides which context to fetch.

APPROACH 01

Developers declare required context or provide dynamic context retrievers.

Airbnb

APPROACH 02

Routing logic selects domains, node types, backends, or intents before retrieval or tool use.

RipplingLinkedInHalliburton

APPROACH 03

A master or smart-solver agent chooses specialized agents based on the incident or missing context.

Iberdrola

APPROACH 04

A gateway mediates tool-context access and enforces policy on tool invocations.

Uber

Watch Items

Context completeness and retrieval quality remain operational concerns: Agoda routes cases to human review when the LLM lacks full context, LinkedIn refines context with semantic search when inaccuracies arise, and New Computer reports that semantic search, keyword/BM25, and meta-field filters work better in different cases.

Large context payloads force explicit reduction strategies: Rippling reports a huge cross-domain data model and 100–500x context pruning, Alibaba Cloud identifies context-length pressure from large logs/table entries, and Halliburton chunks long manuals.

Stateful production workflows need traceability because context and memory span dynamic environments, parallel investigations, or long-running jobs.

Sensitive production context requires controls: Alibaba Cloud cites privacy risk from transmitting confidential production data to external APIs, Uber handles prompt injection, PII redaction, and tool-level redaction, and Cleric limits production access to read-only observability data.

Implementation Menu

CURATED DEFAULTS

Name	Kind	When	Maturity
Conversation summarization compaction	pattern	long sessions compressed into running summaries within budget	established
Vector-backed long-term memory	pattern	facts recalled across sessions by similarity, not kept in context	established
Letta (MemGPT)	library	agent memory tiers managed by the framework instead of hand-rolled	emerging

Observed in Production

6 APPS

TechnologyCROSS-VALIDATED

Context & memory management

State of Practice

Observed Practices

Where Operators Converge

Where Operators Diverge

Watch Items

Implementation Menu

Observed in Production

LLM-Assisted Code Review, Test Migration, and Agent Evaluation

AI Security Decision Audit and Incident Report Generation

LLM Application Quality Assurance

AI Agent Production Debugging with Logfire MCP and Investigation Memory

Change Request and CRM Account Linking Copilot

Marketplace Co-Sell Opportunity Signal Detection