Preview — See Feature availability. Memory APIs and workflow hooks are implemented. The dedicated Memory nav in the Console is off in production (
enableMemory false). Use the Memory API, Platform MCP, and Autopilot memory tools in enabled environments.What memory stores
| Source | Episode type | How it gets in |
|---|---|---|
| Chat conversations | chat_conversation | Index conversation API or auto-index on save |
| Work files | work_file | Index file API |
| Workflow step output | run_step | Runtime memory hooks on step completion |
| Manual / scripts | external_event | Direct extract API |
When to use memory
| Use case | Approach |
|---|---|
| Agent recalls past decisions | Search memory before LLM steps (prepare-step or explicit search) |
| Onboard context from chat history | Index key conversations after milestones |
| Document Q&A across uploads | Index files from work-service, search at query time |
| Workflow learns from prior runs | run_step episodes capture step outputs automatically |
{{steps.*}} templates are run-scoped. Memory persists across runs.
Core operations
Search
Semantic search over confirmed long-term memory:memory_search. Requires project_contributor.
Index a conversation
Pull a chat transcript and queue extraction:GET /v1/memory/extract/{jobId} for status.
Index a file
Index a work-service file for extraction:Batch extraction
For large backfills, use batch mode:Get an episode
Retrieve a stored episode bundle:?detail=false to omit item payloads.
Agent memory tools
Agents and Autopilot can call memory tools via:| Tool | Purpose |
|---|---|
memory_search | Semantic search |
memory_get_episode | Fetch episode by ID |
memory_get_continuation | Latest open episode for a conversation |
memory_propose_items | Propose candidate memory items |
memory_confirm_item | Promote item to confirmed long-term memory |
Workflow integration
The workflow runtime can call prepare-step memory hooks before LLM steps, injecting retrieved context into the prompt. If memory is unavailable, the run continues without LTM context (logged as a warning, not a hard failure). Pattern:- Index relevant conversations/files into memory
- LLM workflow steps benefit from prepare-step context automatically when configured
- Optionally add an explicit
memory_searchvia Platform MCP or a future builtin tool step
Billing
Memory actions consume credits:| Action | Typical cost |
|---|---|
memory_extract_interactive | ~10 credits per job |
memory_extract_bulk | ~2 credits per job |
memory_index_file_mb | ~1 credit per MiB |
Auto-indexing (when enabled)
BFF can auto-index chat when memory is saved, controlled by deployment config:ChatMemoryAutoIndexMode—off,episode_narrow, orfull_ltmChatEpisodeOnTurn— episode append behavior per turn
Limitations
- No dedicated Memory browser in production Console yet
- Search quality depends on indexed content — empty memory returns empty results
- Extraction is async — poll job status before assuming facts are available
- Memory is tenant/project isolated — no cross-tenant search
Troubleshooting
| Issue | Fix |
|---|---|
503 memory kernel not configured | Memory service not deployed in this environment |
queue unavailable | Extract worker or Redis queue down — contact support |
work file not found | Verify file_id exists in work-service |
| Search returns nothing | Confirm indexing job completed (GET /v1/memory/extract/{jobId}) |
| High credit usage | Prefer bulk extraction for large backfills; set indexing scope narrowly |
Related docs
- Autopilot and chat — chat threads that feed memory
- Platform MCP —
memory_searchand related tools - Workflow patterns — memory-enriched LLM steps