Message Compaction

When agents run for extended periods, they accumulate a large history of messages that eventually fills up the LLM's context window, causing errors when the token limit is exceeded. The message compaction feature helps prevent this by providing agents with awareness of their token usage and tools to manage their context window.

How It Works

Token Usage Tracking

MyCoder's LLM abstraction tracks and returns:

Total tokens used in the current completion request
Maximum allowed tokens for the model/provider

This information is used to monitor context window usage and trigger appropriate actions.

Status Updates

Agents receive status updates with information about:

Current token usage and percentage of the maximum
Cost so far
Active sub-agents and their status
Active shell processes and their status
Active browser sessions and their status

Status updates are sent:

Every 5 agent interactions (periodic updates)
Whenever token usage exceeds 50% of the maximum (threshold-based updates)

Example status update:

--- STATUS UPDATE ---
Token Usage: 45,235/100,000 (45%)
Cost So Far: $0.23

Active Sub-Agents: 2
- sa_12345: Analyzing project structure and dependencies
- sa_67890: Implementing unit tests for compactHistory tool

Active Shell Processes: 3
- sh_abcde: npm test
- sh_fghij: npm run watch
- sh_klmno: git status

Active Browser Sessions: 1
- bs_12345: https://www.typescriptlang.org/docs/handbook/utility-types.html

Your token usage is high (45%). It is recommended to use the 'compactHistory' tool now to reduce context size.
--- END STATUS ---