Code Mode

For developers building agents or tools that talk to Thoughtbox directly. If you just want your existing agent to use Thoughtbox, you don't need this page — see the Quickstart. Reference for the code your agent runs, not for you.

What Code Mode is

Thoughtbox implements the Code Mode pattern, originated by Cloudflare and explored in parallel by Anthropic. It's a server-side MCP architecture that solves two well-known scaling problems with conventional MCP:

Context bloat. Every individual MCP tool definition loads into the LLM's context window whether the agent uses it or not. Cloudflare measured 1.17 million tokens to expose their full API as individual tools — exceeding most context windows entirely.
Round-trip overhead. Each conventional tool call is a full inference round-trip. A 20-step workflow burns 20 round-trips, with intermediate data passing back through the model just to feed the next call.

Code Mode collapses the entire API into two meta-tools:

search — the LLM writes a code snippet to discover available operations. The catalog never enters the LLM's context wholesale.
execute — the LLM writes a script that chains operations, handles conditionals and pagination, and returns only the final result. The script runs in an isolated server-side sandbox.

Reported impact in production:

Cloudflare — 2,500+ API endpoints, 1.17M tokens → ~1,000 tokens. ~99.9% reduction.
Anthropic — Drive-to-Salesforce workflow, 150,000 tokens → 2,000 tokens. ~98.7% reduction.

The pattern works because LLMs are trained on millions of lines of real-world code and only synthetic tool-call schemas. Code is the LLM's native orchestration language.

How Thoughtbox uses Code Mode

Thoughtbox exposes exactly two MCP tools, matching the pattern: thoughtbox_search and thoughtbox_execute. The agent writes JavaScript inside execute against the tb SDK. The SDK calls run server-side; only return values come back to the model.

Sandbox: Node vm module with a 30-second timeout (10 seconds for thoughtbox_search). The sandbox exposes only the tb SDK and setTimeout/clearTimeout — no filesystem, no require, no network access except what the SDK calls do server-side. Authentication is resolved server-side from the API key, so credentials never appear in the LLM-visible parameters.

Note: node:vm is a workload separation primitive, not a hard security boundary. It's adequate for trusting the LLM not to actively attack the host, but it's not the same isolation level as Cloudflare's V8 isolates or a separate process.

`thoughtbox_search` — discover operations

Read-only. 10-second timeout. The agent writes an arrow function that receives the catalog and returns whatever it wants.


async (catalog) => catalog.operations.session

async (catalog) => {
  const out = {};
  for (const [mod, ops] of Object.entries(catalog.operations)) {
    for (const [name, op] of Object.entries(ops)) {
      if (name.includes("export") || op.description.includes("export")) {
        out[`${mod}.${name}`] = op;
      }
    }
  }
  return out;
}

The catalog is organized by module: session, thought, knowledge, notebook, theseus, ulysses, observability. It also exposes prompts, resources, and resourceTemplates.

`thoughtbox_execute` — run operations

30-second timeout. The agent writes an arrow function using the tb SDK. Results come back unwrapped — no MCP envelope parsing.


async () => {
  return await tb.thought({
    thought: "Retry logic fails because the backoff multiplier resets on partial success.",
    thoughtType: "reasoning",
    nextThoughtNeeded: true,
    confidence: "medium"
  });
}

The `tb` SDK

`tb.thought(input)`

Record a single thought. See Sessions & Thoughts for what each thought type means.

Required: thought, thoughtType (one of seven values), nextThoughtNeeded. Optional: confidence, branchId, branchFromThought, isRevision, revisesThought (a thought number, not a UUID), sessionTitle, sessionTags, plus type-specific payloads (options, actionResult, beliefs, assumptionChange, contextData, progressData).

`tb.session.*`

| Method | Description | |--------|-------------| | tb.session.list({ limit?, offset?, tags? }) | List sessions, optionally filtered by tag | | tb.session.get(sessionId) | Get a session and all its thoughts | | tb.session.search(query, limit?) | Full-text search across sessions | | tb.session.resume(sessionId) | Continue an existing session — subsequent tb.thought calls append to it | | tb.session.export(sessionId, format?) | Export as "markdown", "cipher", or "json" | | tb.session.analyze(sessionId) | Return structural metrics (linearity, revision rate, depth, density, convergence) | | tb.session.extractLearnings(sessionId, args?) | Pull patterns from key moments. args is { keyMoments: [...], targetTypes: [...] } |

`tb.knowledge.*`

| Method | Description | |--------|-------------| | tb.knowledge.createEntity({ name, type, label, properties?, visibility? }) | Create an entity | | tb.knowledge.getEntity(entityId) | Get an entity by UUID | | tb.knowledge.listEntities({ types?, name_pattern?, limit?, offset? }) | List entities with optional filters | | tb.knowledge.addObservation({ entity_id, content }) | Add a timestamped observation to an entity | | tb.knowledge.createRelation({ from_id, to_id, relation_type }) | Create a directed edge | | tb.knowledge.queryGraph({ start_entity_id, max_depth?, relation_types? }) | Traverse the graph | | tb.knowledge.stats() | Aggregate counts |

Entity types (5): Insight, Concept, Workflow, Decision, Agent. Relation types (9): RELATES_TO, BUILDS_ON, CONTRADICTS, EXTRACTED_FROM, APPLIED_IN, LEARNED_BY, DEPENDS_ON, SUPERSEDES, MERGED_FROM.

addObservation and queryGraph take a single object argument. Positional forms return "Missing required parameter".

Examples

Record a decision


async () => tb.thought({
  thought: "Choosing between Redis and in-memory caching for session state.",
  thoughtType: "decision_frame",
  nextThoughtNeeded: true,
  options: [
    { label: "Redis — durable, shared across instances", selected: false },
    { label: "In-memory — zero dependencies", selected: false }
  ]
})

List recent sessions


async () => tb.session.list({ limit: 5 })

Export a session as markdown


async () => tb.session.export("session-uuid", "markdown")

Build a knowledge graph entry


async () => {
  const entity = await tb.knowledge.createEntity({
    name: "rate-limiting-strategy",
    type: "Decision",
    label: "Chose token bucket over sliding window"
  });

  await tb.knowledge.addObservation({
    entity_id: entity.id,
    content: "Token bucket chosen for burst tolerance and simpler implementation."
  });

  return entity;
}

Code Mode

What Code Mode is

How Thoughtbox uses Code Mode

thoughtbox_search — discover operations

thoughtbox_execute — run operations

The tb SDK

tb.thought(input)

tb.session.*

tb.knowledge.*