Home/Docs/Agent Stalls & Nudges

Agent Stalls & Nudges

Documentation and guides for OutcomeDev.

When you run complex tasks on OutcomeDev, the AI agent works behind the scenes to process your instructions, write code, run commands, and review output. Sometimes, this process can pause, requiring a "nudge" to get the agent moving again.

This guide explains why agents stall, why different models behave differently, and how OutcomeDev's nudge system handles this automatically.

Why Do Agents Stall?

An agent can appear to "stall" or pause its execution loop for several technical reasons:

1. Max Output Token Limits

Every AI model has a hard limit on the amount of text (tokens) it can generate in a single response (often 4,000 or 8,000 tokens). If a model is writing a large file or doing heavy analysis, it might simply hit its limit mid-thought. When this happens, it physically stops generating, and needs a prompt like "Continue working on the task" to pick up exactly where it left off.

2. API Latency & Timeouts

Behind the scenes, OutcomeDev runs a strict stall detector that monitors the agent's activity. If there is no activity (no tool calls, no file edits) for 3 minutes, the platform assumes the connection to the model provider has dropped or timed out.

3. Tool Calling Format Failures

The agent executes code by formatting special "tool calls" (e.g., reading a file or running a shell command). Some models are extremely consistent at this, while others occasionally output malformed JSON or get confused by the sequence of tools. When this happens, the backend silently pauses rather than executing a destructive command. A nudge acts as a safe restart, telling the agent to try formatting its thought again.

4. The Sandbox Environment (System-Level Operations)

It's critical to remember that OutcomeDev agents are not standard "web-based chat bots." Our agents are given access to a fully isolated Vercel Sandbox VM (with dedicated compute, RAM, and a full Linux filesystem).

Because the agent has system-level access, it performs heavy operations like:

  • Running npm install for large dependencies.
  • Building and compiling entire Next.js or Vite applications.
  • Searching through thousands of lines of code via grep commands.
  • Managing git trees and repositories.

Unlike a fast API response, running a real computer takes time. If an agent executes a build script that takes 90 seconds to run, the frontend may interpret the silence as a "stall." In reality, the agent is just waiting for the computer to finish its job. A nudge in these cases simply forces the agent to check the terminal output and continue.

Why Do Models Have Different "Cadences"?

You may notice that DeepSeek and Kimi tend to require more nudges than MiniMax or Claude. This is perfectly normal and relates to how each provider manages their infrastructure:

  • DeepSeek & Kimi: These models often experience higher API latency under global load or employ stricter rate limits per minute. They also engage in long "thinking" or reasoning phases. This may occasionally trigger the 3-minute timeout, resulting in an automatic nudge.
  • Token Plans & MiniMax: If you are using OutcomeDev with specific token plans (e.g., high-tier MiniMax usage), your requests may be routed through priority infrastructure. This means faster response times, fewer dropped connections, and an uninterrupted stream of thought, significantly reducing the need for nudges.
  • Verbosity: Models that are naturally more verbose will hit their max output token limits faster, leading to more frequent pauses.

The Auto-Nudge System (The Taskmaster)

To ensure you don't have to babysit long-running or autonomous tasks, OutcomeDev includes an Auto-Nudge feature. Internally, we affectionately refer to this system as the Taskmaster, accompanied by his faithful Watchdog.

When the system detects an agent has been silent for over 3 minutes, the Watchdog starts growling—this is the yellow warning banner you see appearing at the bottom of the chat.

The Taskmaster gives the agent a 30-second grace period. If you watch closely, it is a beautiful thing to witness: often, the agent is just finishing a heavy system compilation. The agent will suddenly output a new thought or tool call, the Watchdog will calm down, and the warning banner will smoothly disappear before the Taskmaster ever has to crack the whip.

If the 30 seconds pass without any progress, the Taskmaster steps in and automatically injects a "Continue working on the task" message to forcefully resume the agent's workflow.

Auto-Nudge Limits

To prevent endless loops if a model is genuinely broken or failing, the Taskmaster will only auto-nudge a maximum of 2 times.

If the agent stalls a third time, the auto-nudge will not fire. The banner will update to inform you that manual intervention is needed. At this point, you can inspect the sandbox, check the terminal logs, and either manually nudge the agent or provide specific instructions to guide it out of its rut.

Manual Nudging

You can manually nudge an agent at any time by clicking the "Nudge Now" button in the warning banner. This is useful if you know the model has paused and you want to instantly resume its work without waiting for the auto-nudge timer.