Refactor RoutineDiscoveryAgent to an LLM-driven agentic loop with tools + add execute_routine tool#121
Conversation
RoutineDiscoveryAgent from a hardcoded sequential pipeline to an LLM-driven agentic loop with tools
…o refactor_discovery_agent
|
Few thoughts: I would probably yank these for now then add back in when agent works: -- Edited at 6:24p, 1/30 Dima: Potentially remove queue management tools and manually write them into the agent run flow? Alternatively be very specific about the queue conditions when the agent can exit the loop |
|
could likely consolidate these |
consolidated in commit cc148fd |
I decided to keep |
RoutineDiscoveryAgent from a hardcoded sequential pipeline to an LLM-driven agentic loop with toolsRoutineDiscoveryAgent to an LLM-driven agentic loop with tools + add "execute_routine" tool
RoutineDiscoveryAgent to an LLM-driven agentic loop with tools + add "execute_routine" toolRoutineDiscoveryAgent to an LLM-driven agentic loop with tools + add execute_routine tool
Code ReviewFound 2 high-signal issues that need attention: Issue 1: Set ordering bug in
|
|
Some incorrect behaviors:
|
Updated in commit 0086491 Now we tell the LLM to use |
Code Review - High Signal Issues FoundI've reviewed PR #121 and found 4 high-severity issues that need to be addressed: 1. 🐛 Non-deterministic transaction ordering in
|
Fixed in commit 8779564 |
RoutineDiscoveryAgent to an LLM-driven agentic loop with tools + add execute_routine toolRoutineDiscoveryAgent to an LLM-driven agentic loop with tools + add execute_routine tool
…/bluebox-sdk into refactor_discovery_agent
…o refactor_discovery_agent
There's a naming mismatch between extracted raw variables and routine parameters. In To resolve this, I added a mapping in
|
Code ReviewFound 2 CLAUDE.md compliance issues: 1. Missing return type annotation on
|
Code reviewNo issues found. Checked for bugs and CLAUDE.md compliance. |
I think having another LLM call to "sanity check execution result content" could be potentially useful, but it needs the context of the user intent. If we don't design it carefully, it might not yield any benefits. How about we tackle it after this overhaul and tracking it here #130? |
Updated messages below, in commit 2b3b1e1 |









This PR
RoutineDiscoveryAgentfrom a hardcoded sequential pipeline to an LLM-driven agentic loop with toolsexecute_routinetool as part of the agent's loop for catching errors before finalizing the routine