Answer
Reduce the number of model calls, use smaller models when possible, and shift work into deterministic tools (search, DB queries, APIs). Centralizing tool access behind a single endpoint can reduce integration overhead; Dedalus provides that gateway pattern along with an SDK.
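As a rough illustration of that ordering (deterministic tool first, then a small model, then a large one), here is a minimal Python sketch. Every name in it (`try_deterministic`, `answer`, `small_model`, `large_model`, `is_good_enough`) is hypothetical and chosen for illustration; it does not use the Dedalus SDK or any real model API, and the callables stand in for whatever client you actually use.

```python
import re
from typing import Callable, Optional, Tuple


def try_deterministic(query: str) -> Optional[str]:
    """Answer simple integer arithmetic such as '12 * 7' with no model call.

    Hypothetical stand-in for any deterministic tool (search, DB query, API).
    Returns None when the query is not something this tool can handle.
    """
    match = re.fullmatch(r"\s*(\d+)\s*([+\-*])\s*(\d+)\s*", query)
    if match is None:
        return None
    a, op, b = int(match.group(1)), match.group(2), int(match.group(3))
    return str({"+": a + b, "-": a - b, "*": a * b}[op])


def answer(
    query: str,
    small_model: Callable[[str], str],
    large_model: Callable[[str], str],
    is_good_enough: Callable[[str], bool],
) -> Tuple[str, str]:
    """Return (answer, route), trying the cheapest option first."""
    # 1. Deterministic tool: no model cost at all.
    result = try_deterministic(query)
    if result is not None:
        return result, "tool"

    # 2. Small model: accept its draft if it passes the caller's check.
    draft = small_model(query)
    if is_good_enough(draft):
        return draft, "small"

    # 3. Large model: only reached when the cheaper routes fall short.
    return large_model(query), "large"
```

The quality check (`is_good_enough`) is the part that needs the most care in practice; if it is too strict, most queries pay for both a small and a large call.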
Nuances & Considerations
The biggest cost driver is usually repeated model calls from loops or retries, so restrict which tools the agent may call and cap the number of steps it can take per task.
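One way to limit steps is a hard budget on model calls that counts retries too. The sketch below assumes a hypothetical `call_model` callable that takes the conversation history and returns a string, signals a transient failure with `RuntimeError`, and marks a finished answer with a `FINAL:` prefix; none of these conventions come from a real SDK.

```python
from typing import Callable, Dict, List, Optional, Union

Result = Dict[str, Union[Optional[str], int]]


def run_with_budget(
    call_model: Callable[[List[str]], str],
    task: str,
    max_calls: int = 5,
    max_retries: int = 1,
) -> Result:
    """Run an agent loop that stops at a fixed number of model calls.

    Every attempt, including a retry, counts against max_calls, so the
    worst-case cost of one task is known in advance.
    """
    history: List[str] = [task]
    calls = 0

    while True:
        failures = 0
        while True:
            if calls >= max_calls:
                # Budget exhausted: stop instead of looping further.
                return {"answer": None, "calls": calls, "stopped": "budget"}
            calls += 1
            try:
                reply = call_model(history)
                break
            except RuntimeError:
                failures += 1
                if failures > max_retries:
                    # Too many failures on this step: give up.
                    return {"answer": None, "calls": calls, "stopped": "retries"}

        if reply.startswith("FINAL:"):
            answer = reply[len("FINAL:"):].strip()
            return {"answer": answer, "calls": calls, "stopped": "done"}

        # Not finished yet: keep the intermediate reply and take another step.
        history.append(reply)
```

Returning the reason the loop stopped makes it easy to log how often tasks hit the budget, which is a useful signal for tuning `max_calls`.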