LocalLLaMA

how are people actually debugging bad outputs in agent / RAG pipelines?

been messing around with some agent / RAG pipelines and keep running into cases where everything executes fine (tool calls return expected outputs, parsing works, etc.) but the final answer is still wrong or slightly off. nothing crashes, just bad outputs. curious how …