Basically, if you have the actual "factual" information, use it directly instead of hoping the LLM will accurately extract it and pass it along as part of a function call. In this case they already know what the accurate URLs are, so just use them.
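Something like this is what I have in mind; the lookup table and key names are made up just to show the shape of it:

```python
# Minimal sketch: the model only picks an item by a short key, and the real
# URL comes from data we already trust, not from text the model retyped.
KNOWN_URLS = {  # hypothetical lookup table of URLs we already have on hand
    "docs": "https://example.com/docs",
    "billing": "https://example.com/billing",
}

def open_page(page_key: str) -> str:
    """Tool handler: resolve the key ourselves instead of trusting a
    model-generated URL string."""
    url = KNOWN_URLS.get(page_key)
    if url is None:
        raise ValueError(f"Unknown page key: {page_key!r}")
    return url  # ground-truth URL, never one the model reproduced
```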
Where I currently work, our function calls regularly fail only to succeed flawlessly on a retry. (I believe we're on the order of tens of millions of OpenAI calls a day.)
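For what it's worth, the workaround is just a dumb retry loop around the call. A rough sketch using the OpenAI Python SDK; the model name and schema details are placeholders, not our actual setup:

```python
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def call_with_retry(messages, tools, max_attempts=3):
    """Re-issue the same request until the model returns a tool call whose
    arguments actually parse as JSON."""
    for _ in range(max_attempts):
        response = client.chat.completions.create(
            model="gpt-4o-mini",  # placeholder model name
            messages=messages,
            tools=tools,
        )
        tool_calls = response.choices[0].message.tool_calls
        if not tool_calls:
            continue  # model answered in prose instead of calling the tool
        try:
            return json.loads(tool_calls[0].function.arguments)
        except json.JSONDecodeError:
            continue  # malformed arguments; this is the flaky case, retry
    raise RuntimeError("tool call failed on every attempt")
```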
These are non-deterministic systems. I wouldn't even trust them to accurately extract text unless you did a beam search or something similar to average out different LLM outputs.
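By "average out" I mean something crude like sampling the same extraction several times and keeping the most common answer; a rough sketch, with the actual LLM call left as a stand-in:

```python
from collections import Counter

def majority_extract(sample_once, n_samples=5):
    """Crude stand-in for beam search / self-consistency: run the same
    extraction several times and keep the answer that appears most often.
    `sample_once` is any callable that does one LLM extraction and returns
    a string (hypothetical; wire it to whatever client you use)."""
    answers = [sample_once().strip() for _ in range(n_samples)]
    answer, count = Counter(answers).most_common(1)[0]
    return answer, count / n_samples  # answer plus a rough agreement score
```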
I touch on this from my own experience: https://youtu.be/cs5cbxDClbM?si=IQIFAD38cVzLCs55&t=486