Happy to answer whatever technical questions I can!
I would be wary of having a LLM with 85% accuracy call tools on my system. Isn’t that fairly far away from production-grade performance?
I also don’t see that the fact that accuracy can be boosted from 50% to 85% is any indication that it can be boosted further.
Great work from the Google ML teams, I’ll be trying this model out.