This helps explain what I was seeing. The message was not deterministic. That helps.
I would still prefer an API that allows developers to ask the session how many tokens they have processed.
Topic:
Machine Learning & AI
SubTopic:
Foundation Models