Thanks for your reply. This is what we already did: we ran inference on the GPU only, on the CPU only, and on the M1 Neural Engine only, and we observed the same unstable results in all three cases. We know that floating-point summation can cause problems, especially when it is performed in parallel, because the result depends on the order in which the values are added.
And yes, we will try to contact an Apple developer directly if there is a chance to reach one.
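
To make the floating-point point concrete, here is a minimal sketch in plain Python (not our actual model or any Apple API) of how the summation order alone changes a result. The helper names `naive_sum` and `chunked_sum` and the input values are made up for illustration; `chunked_sum` only imitates what a parallel reduction might do by adding up partial sums per chunk.

```python
import math

def naive_sum(values):
    """Add values strictly left to right, rounding after every addition."""
    total = 0.0
    for v in values:
        total += v
    return total

def chunked_sum(values, num_chunks):
    """Imitate a parallel reduction: sum each chunk, then combine the partial sums."""
    size = math.ceil(len(values) / num_chunks)
    partials = [naive_sum(values[i:i + size]) for i in range(0, len(values), size)]
    return naive_sum(partials)

# Floating-point addition is not associative.
left = (0.1 + 0.2) + 0.3
right = 0.1 + (0.2 + 0.3)
print(left == right)  # False

# Hypothetical input where small values are absorbed by large ones.
values = [1e16, 1.0, -1e16, 1.0]

sequential = naive_sum(values)     # one thread, left to right
chunked = chunked_sum(values, 2)   # two "workers", each summing half
exact = math.fsum(values)          # correctly rounded reference

print(sequential, chunked, exact)  # 1.0 0.0 2.0
```

The same four numbers give three different sums depending on how the additions are grouped. This would explain differences between compute units or between differently scheduled runs, but on its own it may not explain results that change from run to run on the same unit with the same input.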