No. I explicitly watch the GPU and CPU loads in Performance Monitor, and I explicitly pin the ops to a device with tf.device.
In contrast, on a decent Linux GPU cluster, a Tesla V100 outperforms the CPU on the same code by 10x.
This is definitely an issue with tensorflow-metal, at least on macOS 11.6.
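
For anyone who wants to check this on their own machine, here is a minimal sketch of a CPU vs. GPU comparison using tf.device. It is not the code from my benchmark: the matrix multiply workload, the matrix size, and the helper names (median_seconds, matmul_on, compare_devices) are all assumptions made up for illustration, and it assumes TensorFlow 2.x running eagerly.

```python
import statistics
import time


def median_seconds(fn, repeats=5, warmup=1):
    """Median wall-clock time of fn() over `repeats` runs, after `warmup` untimed runs."""
    for _ in range(warmup):
        fn()  # untimed: absorbs one-off setup cost on the first call
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def matmul_on(device_name, size=2048):
    """Return a zero-argument callable that runs one matmul pinned to device_name.

    Hypothetical workload; swap in the model or op you actually care about.
    """
    import tensorflow as tf  # lazy import so the timing helper works without TensorFlow

    with tf.device(device_name):
        a = tf.random.normal((size, size))
        b = tf.random.normal((size, size))

    def run():
        with tf.device(device_name):
            # .numpy() copies the result back to the host, so the op has to finish
            # before the timer stops.
            tf.linalg.matmul(a, b).numpy()

    return run


def compare_devices(size=2048):
    """Print median timings for the same matmul on /CPU:0 and /GPU:0."""
    import tensorflow as tf

    print("Visible GPUs:", tf.config.list_physical_devices("GPU"))
    cpu = median_seconds(matmul_on("/CPU:0", size))
    gpu = median_seconds(matmul_on("/GPU:0", size))
    print(f"CPU {cpu:.4f}s  GPU {gpu:.4f}s  CPU/GPU ratio {cpu / gpu:.2f}x")
```

Calling compare_devices() prints both timings and their ratio. A ratio below 1 means the GPU was slower than the CPU for that workload. A single matmul is only a rough proxy for a real training step, so treat the number as a sanity check on device placement and not as a full benchmark.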