It's been years, and I keep trying different hacks to get things installed. Things do install, but nothing ever runs to completion. I would expect Apple to want to play a bigger part in this rather than leaving us to move to Linux with NVIDIA. I wish Apple would just put some resources behind this.
I'm running this on an Intel MacBook Pro:
Metal device set to: AMD Radeon Pro 5600M
2022-03-29 11:46:55.397695: I tensorflow/core/platform/cpu_feature_guard.cc:151] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: SSE4.2 AVX AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2022-03-29 11:46:55.398290: I tensorflow/core/common_runtime/pluggable_device/pluggable_device_factory.cc:305] Could not identify NUMA node of platform GPU ID 0, defaulting to 0. Your kernel may not have been built with NUMA support.
2022-03-29 11:46:55.398545: I tensorflow/core/common_runtime/pluggable_device/pluggable_device_factory.cc:271] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 0 MB memory) -> physical PluggableDevice (device: 0, name: METAL, pci bus id: <undefined>)
My model gets created, but as soon as training starts on the first epoch, the kernel dies.
Epoch 1/3000
2022-03-29 11:47:10.011230: I tensorflow/core/grappler/optimizers/custom_graph_optimizer_registry.cc:113] Plugin optimizer for device_type GPU is enabled.
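When a notebook kernel dies, the underlying error is usually lost with it. One way to see how the process actually ended is to run the training code in a child process and inspect its exit status. The following is only a sketch under assumptions: `run_isolated` is a hypothetical helper name, not part of TensorFlow or tensorflow-metal, and it uses nothing beyond the Python standard library.

```python
import signal
import subprocess
import sys


def run_isolated(code: str, timeout: float = 600.0) -> dict:
    """Run a Python snippet in a child process and report how it ended.

    Hypothetical helper for diagnosis only. A crash in the child (for
    example a segfault inside a native library) cannot take down the
    calling process, so the exit status and stderr remain available.
    Raises subprocess.TimeoutExpired if the child exceeds `timeout`.
    """
    proc = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    result = {
        "returncode": proc.returncode,
        "signal": None,
        # Keep only the end of stderr, where a crash message would appear.
        "stderr_tail": proc.stderr[-2000:],
    }
    if proc.returncode < 0:
        # On POSIX, a negative return code means the child was
        # terminated by the signal with that number.
        try:
            result["signal"] = signal.Signals(-proc.returncode).name
        except ValueError:
            result["signal"] = str(-proc.returncode)
    return result
```

For example, if the model code lived in a hypothetical `train.py`, calling `run_isolated("import runpy; runpy.run_path('train.py', run_name='__main__')")` from a terminal session would return a dictionary whose `signal` and `stderr_tail` entries show whether the process was killed by a signal and what it printed last.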
I would give almost anything to get this working. tensorflow-metal has never done anything for me beyond reporting that I have a GPU.
I was excited to see that version 2.7 was out, but...
As soon as I start to run my model, it prints:
Metal device set to: AMD Radeon Pro 5600M
2022-01-13 17:02:36.447465: I tensorflow/core/platform/cpu_feature_guard.cc:151] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2022-01-13 17:02:36.448221: I tensorflow/core/common_runtime/pluggable_device/pluggable_device_factory.cc:305] Could not identify NUMA node of platform GPU ID 0, defaulting to 0. Your kernel may not have been built with NUMA support.
2022-01-13 17:02:36.448581: I tensorflow/core/common_runtime/pluggable_device/pluggable_device_factory.cc:271] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 0 MB memory) -> physical PluggableDevice (device: 0, name: METAL, pci bus id: <undefined>)
...and the kernel dies
Also:
print("Num GPUs Available: ", len(tf.config.experimental.list_physical_devices('GPU')))
Num GPUs Available: 1
Things are in place - why doesn't this work yet???
macOS 12.1
MacBook Pro (16-inch, 2019)
8-Core Intel Core i9
64 GB memory