Hi, have you tried MPSMatrixMultiplication? It should use this features when possible and it supports fp16/fp32 precision.
Topic:
Graphics & Games
SubTopic:
Metal
Tags: