So the BLAS and LAPACK functions that come with Accelerate are optimized for Apple Silicon?
As I said, probably. I guess I have to be more verbose:
The docs say that these things are "optimized for high performance". We could reasonably infer that they use, for example, NEON vector instructions on ARM. But the docs aren't explicit about this.
If you want to be certain, then without the source code you would have to reverse-engineer the implementation. That's definitely not something I would do, however, since as far as I know it would violate the terms of the Apple developer agreement and could get my developer account terminated.
If you want to be certain that you're running NEON code, it would be better to write it yourself, or to use an open-source library where you can check exactly what it is doing.
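To give an idea of what "write it yourself" could look like, here is a minimal sketch of an element-wise float add using NEON intrinsics. The function name add_f32 is made up for illustration and is not part of Accelerate. I'm assuming a compiler that defines __ARM_NEON and ships <arm_neon.h>, and I've added a scalar fallback so the same file still builds on other targets.

```c
#include <stddef.h>

#if defined(__ARM_NEON)
#include <arm_neon.h>
#endif

/* Hypothetical helper (not an Accelerate function):
 * out[i] = a[i] + b[i] for i in [0, n). */
void add_f32(const float *a, const float *b, float *out, size_t n)
{
    size_t i = 0;

#if defined(__ARM_NEON)
    /* NEON path: load, add and store four floats per iteration. */
    for (; i + 4 <= n; i += 4) {
        float32x4_t va = vld1q_f32(a + i);
        float32x4_t vb = vld1q_f32(b + i);
        vst1q_f32(out + i, vaddq_f32(va, vb));
    }
#endif

    /* Scalar path: handles the leftover tail, or the whole array
     * when NEON is not available on the target. */
    for (; i < n; ++i) {
        out[i] = a[i] + b[i];
    }
}
```

Because the intrinsics are right there in your own source, you can confirm which path was compiled in by looking at the preprocessor guard or the generated assembly, which is exactly what you can't do with a closed library.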
One other thing to note is that these Accelerate functions are not inlined, as far as I can tell, so each use is an ordinary call into the library. If you're dealing with large matrices, that call overhead is probably not a concern, but if you're doing 2D/3D geometry, for example (small vectors and matrices), then I would expect a measurable improvement from code whose implementation the compiler can inline.
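As a rough illustration of the small-vector case, here is a sketch of 3D helpers defined as static inline functions in a header-style file. The vec3 type and the function names are hypothetical, made up for this example. Whether the compiler actually inlines them depends on your optimization settings, but at least the definitions are visible at the call site, which is not the case for a function that lives inside a prebuilt library.

```c
/* Hypothetical 3D vector type for illustration. */
typedef struct {
    float x, y, z;
} vec3;

/* Dot product: small enough that an optimizing compiler will
 * typically fold it straight into the caller. */
static inline float vec3_dot(vec3 a, vec3 b)
{
    return a.x * b.x + a.y * b.y + a.z * b.z;
}

/* Cross product, using the right-handed convention. */
static inline vec3 vec3_cross(vec3 a, vec3 b)
{
    vec3 r;
    r.x = a.y * b.z - a.z * b.y;
    r.y = a.z * b.x - a.x * b.z;
    r.z = a.x * b.y - a.y * b.x;
    return r;
}
```

For work like this, the arithmetic is only a handful of multiplies and adds, so the cost of a real function call can be a noticeable fraction of the total. I'd still measure it in your own code before committing either way.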