mirror of https://github.com/commaai/tinygrad.git
l2 cache note
This commit is contained in:
parent
78a06a1285
commit
6fb127d5c7
|
@ -2,7 +2,7 @@
|
|||
|
||||
The Apple Neural Engine is a fancy DMA Engine that is based around convolutions. We don't have all the details worked out yet, but we can do some things with it. At its core, it runs through 0x300 ops in an hwx file. See `aneregs` for the registers used in each op.
|
||||
|
||||
It operates out of RAM or its 4MB L2 cache. The L2 cache appears to be manually managed, and only applies to the input and output, not the weights.
|
||||
It operates out of RAM or its 4MB L2 cache. The L2 "cache" appears to be manually managed, and only applies to the input and output, not the weights. The weights are usually included in the program, and it's unclear where they are copied to.
|
||||
|
||||
The 16 cores likely refer to the 16 wide Kernel DMA engine. They claim 11 TOPS total, which would be 687.5 GOPS/core. Perhaps it's a 24x24 MAC running at 600 MHz, with more work we can determine this better.
|
||||
|
||||
|
|
Loading…
Reference in New Issue