mirror of https://github.com/sunnypilot/sunnypilot.git
synced 2026-02-20 02:23:54 +08:00
* bigmodel
* more debug print
* debugging bigmodel
* remove the tanh, debugging
* print images/buffers
* disassemble the command queues
* decompiler
* dump the shaders
* full disasm
* support patching kernel and fixing convolution_horizontal_reduced_reads_1x1
* microbenchmark
* 42 GFLOPS, 1 GB/s
* gemm benchmark
* 75 GFLOPS vs 42 GFLOPS
* 115 GFLOPS
* oops, never mind
* gemm image is slow
* this is pretty hopeless
* gemm image gets 62 GFLOPS
* this is addictive and still a waste of time
* cleanup cleanup
* that hook was dumb
* tabbing
* more tabbing
Co-authored-by: Comma Device <device@comma.ai>
old-commit-hash: 78a352a8ca
4 lines
128 B
Bash
Executable File
version https://git-lfs.github.com/spec/v1
oid sha256:98ccebe6e903204a6fee90ac3b689e427448dc48efed095c435c0f40dab62ae7
size 115