6388 Commits

Author SHA1 Message Date
George Hotz 1e6bbdf4f8 readme updates 2020-11-02 08:30:43 -08:00
George Hotz f178d23ff3 gpu relu is good 2020-11-02 08:25:32 -08:00
George Hotz 8766346187 fix gpu sub 2020-11-02 08:18:58 -08:00
George Hotz 231c1134bd cute trick for GPU test 2020-11-02 08:17:17 -08:00
George Hotz 6a7896f3e9 opsgpu cl program caching 2020-11-02 07:46:26 -08:00
George Hotz 2e7f16bf3f the power of cheating 2020-11-02 07:42:11 -08:00
George Hotz fc358a07ad print sloccount 2020-11-02 07:38:13 -08:00
George Hotz 3da64acd6b the power of small 2020-11-02 07:36:34 -08:00
George Hotz 82fc842b40 in_place_op 2020-11-02 07:26:13 -08:00
George Hotz 1d793b8571 gpu SGD runs now 2020-11-02 07:19:12 -08:00
George Hotz 621c61b30d sgd now operates on Tensors 2020-11-02 07:14:54 -08:00
George Hotz 355402504e ops work 2020-11-02 07:03:23 -08:00
Marcel Bischoff af5a4e0f5a
alternative to einsum which is slow in extreme cases (#44)
* streamlined numerical_jacobian

* Got rid of the g loop in Conv2D.forward

* ereased stupid line

* nothing

* no loops in Conv2D forward

* Conv2D backprop improved

* stupid things in examples

* alternative to einsum

* Conv2D backward einsum alternative

* tidying up

* tidied up

* no ravel

* got rid of print

* Update efficientnet.py

* Update efficientnet.py

* Update efficientnet.py

* only tensordot

* 255.0

* whitespace

* aspect ratio error in efficientnet

* noprint

Co-authored-by: Marcel Bischoff <marcel@Marcels-iMac.local>
2020-11-02 06:36:07 -08:00
George Hotz 5201a8e89f matmul on GPU 2020-11-01 08:54:20 -08:00
George Hotz 41e7d59aed test dot 2020-11-01 07:51:35 -08:00
George Hotz 1f544d6ece test mnist on GPU 2020-11-01 07:46:17 -08:00
George Hotz 499604d69b convert tensor to gpu 2020-11-01 07:34:50 -08:00
George Hotz bac6bfda11 with that at least it's only 10x slower 2020-11-01 07:30:19 -08:00
George Hotz 9ac1ad40d6
Add GPU Support! (do not merge yet) (#41)
* copy tensors to and from gpu

* add on GPU

* adding works

* we stick shapes in

* works on cpu and gpu

* test changes, not passing yet

* something else

* op tests pass

* add, mean, and sum have working forward/backward

* mul ops test

* no gpu support, no problem

* test pass, clean up later

* gpu cleanup

* cleanup test ops, don't let div fail

* revert more

* aimpler dispatcher

* clean up grad

* GPU and

* grad is a Tensor now

* gate test on GPU

* cleanups

* late loading gpu

* GPU as input option

* last cleanups
2020-11-01 07:00:49 -08:00
liej6799 c06a4fcc80
Fix temp location for windows (#39)
* Fix temp location for windows

* Make sure get tempfile for other os also.

* Update utils.py
2020-10-31 17:21:14 -07:00
George Hotz 7c0dc8f48b more whitespace 2020-10-31 11:05:11 -07:00
George Hotz e01e35e545 14 ops to write for GPU 2020-10-31 10:59:30 -07:00
George Hotz 06928cf3cc ENET WORKS 2020-10-31 10:42:58 -07:00
George Hotz 68cba88e8f fix enet bugs, now is mousetrap 2020-10-31 10:28:07 -07:00
George Hotz a852143572 simpler conv2d 2020-10-31 09:20:20 -07:00
Marcel Bischoff e5153d401f
Conv2D forward without loops much faster, backward slightly faster (#38)
* streamlined numerical_jacobian

* Got rid of the g loop in Conv2D.forward

* ereased stupid line

* nothing

* no loops in Conv2D forward

* Conv2D backprop improved

Co-authored-by: Marcel Bischoff <marcel@Marcels-iMac.local>
2020-10-31 09:08:32 -07:00
George Hotz 053c319708 optional look at cat 2020-10-30 09:30:15 -07:00
George Hotz 2c7e75d733
group conv: forward pass works (#34)
* forward pass works

* got the backward pass

* okay, it's now a coho
2020-10-30 09:19:20 -07:00
George Hotz 339a35b081 div needs help 2020-10-30 08:32:16 -07:00
George Hotz 71aedc2309 it thinks it's washer. it's cat. bad net. you do bad. 2020-10-30 08:28:05 -07:00
George Hotz c14473f87d unit test for batchnorm2d 2020-10-30 08:19:58 -07:00
George Hotz 843b1cb7d6 only warn once 2020-10-30 08:04:48 -07:00
Marcel Bischoff 16d324064e
streamlined numerical_jacobian (#32)
Co-authored-by: Marcel Bischoff <marcel@Marcels-iMac.local>
2020-10-30 02:20:12 -07:00
Liam 127f878fa9
Clean init function (#29) 2020-10-29 17:45:55 -07:00
Afshin1354 c13b255847
Update gradcheck.py (#30) 2020-10-29 17:45:34 -07:00
Omar Ben Brahim d0c3912517
fix for invalid GPU error caused by (test/test_net_speed.py::TestConvSpeed::test_mnist) when testing in CI (#31)
* install the pytorch cpu only version

* get rid of torch gpu version

* test passed seems to get rid of invalid gpu error

* added the libs to requirements.txt

* lol
2020-10-29 17:45:16 -07:00
George Hotz 1dde4ce609 oops, stay in float32 2020-10-29 08:24:12 -07:00
George Hotz 5e7e359706 fix tests 2020-10-29 08:19:07 -07:00
George Hotz 9ae3e9daf3 shape has to be a kwarg now, idk why this didn't break before 2020-10-29 08:13:05 -07:00
George Hotz 2db670ef26 enet refactor + no sigmoid warning 2020-10-29 08:08:21 -07:00
George Hotz 17fa74c15b factor out a couple nn ops 2020-10-29 08:01:12 -07:00
George Hotz f84f6c1edd write sqrt and div using pow 2020-10-29 07:57:25 -07:00
Omar Ben Brahim ec7bdb87ca
clean up CI (#28) 2020-10-29 07:43:31 -07:00
George Hotz cc17e3271a try to recognize cat. do not succeed 2020-10-27 21:41:52 -07:00
George Hotz 03d9c98f5b enet work 2020-10-27 21:23:02 -07:00
George Hotz 0ec279951f enet weight loading 2020-10-27 21:01:48 -07:00
George Hotz e84ad3e27d enet runs 2020-10-27 20:16:29 -07:00
George Hotz 9166eb58bb more enet 2020-10-27 19:37:21 -07:00
George Hotz 41828d768f sigmoid and more enet 2020-10-27 19:13:47 -07:00
George Hotz 09d1ebcdaa examples is better 2020-10-27 18:57:00 -07:00