Commit Graph

274 Commits

Author SHA1 Message Date
George Hotz ce6c408d78 readmee 2020-11-07 12:26:57 -08:00
George Hotz 3ab8fb2189 oops, timing 2020-11-07 12:15:52 -08:00
George Hotz e4c5fb0219 webcam support 2020-11-07 12:15:16 -08:00
George Hotz 5486135f2d readme 2020-11-07 11:41:27 -08:00
George Hotz 9b8fefe4e3 fix broadcast bugggg 2020-11-07 11:38:56 -08:00
George Hotz b1ca4dd327 who loves speeeeed 2020-11-07 11:36:52 -08:00
George Hotz e6c8321e5b enet is running on GPU 2020-11-07 11:33:46 -08:00
George Hotz 30442a086a some broadcasting, pool test is fail 2020-11-07 11:29:42 -08:00
Ryan Neph 7c630ca355
GPU {Avg,Max}Pool2D forward pass (#66) 2020-11-07 11:13:24 -08:00
George Hotz 40c538c299 fix reshape 2020-11-07 11:10:29 -08:00
George Hotz 19c2950310 refactor ops to prepare for broadcasting 2020-11-07 11:02:45 -08:00
George Hotz 94d44c97bf add pad2d on GPU 2020-11-07 10:46:36 -08:00
George Hotz fbff6ab2e5 fix strided convs, GPU env var for enet 2020-11-07 10:26:37 -08:00
George Hotz ec03eb44bd tinygrad does forward pass convs on GPU 2020-11-07 10:15:56 -08:00
Manjeet Singh 23c39d9f52
fetch() 'ran out of input' fix (#65)
If requests.get fails during a download, the incomplete file is passed
without a size check. This causes errors during image/model loading.
Not sure if this is a problem on all systems.
2020-11-07 09:45:10 -08:00
George Hotz bc7758cc5b getting convs to work on gpu 2020-11-07 09:17:57 -08:00
George Hotz 3302286e68 yayay test_sgd_gpu passes 2020-11-07 08:48:17 -08:00
Ryan Neph 98d1a4f740
GPU Selection works when len(answers)>ngpus (#64)
Fixes TypeError: sequence item 0: expected str instance, int found
triggered by https://github.com/inducer/pyopencl/blob/v2020.2.2/pyopencl/__init__.py#L1500
in pyopencl v2020.2.2
2020-11-07 08:06:59 -08:00
George Hotz ce0c80e1bd ugh, corrupt downloads. when did this break? 2020-11-07 08:04:43 -08:00
George Hotz 9332fe1989 weee, logsoftmax on GPU. very slow 2020-11-07 07:59:37 -08:00
George Hotz 38e112cccd logsoftmax test 2020-11-07 07:26:53 -08:00
George Hotz 237bbe3625 sigmoid on GPU 2020-11-07 07:23:46 -08:00
Rakib Fiha f40dbd791c
Use --upgrade since its in active dev (#63) 2020-11-07 07:15:05 -08:00
Rene Delgado cd54697fd8
fix gpu sum forward (#61)
* ignore venv

* add sum test

* fix sum forward
2020-11-05 21:59:16 -08:00
gallanoe aea1069f63
Div on CPU (#58)
* Added Div on CPU

* Removed eps. value

* Fixed tabs
2020-11-05 21:58:37 -08:00
NeuralLink cc605da36d
Stable Sigmoid op (#59)
* 🔨 Added stable sigmoid

*  added sigmoid test

* 🔧 suppressed overflow warning

* 🔧 clean up
2020-11-05 21:57:50 -08:00
George Hotz cc852efa64 make OpenCL work elsewhere 2020-11-05 07:59:55 -08:00
Ryan Neph 22a5f9975d
pow+div on GPU (#57) 2020-11-05 07:49:45 -08:00
adamritter b16fadc5c6
Speed up sum backward pass (#54)
Co-authored-by: holonomicjl <58403584+holonomicjl@users.noreply.github.com>
2020-11-04 19:55:45 -08:00
George Hotz 0be3d2cdba install pyopencl 2020-11-04 11:22:11 -08:00
George Hotz 3efb4f4df4 chicken.jpg 2020-11-04 11:20:22 -08:00
Marcel Bischoff c69582744d
Update efficientnet.py (#53)
from https://github.com/lukemelas/EfficientNet-PyTorch/blob/master/efficientnet_pytorch/utils.py
```
    blocks_args = [
        'r1_k3_s11_e1_i32_o16_se0.25',
        'r2_k3_s22_e6_i16_o24_se0.25',
        'r2_k5_s22_e6_i24_o40_se0.25',
       'r3_k3_s22_e6_i40_o80_se0.25',
        'r3_k5_s11_e6_i80_o112_se0.25',
        'r4_k5_s22_e6_i112_o192_se0.25',
        'r1_k3_s11_e6_i192_o320_se0.25',
    ]
```

now it's a persian cat.
2020-11-04 09:36:27 -08:00
Marcel Bischoff c40862241d
Revert "choose aspect ration dependent on landscape or portrait (#48)" (#51)
This reverts commit 2808411421.
2020-11-02 19:31:07 -08:00
Anders Aaen Springborg 2808411421
choose aspect ration dependent on landscape or portrait (#48) 2020-11-02 18:10:22 -08:00
Marcel Bischoff d24363f421
Update setup.py (#49)
I think `:=` in tinygrad/test/test_mnist.py actually needs 3.8
2020-11-02 18:09:31 -08:00
George Hotz 940e14c6ca more readme 2020-11-02 08:33:48 -08:00
George Hotz 1e6bbdf4f8 readme updates 2020-11-02 08:30:43 -08:00
George Hotz f178d23ff3 gpu relu is good 2020-11-02 08:25:32 -08:00
George Hotz 8766346187 fix gpu sub 2020-11-02 08:18:58 -08:00
George Hotz 231c1134bd cute trick for GPU test 2020-11-02 08:17:17 -08:00
George Hotz 6a7896f3e9 opsgpu cl program caching 2020-11-02 07:46:26 -08:00
George Hotz 2e7f16bf3f the power of cheating 2020-11-02 07:42:11 -08:00
George Hotz fc358a07ad print sloccount 2020-11-02 07:38:13 -08:00
George Hotz 3da64acd6b the power of small 2020-11-02 07:36:34 -08:00
George Hotz 82fc842b40 in_place_op 2020-11-02 07:26:13 -08:00
George Hotz 1d793b8571 gpu SGD runs now 2020-11-02 07:19:12 -08:00
George Hotz 621c61b30d sgd now operates on Tensors 2020-11-02 07:14:54 -08:00
George Hotz 355402504e ops work 2020-11-02 07:03:23 -08:00
Marcel Bischoff af5a4e0f5a
alternative to einsum which is slow in extreme cases (#44)
* streamlined numerical_jacobian

* Got rid of the g loop in Conv2D.forward

* ereased stupid line

* nothing

* no loops in Conv2D forward

* Conv2D backprop improved

* stupid things in examples

* alternative to einsum

* Conv2D backward einsum alternative

* tidying up

* tidied up

* no ravel

* got rid of print

* Update efficientnet.py

* Update efficientnet.py

* Update efficientnet.py

* only tensordot

* 255.0

* whitespace

* aspect ratio error in efficientnet

* noprint

Co-authored-by: Marcel Bischoff <marcel@Marcels-iMac.local>
2020-11-02 06:36:07 -08:00
George Hotz 5201a8e89f matmul on GPU 2020-11-01 08:54:20 -08:00