George Hotz
78576915de
Add needed contiguous to DiskBuffer. SHM support on OSX ( #1891 )
...
* add some contiguous
* remove second contig
* Revert "remove second contig"
This reverts commit fc164f7dca1ad75b1e466e4e45a05eca58b7e0e0.
* shm on osx
* can repro bug
* don't contig zeros and ones
2023-09-22 09:16:42 +08:00
qazal
d0e752003d
fixes ( #1893 )
2023-09-22 07:20:27 +08:00
wozeparrot
009a99a0b1
feat: way cleaner hip wrapper ( #1895 )
2023-09-22 07:20:03 +08:00
Yixiang Gao
cb5d6576cb
cifar step time 65ms while stay above 94% ( #1888 )
...
* change reduceop heruistics
* add model ema and jit hack
* add ema eval
* have to create a duplicate eval function for jit
* remove manual seed
* 94% achieveable with normal eval
* ema is outputting the same results as normal
* fix ema bug
* ema achieves 94% with fix seed
* multigpu tested
* constant fold decay, fix jit, adjust message for multigpu
* pull SpeedyResNet out of train_cifar()
2023-09-21 11:19:32 +08:00
kormann
864746d6aa
polish print_tree ( #1868 )
...
* fix
* isinstance
2023-09-21 11:13:10 +08:00
chenyu
a5090f0ee9
remove NumNode.int() ( #1876 )
2023-09-21 10:29:16 +08:00
Gijs Koning
9eb6310686
Fix gpt optimization ( #1885 )
...
* fix for gpt
* the actual fix
* Remove change in symbolic
* small comment
2023-09-21 10:28:18 +08:00
Szymon Ożóg
bd3444797b
make ssa assign r[u] ( #1887 )
2023-09-21 10:20:20 +08:00
nimlgen
9450e41f70
no import when Python is shutting down ( #1875 )
2023-09-20 12:47:02 -04:00
Yixiang Gao
84ab47a90a
add branch up-to-date check ( #1879 )
2023-09-20 12:41:51 -04:00
nimlgen
504bb6d0ea
support symbolic jit in HIP ( #1877 )
2023-09-20 01:44:26 -04:00
chenyu
cd66c9e249
no numnode in shape ( #1871 )
2023-09-17 07:49:45 +08:00
Yixiang Gao
18ec5a9e09
add comment bot to CI ( #1873 )
2023-09-16 12:22:06 -04:00
Yixiang Gao
a27f6c7d62
add diff mode to sz.py ( #1872 )
2023-09-16 00:43:47 -04:00
nimlgen
4c31dfafb3
add seed to gpt-2 ( #1869 )
2023-09-15 17:34:14 -04:00
wozeparrot
c870764940
Revert "add line changes diff bot to CI ( #1863 )" ( #1870 )
2023-09-15 16:56:42 -04:00
Yixiang Gao
789c84a7a3
add line changes diff bot to CI ( #1863 )
2023-09-15 16:29:58 -04:00
chenyu
29ac8293d7
run gpt2 in CI ( #1866 )
2023-09-15 04:37:02 +08:00
chenyu
1b46de1a3e
fix type of helpers.prod, add test cases ( #1859 )
2023-09-14 05:16:55 +08:00
chenyu
e67306ba04
symbolic shape type with TypeGuard ( #1852 )
2023-09-13 05:27:22 +08:00
Roelof van Dijk
c91b44f7bf
refactor: move size to view ( #1848 )
...
* refactor: move size to view
* fix: pylint
---------
Co-authored-by: Roelof van Dijk <roelof.van.dijk@vitestro.com>
2023-09-11 07:16:04 -07:00
chenyu
9e9ea20784
Fix view, CI cpu test with python 3.8 ( #1845 )
2023-09-10 22:37:58 -04:00
chenyu
3ec301c2d7
apply view.py patch ( #1844 )
2023-09-10 17:32:15 -07:00
Yixiang Gao
a32951a001
add test_tensor_copy ( #1840 )
...
* add test_tensor_copy
* fix whitespace
* add value check
2023-09-10 16:01:58 -07:00
Roelof van Dijk
1bc52c60df
fix: minor tweaks to view ( #1842 )
...
Co-authored-by: Roelof van Dijk <roelof.van.dijk@vitestro.com>
2023-09-10 15:55:57 -07:00
George Hotz
47e602f717
view: do not trade complexity for speed ( #1839 )
...
* view: do not trade complexity for speed
* staticmethods
* view create
2023-09-10 11:29:53 -07:00
chenyu
c0bc4cfbaf
DivNode.b is int ( #1833 )
2023-09-10 09:04:29 -07:00
nimlgen
13790b1e20
cast types in render_load ( #1837 )
2023-09-10 07:58:13 -07:00
David Hou
e74a6ca7e4
expand in terms of substitute ( #1827 )
2023-09-09 14:43:00 -07:00
George Hotz
0e3e2bac13
amd wino: upload results
2023-09-09 13:57:14 -07:00
George Hotz
6f95c5f284
winograd speed test for AMD ( #1826 )
2023-09-09 13:56:33 -07:00
George Hotz
0f2bd10d00
add winograd CIFAR to mac tests ( #1825 )
...
* add winograd CIFAR to mac tests
* symlink already done
2023-09-09 13:45:24 -07:00
nimlgen
31fca43706
kopt works with local+grouped reduce and tests ( #1824 )
2023-09-09 13:22:09 -07:00
chenyu
9da40c8448
move Node.__lt__ SumNode special case to SumNode ( #1823 )
2023-09-09 13:20:38 -07:00
Francis Lam
651205fa5c
linearizer: support local and group_for_reduce dimensions together ( #1821 )
...
also minor changes to test_speed_v_torch.py and size of UOps.SPECIAL
2023-09-08 12:39:27 -07:00
segf00lt
9e8c1dbf34
patch to remove hack from stable_diffusion.py ( #1814 )
...
* patch to remove hack from stable_diffusion.py
* sorry linter
* realize after assign?
* float16 broken in llvmlite use float64 for now
* int32
* idiot forgot to change test array dtype
2023-09-08 09:26:50 -07:00
chenyu
ebcda8a714
Move var_vals from ShapeTracker to LazyBuffer ( #1819 )
2023-09-08 09:25:10 -07:00
kormann
7ac65a93b4
utils.printtree ( #1816 )
...
* utils.printtree
* linter compliance
* rename to print_tree
2023-09-07 23:08:57 -07:00
George Hotz
4613c9e77c
add tvm example, formatting ( #1813 )
...
* add tvm example
* no realize
2023-09-07 11:50:41 -07:00
nimlgen
5b15a972b5
no functions with same names in test/ ( #1811 )
2023-09-07 11:27:31 -07:00
George Hotz
722823dee1
stable diffusion: force fp16 free
2023-09-06 15:11:05 -07:00
chenyu
928cb1a64a
AndNode.substitute short circuit ( #1800 )
...
* AndNode substitute short circuit
* Node.__bool__ is faster than Node.__eq__
2023-09-06 14:58:49 -07:00
nimlgen
a78a1fa499
fix jit buffer reuse when freed ( #1802 )
...
* fix jit buffer reuse when freed
* Firbid output_buffer reusage
2023-09-06 14:41:57 -07:00
Yixiang Gao
22cf15e9d0
convert function into tinygrad ( #1803 )
2023-09-06 14:41:26 -07:00
Pavol Rusnak
52a92bf95d
use class Foo: instead of class Foo(): ( #1797 )
...
* use class Foo: instead of class Foo():
* add ruff linter, copy settings from .flake8 to ruff.toml
2023-09-06 12:20:25 -07:00
badcc
fd25792c8b
Ensure freqs as type float32 in freqs_cis ( #1798 )
2023-09-06 10:24:15 -07:00
chenyu
35072877ef
sym_infer is noop for int input ( #1795 )
2023-09-06 09:17:20 -07:00
George Hotz
f67638b27a
delete broken DDPG example
2023-09-06 08:01:12 -07:00
George Hotz
78a43ad2c7
add uop fixup ( #1793 )
2023-09-06 07:55:22 -07:00
geohotstan
1bbf26d7fd
fix try except not catching fxn() in benchmark ( #1783 )
...
* have function raise notimplementederror
* more lines
* revert back to 2 lines :D
* aahhhhhhhh shoooot im stupid
* keep it minimal?
2023-09-06 07:36:43 -07:00