Commit Graph

2324 Commits

Author SHA1 Message Date
Dhairya Gandhi 370fd978fa
Merge pull request #986 from FluxML/restructure
Destructure/restructure for models
2020-01-13 13:04:48 +05:30
Dhairya Gandhi 58a7941386 reduce bors timeout 2020-01-13 11:24:04 +05:30
Dhairya Gandhi 0411b9a3e8 rm second slash 2020-01-12 17:35:04 +05:30
Mike Innes f96270c213 free zygote 2020-01-09 17:16:41 +00:00
Mike J Innes 17732e7023 restructure; closes #747 2020-01-06 11:53:47 +00:00
Dhairya Gandhi e92da0cf85
Merge pull request #973 from FluxML/sf/nnpack_tolerance
Give `NNPACK` a bit of numerical leeway
2019-12-23 15:57:56 +05:30
Elliot Saba 0fdcc00923 Give `NNPACK` a bit of numerical leeway 2019-12-23 01:31:26 -08:00
Viral B. Shah 8a1e2f19d7
Update README.md 2019-12-19 09:44:17 -05:00
Dhairya Gandhi ac4c49b63e
Merge pull request #954 from FluxML/decaydocs
[WIP] Decaydocs
2019-12-10 12:11:23 +05:30
Fredrik Bagge Carlson e67f09c06d Correct some comments in decay docs 2019-12-03 15:32:23 +08:00
Fredrik Bagge Carlson 6e94e59afd Improve docs for decay optimisers 2019-12-03 15:27:44 +08:00
Mike J Innes f46b5243db
Merge pull request #946 from FluxML/pkg-up
compat, pkg up
2019-11-29 12:55:47 +00:00
Mike J Innes 0c99f7f4b7 Merge branch 'dg/news' into pkg-up 2019-11-29 10:42:28 +00:00
Dhairya Gandhi 4b63e69b65 bump version to v0.10 2019-11-29 00:02:59 +05:30
Dhairya Gandhi 8519833d17 Merge branch 'dg/news' of https://github.com/FluxML/Flux.jl into dg/news 2019-11-28 23:57:30 +05:30
Dhairya Gandhi 73d572b1a9 rm RADAM 2019-11-28 23:57:01 +05:30
Mike Innes b65b491e51 compat, pkg up 2019-11-28 16:23:22 +00:00
Dhairya Gandhi c17dc34e38
phew
Co-Authored-By: Mike J Innes <mike.j.innes@gmail.com>
2019-11-28 21:49:34 +05:30
Dhairya Gandhi 1ae554d82c rm new line 2019-11-28 21:47:37 +05:30
Dhairya Gandhi 4481c74f50 v0.10 changes 2019-11-28 21:45:06 +05:30
Mike J Innes 75d609ecc8
Update README.md 2019-11-28 16:00:55 +00:00
Mike J Innes 99f98ca800
Update README.md 2019-11-28 16:00:21 +00:00
Tim Besard ab450477f3
Merge pull request #944 from FluxML/rnn-fix
RNN failure hackaround
2019-11-27 16:06:13 +01:00
Mike Innes 1c0e9acc45 Update CuArrays to include the workspace fix. 2019-11-27 14:31:03 +01:00
bors[bot] 90a38a3201
Merge #937
937: Fix Glorot initialization, add He initialization r=MikeInnes a=Sleort

Should fix #442 .
Adds He weight initialization as a bonus :-)

Co-authored-by: Troels Arnfred Bojesen <tr-ab@online.no>
2019-11-26 16:17:06 +00:00
bors[bot] fb4a48f970
Merge #943
943: Fixes #900 r=MikeInnes a=dhairyagandhi96

Thoughts on the test?

cc @MikeInnes

Co-authored-by: Dhairya Gandhi <dhairya@juliacopmuting.com>
2019-11-26 15:09:27 +00:00
Dhairya Gandhi 59bb0d81b0 add TODO 2019-11-26 16:23:09 +05:30
Mike J Innes 4c69b44a7c
Merge pull request #940 from matsueushi/feature/cuda-logitbc
Fix logitbinarycrossentropy on CuArrays
2019-11-26 10:18:07 +00:00
Dhairya Gandhi c031ae1a94 correct channel value 2019-11-24 13:31:31 +05:30
Tim Besard fbb377a7b4
Merge pull request #941 from FluxML/tb/include_during_precompile
Don't include the CUDA module during precompilation.
2019-11-24 08:55:43 +01:00
Dhairya Gandhi 5f21238d1a no grad dims helper 2019-11-24 13:25:02 +05:30
Tim Besard 4ece13c649 Don't include the CUDA module during precompilation.
If we do, we could end up replacing it at runtime.
2019-11-22 18:03:51 +01:00
matsueushi a0314ce682 Fix logitbinarycrossentropy on CuArrays 2019-11-22 05:23:24 +00:00
Troels Arnfred Bojesen 3f97701d4c Merge branch 'HEAD' into weight_init_patch 2019-11-20 13:25:32 +09:00
Troels Arnfred Bojesen 60a29abaf1 Merge branch 'weight_init_patch' into HEAD 2019-11-20 13:25:19 +09:00
Troels Arnfred Bojesen 3b83828e4e Merge branch 'HEAD' into weight_init_patch 2019-11-20 13:24:48 +09:00
Troels Arnfred Bojesen af96a197c1 Fix Glorot initialization
Should fix #442
2019-11-20 13:20:42 +09:00
Mike J Innes 5839e166f6
Merge pull request #860 from dsweber2/activations
Activations
2019-11-19 16:44:25 +00:00
Tim Besard 2fa3e5673e
Merge pull request #924 from FluxML/tb/cuda_init
CUDA package initialization improvements
2019-11-19 16:48:45 +01:00
Tim Besard c45cec4cba Simplify warning. 2019-11-19 16:05:41 +01:00
Tim Besard bd734ed957 Bump CUDA dependencies. 2019-11-19 15:55:25 +01:00
Tim Besard 69bf84278f Remove wrong warning. 2019-11-19 15:53:43 +01:00
Mike J Innes 4f73e434a4
Merge pull request #935 from baggepinnen/patch-4
Fix AMSGrad on GPU
2019-11-19 12:58:37 +00:00
Troels Arnfred Bojesen 2b80573248 Fix Glorot initialization, add He initialization
Should fix #442 .
Adds He weight initialization as a bonus :-)
2019-11-19 18:16:29 +09:00
bors[bot] 8638bcdcd7
Merge #936
936: Avoid unnecessary conversion r=MikeInnes a=baggepinnen

This initialization works for both cpu and gpu

Co-authored-by: Fredrik Bagge Carlson <baggepinnen@gmail.com>
2019-11-19 09:05:23 +00:00
Fredrik Bagge Carlson 2da22f31f0
Avoid unnecessary conversion
This initialization works for both cpu and gpu
2019-11-19 16:31:04 +08:00
Fredrik Bagge Carlson df7ffb0ef8
Fix AMSGrad on GPU
The previous initialization created a CPU array. Now, the same type of array as `x` is created.
2019-11-19 16:27:44 +08:00
Troels Arnfred Bojesen 4530ac65c7 Fix Glorot initialization, add He initialization
Should fix the issue reported at https://github.com/FluxML/Flux.jl/issues/442 .
Adds He weight initialization as a bonus :-)
2019-11-19 16:50:40 +09:00
Mike J Innes 967cc1c175
Merge pull request #927 from heliosdrm/patch-1
Extend docs about `train!`
2019-11-18 12:22:16 +00:00
dsweber2 dea29532ef Merge branch 'master' into activations 2019-11-15 17:19:43 -08:00