Commit Graph

1626 Commits

Author SHA1 Message Date
Elliot Saba
113ddc8760 Update Flux code for new NNlib branch 2019-04-25 09:55:23 -07:00
Hossein Pourbozorg
7f06b15f67 use https instead of http for web links 2019-04-25 11:04:03 +00:00
Jake Topping
ff7adda74b
Swap comma for full stop
"ERROR: LoadError: UndefVarError: G not defined" caused by "gn,G" rather than "gn.G" in line 386. Swapping for full stop should fix this
2019-04-22 17:08:36 +01:00
Zachary P Christensen
83eb5a1df6
Fix typo in Maxout 2019-04-19 17:02:26 -04:00
thebhatman
31a50ab16a Fixed ExpDecay 2019-04-11 17:28:06 +05:30
Mike J Innes
54d9229be9
Merge pull request #710 from johnnychen94/master
naive implementation of activations
2019-04-05 15:33:31 +01:00
Johnny Chen
a300376f71
fix a typo in comment
`inplementation` --> `implementation`
2019-04-05 19:19:30 +08:00
JohnnyChen
3cafbbad02 simplify the implementation 2019-04-05 18:44:00 +08:00
JohnnyChen
de7a5f4024 correct the function behavior; support Any type 2019-04-05 18:16:44 +08:00
thebhatman
b84ab7ac95 Removed logcosh 2019-04-05 03:16:54 +05:30
bors[bot]
bd9d73a941 Merge #655
655: Added support for Float64 for DepthwiseConv r=dhairyagandhi96 a=thebhatman

DepthwiseConv was giving errors for Float64. This fixes the issue.

Co-authored-by: Manjunath Bhat <manjunathbhat9920@gmail.com>
2019-04-04 17:25:52 +00:00
chengchingwen
261235311c change dims as unbroadcasted dims and keyword argument 2019-04-05 01:19:20 +08:00
Dhairya Gandhi
1963f30911
Merge pull request #726 from dhairyagandhi96/iris
use cached iris dataset
2019-04-04 22:46:21 +05:30
Dhairya Gandhi
9c8175b1c0 fixes 2019-04-04 22:32:01 +05:30
Dhairya Gandhi
4f754d33cb switch to http link 2019-04-04 22:18:38 +05:30
Dhairya Gandhi
38cc216a4b switch to azure 2019-04-04 22:03:01 +05:30
Dhairya Gandhi
77274b4af7 change iris link 2019-04-04 21:07:46 +05:30
Dhairya Gandhi
2952bcdab1 fixes 2019-04-04 19:28:40 +05:30
Dhairya Gandhi
5b9c53439b recreate OHV 2019-04-04 19:19:47 +05:30
Dhairya Gandhi
4f1336905f fix colon indexing 2019-04-04 19:16:14 +05:30
Shreyas
4cb7b9278b Minor changes to docstring according to guidelines 2019-03-30 00:28:23 +05:30
JohnnyChen
82595648e2 change 4-spaces tab to 2-spaces tab 2019-03-28 22:40:24 +08:00
Shreyas
b6fcd1d837 Added export to Maxout 2019-03-28 19:15:16 +05:30
JohnnyChen
13c58494ec add x into results 2019-03-28 19:28:59 +08:00
JohnnyChen
5c2a071713 add support for 0-element Chain 2019-03-28 17:20:41 +08:00
JohnnyChen
ccfe0f8720 naive implementation of activations 2019-03-28 17:07:04 +08:00
Shreyas
61c1fbd013 Made Requested Changes 2019-03-28 01:33:04 +05:30
Shreyas
671aed963e Made a few fixes. Added tests 2019-03-28 00:51:50 +05:30
thebhatman
4efcc69ba5 logcosh averaged 2019-03-26 23:23:02 +05:30
Shreyas
595f1cf6eb Made Requested Changes 2019-03-26 21:42:49 +05:30
Manjunath Bhat
930adb122d
Avoided promotion to Float64 in hinge. 2019-03-25 23:43:06 +05:30
thebhatman
6f078857be Added reference links to loss functions 2019-03-26 03:15:28 +05:30
thebhatman
c4d12e57fe Loss function names in lowercase 2019-03-26 03:09:48 +05:30
Lyndon White
f0cc4a328d make Maxout trainable 2019-03-25 16:02:46 +00:00
Tim Besard
0734eeb50e Check CuArrays major version. 2019-03-22 14:15:26 +01:00
Dhairya Gandhi
bc06861320 fix indirect import 2019-03-22 14:15:26 +01:00
Tim Besard
959dd247bf Import CUDAdrv stuff through CuArrays. 2019-03-22 14:15:26 +01:00
Tim Besard
df509ce9f0 Adapt to the new CUDAdrv.CuPtr pointer type. 2019-03-22 14:15:26 +01:00
Mike J Innes
b637311642
Merge pull request #647 from oxinabox/ox/maxout
Add MaxOut layer
2019-03-22 12:18:53 +00:00
Lyndon White
401d3da884 no arg closures 2019-03-21 17:04:52 +00:00
Lyndon White
7d247ea25b update docstring 2019-03-18 12:20:46 +00:00
Nick Robinson
f222555deb Update src/Flux.jl
Co-Authored-By: oxinabox <oxinabox@ucc.asn.au>
2019-03-18 12:20:46 +00:00
Lyndon White
ca68bf9bec correct casing 2019-03-18 12:20:46 +00:00
Lyndon White
838047f708 fix docs 2019-03-18 12:19:44 +00:00
Kristoffer Carlsson
b84a60e74e Update src/layers/basic.jl
Co-Authored-By: oxinabox <oxinabox@ucc.asn.au>
2019-03-18 12:19:44 +00:00
Lyndon White
fcc3ec471a Add MaxOut layer 2019-03-18 12:19:44 +00:00
Lyndon White
79de829fdc move Dense's overloads to be near its defn 2019-03-18 12:18:14 +00:00
chengchingwen
934f0840b2 change API to dims 2019-03-14 21:51:28 +08:00
Manjunath Bhat
633f0df01f
Added new loss functions. 2019-03-12 02:31:42 +05:30
Joshua Whittemore
f061df3d23 resolves pull request #652 merge conflicts 2019-03-09 12:51:20 -08:00
Mike J Innes
b348e31f07
Merge pull request #667 from FluxML/donottrack
rm Tracker
2019-03-08 11:38:37 +00:00
Josh Whittemore
930ebaf217 Add module to make iris dataset available. 2019-03-07 16:56:23 -08:00
Manjunath Bhat
c6e51f5cc2
Made lambda and alpha of eltype(x) 2019-03-07 23:42:38 +05:30
Manjunath Bhat
47c1324476
Merge branch 'master' into patch-3 2019-03-07 23:08:40 +05:30
Manjunath Bhat
1d310d4532
Removed {typeof(p)} 2019-03-07 21:55:26 +05:30
thebhatman
f4543b7adf Value of alpha updated and dot operations changed 2019-03-08 03:21:26 +05:30
David Pollack
7b9b64f1cb change IN to in 2019-03-07 09:46:44 +01:00
David Pollack
83b4b3a714 changes based on PR comments 2019-03-07 09:46:44 +01:00
David Pollack
c41f891005 changes based on the improved batchnorm in PR#633 2019-03-07 09:46:44 +01:00
David Pollack
129a708b6f instance normalization 2019-03-07 09:46:44 +01:00
Mike J Innes
b5a148fa37 rm Tracker 2019-03-07 01:33:02 +00:00
Mike J Innes
3a4c6274fa
Merge pull request #651 from FluxML/mji/dogfood
Refactor training loop
2019-03-06 16:53:24 +00:00
Mike J Innes
fc6232b779
Merge pull request #633 from Sklan/patch-3
Improving BatchNorm
2019-03-06 16:23:03 +00:00
thebhatman
8e5965ac41 Indentation fixed 2019-03-05 16:28:05 +05:30
thebhatman
d6608682fc Suggested changes made 2019-03-05 16:18:50 +05:30
Manjunath Bhat
29b853e0bb
Made sure Gradients are not lost. 2019-03-04 22:17:19 +05:30
Manjunath Bhat
b5533ee00b
Exported AlphaDropout 2019-03-04 01:09:05 +05:30
Manjunath Bhat
97f874abcf
Added AlphaDropout which is used in SNNs. 2019-03-04 01:05:46 +05:30
Manjunath Bhat
704be49483
Added support for Float64 for DepthwiseConv
DepthwiseConv was giving errors for Float64. This fixes the issue.
2019-03-01 15:04:05 +05:30
Mike Innes
4cf43c0c41 simpler/nicer training loop 2019-02-28 14:58:42 +00:00
Mike Innes
cd091ad005 in place implicit gradients 2019-02-28 14:08:01 +00:00
Mike Innes
8b4bc7cc52 organise params 2019-02-28 13:44:54 +00:00
Dhairya Gandhi
6825639f79 mapreduce for onehotmatrix 2019-02-28 09:17:18 +05:30
Rohith Pentaparthy
1b1dff1266 Added an example of Conv to Flux.jl/src/layers/conv.jl, and clarified what WHCN means 2019-02-23 14:31:27 -06:00
Sklan
7463f09591
Update normalise.jl 2019-02-21 23:56:19 +05:30
Sklan
6044421c5c
Update normalise.jl 2019-02-20 13:47:31 +05:30
pshashk
b0a5844afb Remove dims=1 from normalise (#619)
* remove `dims=1`

* add dims arg

* fix test

* remove dims=1 only from deprecated version
2019-02-11 16:11:47 +00:00
Dhairya Gandhi
2ec35861b5 removing non-allocating functions and tests 2019-02-11 21:22:32 +05:30
Dhairya Gandhi
35cd9761a8 adding tests 2019-02-09 22:32:02 +05:30
pshashk
b074b2491a
fix docstring 2019-02-08 21:49:53 +03:00
pshashk
c3e04392d8
drop dims type restriction 2019-02-08 16:15:37 +03:00
pshashk
911c901294
dims kwarg 2019-02-08 16:00:32 +03:00
pshashk
368c29e5e3
Add corrected argument to std
Fixes ffe037c485/src/layers/stateless.jl (L49)
2019-02-08 15:23:27 +03:00
Mike J Innes
ffe037c485
Merge pull request #603 from FluxML/kf/namedtupletree
Treat NamedTuple like Tuple for treelike purposes
2019-02-08 11:06:12 +00:00
Mike J Innes
601e2d8ae0
Merge pull request #586 from KristofferC/kc/batchnorm
work around extreme slowdown in BatchNorm due to julia performance bug in broadcast fusion
2019-02-08 11:00:33 +00:00
Mike J Innes
fe712bf338
Merge pull request #596 from IvanYashchuk/ivan/topic-issue-542
Fixed issue #542.
2019-02-08 10:38:23 +00:00
Ivan Yashchuk
e00ac88016 Added tracking of logdet and logabsdet. Added gradtests. 2019-02-08 09:55:33 +02:00
Keno Fischer
1e452a3042 Treat NamedTuple like Tuple for treelike purposes 2019-02-06 11:11:00 -05:00
KristofferC
9914c531f6 work around extreme slowdown due julia performance bug 2019-02-06 16:19:29 +01:00
Mike J Innes
ecc55ec9e1
Revert "Fix OneHotVector/Matrix performance on GPU" 2019-02-06 14:31:15 +00:00
Mike J Innes
e8b2ec6f67
Merge pull request #311 from tejank10/conv_transpose
2D Conv transpose support
2019-02-06 14:14:14 +00:00
Moksh Jain
046f7b4eae fix std arguments in normalise 2019-02-05 18:36:04 +05:30
Ivan Yashchuk
f790fff59a Use other definition for grad(det(A)). 2019-02-05 14:36:28 +02:00
Moksh Jain
c6409d7686 add support for n-dimensional input to normalise layer 2019-02-05 17:09:22 +05:30
Ivan Yashchuk
aa64d2157d Fixed issue #542.
Added tracking of LinearAlgebra.det and its grad method.
2019-02-05 11:38:27 +02:00
Mike J Innes
940b1e6dbf
Merge pull request #587 from KristofferC/patch-2
use uncorrected standard deviation in normalise
2019-02-04 14:35:25 +00:00
Mike J Innes
7fc920240d
Merge pull request #591 from dhairyagandhi96/onehot
Fix OneHotVector/Matrix performance on GPU
2019-02-04 13:53:55 +00:00
Mike J Innes
17f33b4a6a
Merge pull request #583 from KristofferC/kc/small_fixes
clarify docs on single batch image to conv
2019-02-04 12:33:34 +00:00
Mike J Innes
e774053126
Merge pull request #590 from oxinabox/patch-2
Default to zero'ed initial state for all RNN
2019-02-04 12:28:38 +00:00
Mike J Innes
329c8f8f95
Merge pull request #585 from KristofferC/kc/verify_download
add hash verification to datasets
2019-02-04 11:20:53 +00:00
Mike J Innes
cfe6859186 auto-collect in forward 2019-02-04 10:37:02 +00:00
Mike J Innes
838070968e vcat with scalars 2019-02-04 00:05:16 +00:00
Dhairya Gandhi
30aa814c4d fixes #582 2019-02-03 18:43:16 +05:30
Dhairya Gandhi
e243950e28 comment fix 2019-02-03 04:00:08 +05:30
Dhairya Gandhi
bd6158d7f9 onehotvector/matrix behaviour 2019-02-03 03:57:41 +05:30
Lyndon White
26550dacda
Default to zero'ed initial state 2019-02-02 20:01:28 +00:00
Tejan Karmali
84eabcd2ae
fixed DepthwiseConv dilation 2019-02-02 12:19:35 +05:30
Tejan Karmali
e54df2de06
Merge branch 'master' into conv_transpose 2019-02-02 10:20:45 +05:30
Kristoffer Carlsson
fd0f1c7a82
use uncorrected standard deviation in normalise
fixes https://github.com/FluxML/Flux.jl/issues/529
2019-01-30 17:42:19 +01:00
Kristoffer Carlsson
f60079d07c add hash verification to datasets 2019-01-30 13:11:26 +01:00
Mike J Innes
0469394715
Merge pull request #576 from mcabbott/patch-1
PermutedDimsArray
2019-01-29 14:55:55 +00:00
Mike J Innes
9e553adbf7 add hessian 2019-01-29 08:37:30 +00:00
Michael Abbott
031d1b3d57
PermutedDimsArray like permutedims
e.g. PermutedDimsArray(rand(2,3) |> param, (2,1))
2019-01-28 18:15:32 +01:00
Mike J Innes
0f8a4a48c6 extend update! with an optimiser 2019-01-28 14:10:09 +00:00
Mike J Innes
0f2975d905 update -> apply 2019-01-28 13:59:23 +00:00
Mike J Innes
bf0b5c5cef
Merge pull request #535 from asbisen/master
fixed stack/unstack function - in utils.jl for v1.0
2019-01-28 12:23:07 +00:00
Mike Innes
af8fdcc7af fix #573 2019-01-28 10:54:58 +00:00
Mike J Innes
013b421b08
Merge pull request #570 from avik-pal/ap/batchnorm_fixes
Patches for default initializers
2019-01-28 10:40:55 +00:00
Mike Innes
1c3a63c42f fixes #574 2019-01-28 10:11:07 +00:00
Mike J Innes
58ac415f6b forward mode 2019-01-25 16:14:24 +00:00
Mike J Innes
2b1a3e92da mapparams 2019-01-25 10:11:46 +00:00
Mike J Innes
791939709b numeric precision utilities 2019-01-25 10:06:37 +00:00
Mike J Innes
1cf37ab9eb rm some old deprecations 2019-01-25 09:54:32 +00:00
Avik Pal
733879681e Change initializer to glorot_uniform 2019-01-24 18:48:30 +05:30
Avik Pal
bb72c528e1 Change initializers to Float32 2019-01-24 18:43:39 +05:30
Mike Innes
ca1c73ed35 fixup 2019-01-24 11:15:57 +00:00
Kristoffer Carlsson
325e3a4f70 clarify docs on single batch image to conv
fixes #309
2019-01-24 11:24:10 +01:00
Mike J Innes
62d780c77f onecold fix 2019-01-24 10:16:41 +00:00
chengchingwen
06003b72c7 noise shape for dropout 2019-01-22 23:51:38 +08:00
Dhairya Gandhi
4be08fe194 remove debug statement 2019-01-22 17:29:12 +05:30
Mike J Innes
152ce4a164 conversions for dual numbers 2019-01-22 10:07:42 +00:00
Mike J Innes
496dbfabd2 make chain collectable 2019-01-22 00:31:55 +00:00
Mike J Innes
f6397e7358
Merge pull request #517 from FluxML/fix_adamw
Fix decay argument in ADAMW
2019-01-18 10:06:23 +00:00
Mike J Innes
058b4dc7fb
Merge pull request #557 from dhairyagandhi96/dg/transpose
fix transpose/ adjoint gradient
2019-01-16 15:46:44 +00:00
Mike J Innes
9d56807bcd cuarrays version check 2019-01-15 11:43:57 -05:00
Dhairya Gandhi
0060cc3453 fixes transpose/ adjoint gradient 2019-01-15 21:59:32 +05:30
Mike J Innes
a3e0de1ee5 fixes #516 2019-01-15 15:49:18 +00:00
Mike J Innes
67d9016319
Merge pull request #538 from KristofferC/kc/promote
fix promotion by avoiding integer division in mse and crossentropy
2019-01-15 13:20:46 +00:00
Kristoffer Carlsson
c74aa67c5d fix promotion by avoiding integer division in mse and crossentropy
oops

add tests
2019-01-15 14:15:05 +01:00
Mike J Innes
827a7b8ed5
Merge pull request #546 from ChrisRackauckas/random
Support random numbers as constants
2019-01-11 10:06:54 +00:00
Mike J Innes
aa1b4f410f simplify 2019-01-11 10:06:14 +00:00
Christopher Rackauckas
f6faa10ee2 remove non-type dispatches 2019-01-10 08:57:10 -08:00
Mike J Innes
f0d5624ed2
Merge pull request #493 from dhairyagandhi96/master
[WIP] New Optimiser Docs
2019-01-10 11:10:38 +00:00
Dhairya Gandhi
4291c1a833 pull master 2019-01-10 16:35:57 +05:30
Mike J Innes
e6f925f977 train docstring simplification 2019-01-10 11:05:21 +00:00
Dhairya Gandhi
f00e1cdedf [docs] replace :stop with Flux.stop() 2019-01-10 16:34:07 +05:30
Mike J Innes
735b970c12 fix update for scalars 2019-01-10 10:19:05 +00:00
Christopher Rackauckas
3ee5a99794 hit all possibilities 2019-01-09 23:15:21 -08:00
Christopher Rackauckas
cf061e9207 support random numbers as constants 2019-01-09 23:04:12 -08:00
Dhairya Gandhi
7484c54f03 fix train! API syntax docstring 2019-01-08 00:32:55 +05:30
Anand Bisen
ec8dde79c3 fixed stack/unstack function - in utils.jl for v1.0 2019-01-03 17:32:11 -08:00
Mike J Innes
6b11c552f3 better h/vcat, fixes #378 2018-12-19 11:19:01 +00:00
Mike J Innes
cdfc97f7c6 fix fix_dec 2018-12-19 11:18:44 +00:00
Dhairya Gandhi
e48268ff06 fix argument name in ADAMW 2018-12-12 16:47:42 +05:30
Tejan Karmali
ed835f26fe printing ConvTranspose layer 2018-12-09 12:50:09 -05:00
Tejan Karmali
1648414a5d fixes for layer and test 2018-12-04 11:08:40 -05:00
Tejan Karmali
519c3db5c0 clean code 2018-11-28 11:48:53 -05:00
Tejan Karmali
95e490a2c5 merge conflict resolved 2018-11-28 11:10:22 -05:00
Tejan Karmali
89f2709b61 resolved conflicts 2018-11-28 11:07:43 -05:00
Tejan Karmali
bc9bda9a85 in accordance with conv_filter api 2018-11-28 10:55:21 -05:00
Tejan Karmali
10f3a8eae2 conv_filter api changes 2018-11-28 10:55:21 -05:00
Tejan Karmali
ca8ad63fb6 in accordance with conv_data api 2018-11-28 10:55:21 -05:00
Tejan Karmali
9c3e34b15f conv_data grad api change 2018-11-28 10:55:21 -05:00
Tejan Karmali
a71ee386d0 1.0 fix for conv transpose 2018-11-28 10:55:21 -05:00
Mike J Innes
a32c8a2e60
Merge pull request #499 from willtebbutt/wct/leq
Deal with <= for TrackedReals
2018-11-28 00:37:32 +00:00
Mike J Innes
dd154ca049
Merge pull request #294 from avik-pal/cudnn_batchnorm
Wrapper for CuDNN BatchNorm
2018-11-27 23:51:32 +00:00
Mike J Innes
1c36504768 fixup 2018-11-27 18:44:07 -05:00
Mike J Innes
7992de5cba update requires syntax 2018-11-27 18:31:05 -05:00
Avik Pal
1d5b3429ea Missing brackets 2018-11-20 09:26:48 +05:30
Will Tebbutt
c7f5026bd9 Deal with <= for TrackedReals 2018-11-18 13:06:32 +00:00
Mike J Innes
3d41dca338 immutable chain 2018-11-16 12:22:15 +00:00
Mike J Innes
6ac5345339 better printing 2018-11-14 23:53:30 +00:00
Mike J Innes
325035cf60 array conversions 2018-11-14 23:48:32 +00:00
Christopher Rackauckas
f20fa65848
Add missing eps overload for TrackedReal
`eps` can be called on the number type as well, and this is missing from the TrackedReal overloads.
2018-11-14 09:58:41 -08:00
Avik Pal
dfd680646c Fix conflict 2018-11-14 22:18:57 +05:30
Mike J Innes
3ef6bfc0ac
Merge pull request #473 from avik-pal/patch-2
Update CUDNN function calls
2018-11-14 16:07:02 +00:00
Mike J Innes
cbc29c889a old cuarrays compat 2018-11-14 10:53:26 -05:00
Mike J Innes
a57f66e58a adapt updates 2018-11-14 15:36:18 +00:00
Mike J Innes
b3331205d1 faster default gradient performance 2018-11-12 23:39:25 +00:00
Mike J Innes
75ecc0b6ba downconversion for conv 2018-11-12 20:21:27 +00:00
Mike J Innes
903db70673 float32 param initialisers 2018-11-12 20:10:47 +00:00
Dhairya Gandhi
1ea8c5a293 [WIP] add docstrings and doc improvements 2018-11-12 19:17:10 +05:30
Avik Pal
d6aacf4135 Fix reshape 2018-11-10 11:43:49 +05:30
Avik Pal
e2ae8b4e8d Fix dimensions 2018-11-10 11:35:58 +05:30
Avik Pal
3bc809f49e dropdims to make the array 2d 2018-11-10 11:25:37 +05:30
Avik Pal
4d703b31a1 Reshape 2D tensors to use cudnn batchnorm 2018-11-08 19:23:07 +05:30
Avik Pal
564518e448 Merge branch 'master' of https://github.com/FluxML/Flux.jl into cudnn_batchnorm 2018-11-08 19:13:34 +05:30
Avik Pal
02efc264e7 Fix unintentional change to spaces 2018-11-08 19:12:38 +05:30
Mike J Innes
30486f9c03
Merge pull request #441 from Paethon/rm_initn
Removes initn initialization
2018-11-08 13:25:02 +00:00
Mike J Innes
5e572df557
Merge pull request #485 from dhairyagandhi96/master
Add call back
2018-11-08 13:18:17 +00:00
Dhairya Gandhi
392c3c942b re-add removed call function 2018-11-08 18:44:57 +05:30
Mike J Innes
a88b7528bf constructor deprecations 2018-11-06 08:19:46 -05:00
Mike J Innes
0c19dad700 include cudnn.jl 2018-11-06 12:39:54 +00:00
Mike J Innes
39dcfd3933
Merge pull request #469 from invenia/ed/hang-draw-and-quarter
Stop type treason with show of the TrackedArray type
2018-11-06 11:54:07 +00:00
Mike J Innes
4763473079 fixed method 2018-11-06 11:50:04 +00:00
Mike J Innes
8042198475
Merge pull request #479 from dhairyagandhi96/master
Fix deprecations of optimisers
2018-11-05 13:01:59 +00:00
Mike J Innes
d071014fae
Merge pull request #448 from JobJob/adam-match-paper
Match paper for Adam implementation and make epsilon use more consistent
2018-11-05 12:57:30 +00:00
Mike J Innes
d0e4fbb1e0 Merge branch 'master' into ed/diagm-pair 2018-11-05 11:51:29 +00:00
Mike J Innes
5df48fbc5d fix 2018-11-05 11:49:38 +00:00
Eric Davies
6b0b51e390 Stop type treason with show of the TrackedArray type 2018-11-02 16:00:58 -05:00
Joel Mason
29832aca92 Move some epsilons about 2018-11-02 22:59:04 +11:00
Dhairya Gandhi
5ec70fe29d allow array parameters to old optimisers 2018-11-01 22:17:54 +05:30
Mike J Innes
c71c610747 separate gradient library 2018-11-01 15:35:55 +00:00
Dhairya Gandhi
ca4e01ac26 use user defined decay in ADAMW 2018-11-01 15:58:40 +05:30
Dhairya Gandhi
58a6c3f225 fix deprecations 2018-11-01 15:02:00 +05:30
Avik Pal
4ba891f666
Remove unnecessary import 2018-11-01 09:37:48 +05:30
Avik Pal
c67e33f387
Make the changes backward compatible 2018-11-01 09:37:16 +05:30
Mike J Innes
43c5f90d93
Merge pull request #379 from dhairyagandhi96/master
New optimisers interface
2018-10-31 16:38:40 +00:00
Mike J Innes
46049b9f44 tweak update rule 2018-10-31 16:08:18 +00:00
Mike J Innes
554c4c7c7a return Params from params 2018-10-31 15:50:08 +00:00
Mike J Innes
4a54d30cbf correct SGD deprecation 2018-10-31 15:30:30 +00:00
Mike J Innes
bffaceee02 tweaks 2018-10-31 14:58:55 +00:00
Mike J Innes
70283e1971
Merge pull request #465 from FluxML/mji/once
Destroy AD graph when doing in-place gradients
2018-10-31 14:14:38 +00:00
Mike J Innes
9312536b96
Merge pull request #461 from Roger-luo/roger-patch-1
Support view for TrackedArray
2018-10-30 15:24:05 +00:00
Mike J Innes
77178b7d67 remove old-style definition and test 2018-10-30 14:21:22 +00:00
Avik Pal
7804d980b2
Update cudnn.jl 2018-10-30 01:08:21 +05:30
Dhairya Gandhi
bebf4eb95f fixed ExpDecay update! rule 2018-10-29 23:12:24 +05:30
Keno Fischer
baf868e851
Add VERSION check around broadcast piracy 2018-10-28 16:07:26 -04:00
Dhairya Gandhi
32ce2d78b8 fixed ExpDecay test 2018-10-27 19:53:06 +05:30
Dhairya Gandhi
815e8c206d decay fixes 2018-10-27 19:26:42 +05:30
Mike J Innes
b77433cdfd 0.7 fix 2018-10-27 12:23:14 +01:00
Eric Davies
9f9803eec6 Add new-style diagm to tracker 2018-10-26 14:44:59 -05:00
Roger-luo
e5d58699e6 fix and add test 2018-10-26 14:06:17 -04:00
Mike J Innes
c21d768b7c destroy AD graph when doing in-place gradients 2018-10-26 16:57:19 +01:00
Tejan Karmali
a657c287d0 in accordance with conv_filter api 2018-10-26 11:31:34 -04:00
Avik Pal
b838c0bc04 Update the libcudnn_handle 2018-10-26 10:24:30 +05:30
Roger-luo
a3cda9016c apply Mike's change 2018-10-25 13:48:33 -04:00
Roger-luo
5f99e5775a fix #458 2018-10-24 15:40:10 -04:00
Tejan Karmali
387df8c095 conv_filter api changes 2018-10-24 13:28:22 -04:00
Tejan Karmali
fca93471b3 in accordance with conv_data api 2018-10-24 12:52:43 -04:00
Avik Pal
ec2c00783d
Add missing export for DepthwiseConv 2018-10-24 22:18:26 +05:30
Tejan Karmali
0dc4ec4d6b conv_data grad api change 2018-10-24 07:04:49 -04:00
Tejan Karmali
f540a0daf7 merge with upstream 2018-10-23 13:40:06 -04:00
Avik Pal
2559e7b4e6 Fix merge conflicts 2018-10-23 21:53:29 +05:30
Mike J Innes
bbccdb3eec
Merge pull request #279 from avik-pal/depthwiseconv
Adds support for Depthwise Convolutions
2018-10-23 17:22:15 +01:00
Mike J Innes
96dbae2d20 Omega and Turing fix 2018-10-23 11:30:37 +01:00
Tejan Karmali
e9bf86dbff Merge branch 'master' of https://github.com/FluxML/Flux.jl into conv_transpose 2018-10-19 02:08:25 -04:00
Sebastian Stabinger
94e5e9f993 Removes initn initialization
Is replaced with glorot_uniform for Conv following Keras
2018-10-17 17:11:16 +02:00
Avik Pal
3899907164
Update conv.jl 2018-10-11 21:39:35 +05:30
Dhairya Gandhi
1f0f2a5ac2 fixed DescentWeightDecay parameters 2018-10-11 10:21:29 +05:30
Dhairya Gandhi
d8394298bb fix merge conflicts 2018-10-11 10:15:59 +05:30
Dhairya Gandhi
fe8c147f72 fixed weight decay definition 2018-10-11 10:07:16 +05:30
Mike J Innes
ab0763fd41
Merge pull request #428 from tejank10/rnn-fixes
[WIP] Fixes for RNN tests
2018-10-10 16:58:44 +01:00
Tejan Karmali
8987e2c423 rm comments 2018-10-10 11:55:10 -04:00
Tejan Karmali
6b4bbd4fce reverted back the weights changes in rnndesc 2018-10-10 10:29:15 -04:00
Mike J Innes
9f6c3d5a2c fixes #403 2018-10-10 12:26:03 +01:00
Tejan Karmali
7b3e9c35ad changed index to view 2018-10-09 12:57:20 -04:00
JohnnyChen
de7623ac94 use variable assignment to do "copy" 2018-10-09 03:49:17 +08:00
JohnnyChen
eaacec852f Bug fix 2018-10-09 03:40:02 +08:00
JohnnyChen
27fec15fcc Add explicit copy(x::TrackedArray) method 2018-10-09 03:34:41 +08:00
Tejan Karmali
4d1a6c305b fixed params getting zero 2018-10-08 13:59:29 -04:00
JohnnyChen
36f5f274a5 Support copy(::TrackedArray)
1. fix issue https://github.com/FluxML/Flux.jl/issues/416
2. change test code to pass the test: some broken tests are not broken now...
2018-10-09 01:53:32 +08:00
Mike J Innes
73385b5dbd
Merge pull request #372 from johnnychen94/issue-#354
Type restriction for Dense layer
2018-10-05 15:03:03 +01:00
Mike Innes
c6740c5cdd fix unbroadcast 2018-10-05 14:14:43 +01:00
Mike J Innes
325d2ce212
Merge pull request #418 from c-p-murphy/add-fashion-mnist
Add FashionMNIST
2018-10-05 14:05:50 +01:00
Mike Innes
61fb6cdf05 jit macro 2018-10-05 14:02:00 +01:00
Mike Innes
69afdd61a6 avoid a warning 2018-10-05 13:59:58 +01:00
Mike Innes
bfe85e65f1 compose tweaks 2018-10-05 13:52:26 +01:00
Mike Innes
0f2019eba5 compose tweaks 2018-10-05 12:57:03 +01:00
Mike Innes
9bc9771a8d tweaks 2018-10-05 12:43:03 +01:00
Mike Innes
4abe518599 newline fixes 2018-10-05 12:37:47 +01:00
Tejan Karmali
2ff54ee0fd cudnn_available() update 2018-10-04 11:31:29 -04:00
Christopher Murphy
73a526b1de reuse utils from mnist.jl 2018-10-03 12:40:24 -04:00
Mike J Innes
683bbec71c
Merge pull request #413 from mcabbott/patch-2
evaluate both 2-ary DiffRules only when needed
2018-10-03 12:02:12 +01:00
Mike J Innes
fe6793fde5
closes #411 2018-10-03 11:45:29 +01:00
Robert Luciani
252e34e173 1.0+ updates - indices to axes, Vector init with undef 2018-10-02 21:39:00 +02:00
Christopher Murphy
95d72d7f79 update comments 2018-10-02 15:31:44 -04:00
Christopher Murphy
aff4c7898e add FashionMNIST 2018-10-01 15:26:26 -04:00
Dhairya Gandhi
b661db3797 added deprecations and compose 2018-10-01 05:30:53 +05:30
Michael Abbott
d25e05d9ee
evaluate both 2-ary DiffRules only when needed 2018-09-27 10:40:44 +02:00
Alex Bird
d131853587 add inv/ldivide/rdivide + test 2018-09-19 13:08:30 +01:00
Dhairya Gandhi
6665189ff1 added remaining optimizers and tests 2018-09-16 17:34:51 +05:30
Isaac Tay
e803117e25
updated loadparams! function 2018-09-15 16:45:04 +08:00
Avik Pal
eb9b408c0f
Merge branch 'master' into depthwiseconv 2018-09-15 10:21:31 +05:30
Mike J Innes
08fb9b7df1
Merge pull request #397 from FluxML/nest-bcast
Nested Derivatives of Broadcast
2018-09-14 20:23:28 +01:00
Mike Innes
d797999fc5 fix sentiment model 2018-09-14 18:10:24 +01:00
Dhairya Gandhi
63bc71698b updated tests 2018-09-14 20:32:56 +05:30
Dhairya Gandhi
4860c1d48b fixed white lines 2018-09-11 18:35:21 +05:30
Dhairya Gandhi
d933f2079b pulled tracker from upstream 2018-09-11 18:30:24 +05:30
Avik Pal
7d06f654f0 Fix tests 2018-09-11 16:58:05 +05:30
Avik Pal
7e7a501efd Fix tests 2018-09-11 16:32:14 +05:30
Avik Pal
c4f87ff15c Minor fixes: 2018-09-11 16:21:55 +05:30
Avik Pal
7e83852862 Fixes 2018-09-11 15:58:17 +05:30
Avik Pal
5fd8ffa47e CuRNN updates 2018-09-11 15:44:07 +05:30
Avik Pal
8bea60d980
Merge branch 'master' into cudnn_batchnorm 2018-09-11 15:34:25 +05:30
Tejan Karmali
e86365ed3f 1.0 fix for conv transpose 2018-09-08 15:44:06 -04:00
James Bradbury
e7783ace12 1.0 compat for normalise 2018-09-06 18:38:11 -07:00
Mike J Innes
6bbed07e96 enable nested broadcast 2018-09-07 02:05:03 +01:00
Dhairya Gandhi
0b440f16ff Merge branch 'master' of https://github.com/FluxML/Flux.jl 2018-09-06 22:48:03 +06:00
Johnny Chen
44049ce00c
Merge branch 'master' into issue-#354 2018-09-06 09:39:31 -05:00
Mike J Innes
5e4ee827e9
Merge pull request #371 from johnnychen94/issue-#323
Fix issue #323
2018-09-06 15:28:15 +01:00
Mike J Innes
ec16a2c77d todone: nicer syntax on 0.7 2018-09-05 15:55:08 +01:00
Mike J Innes
1e0fd07b09 use expand 2018-09-04 14:30:02 +01:00
Mike J Innes
e6be639436 Merge branch 'master' into HEAD 2018-09-04 14:03:46 +01:00
Mike J Innes
93c4a6b4b5 fixes #343 2018-09-04 13:37:54 +01:00
Mike J Innes
a2d2d068aa initial sketch 2018-08-28 17:55:59 +05:30
Mike Innes
53be49b102 fix #377 2018-08-28 11:02:38 +01:00
Mike J Innes
fac06751ea
Merge pull request #361 from dhairyagandhi96/with_stop
Add stop() to train loop when callback conditions are met
2018-08-28 10:56:15 +01:00
Mike Innes
2ca189bc96 newlines 2018-08-28 10:54:50 +01:00
Dhairya Gandhi
89bca2d98d remove merge conflicts 2018-08-28 15:14:12 +05:30