Commit Graph

782 Commits

Author SHA1 Message Date
CarloLucibello
6d3a2a2210 change argument name for better clarity 2017-10-17 21:04:18 +01:00
Mike J Innes
23674b2555 logitcrossentropy tweaks 2017-10-17 17:58:32 +01:00
pevnak
4aa7741ba9 logit cross entropy 2017-10-17 17:57:46 +01:00
Mike J Innes
6dff8ca8d3 rename crossentropy loss 2017-10-17 17:36:18 +01:00
Mike J Innes
1800c8f523 deprecate mapparams 2017-10-17 17:35:30 +01:00
Mike J Innes
949fd9ba97 loss function tweaks 2017-10-17 17:30:11 +01:00
Mike J Innes
c764b74eba rename and fix mapleaves 2017-10-17 01:08:15 +01:00
Mike J Innes
7aa0b43ceb onehot sanity check 2017-10-17 00:07:58 +01:00
Mike J Innes
e02e320008 more general fmap 2017-10-17 00:07:15 +01:00
Mike J Innes
64e242e96c export param 2017-10-16 08:53:46 +01:00
Mike J Innes
d3db051ca0 flip 2017-10-16 08:53:39 +01:00
Mike J Innes
9a155abecd batch and batchseq apis 2017-10-15 23:44:40 +01:00
Mike J Innes
646720cd05 fix 2017-10-15 23:44:16 +01:00
Mike J Innes
c6556a29e6 order-stable params 2017-10-10 12:16:32 +01:00
GenaBitu
ef6d10886d Exposed all optimisers 2017-10-06 14:20:09 +01:00
GenaBitu
2084df96ae Merge branch 'master' into cat-fix 2017-10-06 15:00:26 +02:00
pevnak
bfcc1ac25d exposing optimisers 2017-10-05 12:36:18 +01:00
Mike J Innes
1abc4febe6 more general adaptors 2017-10-04 18:55:56 +01:00
Dave Kleinschmidt
2b95aff158 actually use init argument in LSTMCell 2017-10-03 19:26:42 +01:00
Mike J Innes
5fd1b7d9a2 remove gc hack 2017-10-02 20:50:18 +01:00
Mike J Innes
1b91e6b38d store onehotmatrix height 2017-10-02 20:50:11 +01:00
Mike J Innes
7c8dba0b85 gc in training loop 2017-09-27 23:14:58 +01:00
Mike J Innes
a32ae4914c onehotmatrix cuda support 2017-09-27 22:51:00 +01:00
Mike J Innes
a60a754d68 beginnings of gpu support 2017-09-27 21:58:34 +01:00
Mike J Innes
120a6db2bb Merge branch 'master' of github.com:MikeInnes/Flux.jl 2017-09-27 21:16:23 +01:00
Mike J Innes
4bafa2b374 generic tree functions 2017-09-27 21:11:21 +01:00
Mike J Innes
2ec8401d2c remove compiler 2017-09-27 20:48:39 +01:00
Mike J Innes
94e38c05b8 more informative 2017-09-27 18:33:23 +01:00
GenaBitu
136f9bbf74 Hack which doesn't break backprop 2017-09-22 11:47:04 +02:00
GenaBitu
a5fe5b6e65 Added multi-variable vcat for TrackedVector 2017-09-22 11:22:21 +02:00
Mike J Innes
f2052739c1 tweaks 2017-09-12 14:11:03 +01:00
Mike J Innes
a3fe89e348 rnn tanh by default 2017-09-12 13:12:25 +01:00
Mike J Innes
6728295355 Merge pull request #63 from JobJob/rnncell-args
Enables passing an activation function to RNN/RNNCell
2017-09-12 13:10:43 +01:00
Mike J Innes
28bbef81b9 f 2017-09-12 13:06:32 +01:00
Mike J Innes
972ecab9f9 rm Over Seq 2017-09-12 13:03:16 +01:00
Joel Mason
00439555d1 Enables passing an activation function to RNN/RNNCell
Also, fixes it not using the `init` function provided
2017-09-12 20:54:56 +10:00
Mike J Innes
b9652f1812 typo, fixes #61 2017-09-12 10:45:07 +01:00
Mike J Innes
5f24d61ba3 important 2017-09-11 14:10:12 +01:00
Mike J Innes
7041ab9960 rm chainseq 2017-09-11 14:02:43 +01:00
Mike J Innes
c80fb999ff one hot docs 2017-09-11 13:40:11 +01:00
Mike J Innes
3f83be7bb7 more flexible training loop 2017-09-11 13:11:55 +01:00
Mike J Innes
33a5d26e57 chain utility note 2017-09-09 20:02:48 -04:00
Mike J Innes
fedee95b14 docs updates 2017-09-09 19:58:32 -04:00
Mike J Innes
a36d6d2af3 layer docs 2017-09-08 17:52:41 -04:00
Mike J Innes
f55b8cd20e track -> param 2017-09-07 15:13:04 -04:00
Mike J Innes
085d3aa9b4 handle epoch elsewhere 2017-09-07 00:29:55 -04:00
Mike J Innes
aeaa138b6d cb convenience 2017-09-07 00:27:16 -04:00
Mike J Innes
e837bb0745 rnn stuff 2017-09-07 00:27:04 -04:00
Mike J Innes
a93c440c1e style 2017-09-06 23:09:39 -04:00
Mike J Innes
cca4d25a10 efficient traversal 2017-09-06 23:09:32 -04:00
Mike J Innes
7cfc42d166 grad refactor 2017-09-06 21:21:35 -04:00
Mike J Innes
3ef72a9d7b utils updates 2017-09-06 18:59:43 -04:00
Mike J Innes
4083c34547 seq stuff 2017-09-06 18:59:36 -04:00
Mike J Innes
1855a37319 onehot 2017-09-06 18:58:55 -04:00
Mike J Innes
d7e3f7d6e1 fix stack/squeeze usage 2017-09-06 16:02:38 -04:00
Mike J Innes
1946c46e29 basic seq functionality 2017-09-06 14:03:25 -04:00
Mike J Innes
2c8b7bc64b remove these for now 2017-09-06 14:03:12 -04:00
Mike J Innes
4c12d18033 yet another vcat method 2017-09-05 19:25:42 -04:00
Mike J Innes
c95e9376a5 constructors 2017-09-05 19:25:34 -04:00
Mike J Innes
b023da1b7d lstm initialisation 2017-09-05 02:42:32 -04:00
Mike J Innes
61de692b50 lstm nonzero hidden state 2017-09-05 02:37:48 -04:00
Mike J Innes
ec02f1fabd batching in rnns 2017-09-05 02:29:31 -04:00
Mike J Innes
830d7fa611 vcat fix 2017-09-05 02:28:11 -04:00
Mike J Innes
363caeddc6 repmat forward 2017-09-05 02:12:53 -04:00
Mike J Innes
a322c07fc8 vcat back 2017-09-05 02:11:28 -04:00
Mike J Innes
788d7d35f0 better numeric grads 2017-09-03 17:10:35 -04:00
Mike J Innes
8f4ccdd5ba scalar getindex backprop 2017-09-03 17:10:23 -04:00
Mike J Innes
47ba702747 tweak optimiser interface 2017-09-03 17:10:04 -04:00
Mike J Innes
d4211b1f23 sgd export 2017-09-03 17:09:53 -04:00
Mike J Innes
f33a8edd25 meh 2017-09-03 02:45:46 -04:00
Mike J Innes
e57ae77bbb juno progress 2017-09-03 02:44:32 -04:00
Mike J Innes
bd5822fd71 cleaner lstm 2017-09-03 02:24:47 -04:00
Mike J Innes
cf58748680 nicer trackedarray type printing 2017-09-03 02:12:54 -04:00
Mike J Innes
9642ae8cd6 basic recurrence 2017-09-03 02:12:44 -04:00
Mike J Innes
f6771b98cd clearer name for dense 2017-09-02 16:50:11 -04:00
Mike J Innes
fe2b35facc add callbacks back 2017-09-01 23:59:44 -04:00
Mike J Innes
bf098d551c fuck 2017-09-01 23:41:44 -04:00
Mike J Innes
107d9daa8f add some non-differentiable functions 2017-09-01 23:33:05 -04:00
Mike J Innes
387686eb41 optimisers rework 2017-09-01 17:06:51 -04:00
Mike J Innes
892a779ed1 tracked transpose 2017-09-01 11:42:18 -04:00
Mike J Innes
b95dae1868 opt refactor 2017-08-31 14:55:23 -04:00
Mike J Innes
7cd13789dd fix removed import 2017-08-29 17:14:01 -04:00
ylxdzsw
97ecb26003 wip optimisers 2017-08-29 17:00:24 -04:00
Mike J Innes
7bba38274b Merge branch 'master' of github.com:MikeInnes/Flux.jl 2017-08-28 01:41:11 +01:00
Mike J Innes
0b89e1374c gpu-friendly 2017-08-28 01:40:59 +01:00
Mike J Innes
73166c52a0 cleaner broadcasting fix 2017-08-27 09:49:42 +01:00
Mike J Innes
12dc6b66c5 whoops 2017-08-24 22:23:05 +01:00
Mike J Innes
52f5f4a4c0 initial cuarrays integration 2017-08-24 17:00:48 +01:00
Mike J Innes
e7f26370d7 training tweaks 2017-08-24 16:10:04 +01:00
Mike J Innes
1526b13691 basic training loop 2017-08-24 11:42:29 +01:00
Mike J Innes
9ce0439943 better mse 2017-08-24 11:40:51 +01:00
Mike J Innes
23690e0083 not useful enough 2017-08-24 11:40:19 +01:00
Mike J Innes
d162e028bb utility method 2017-08-23 17:50:49 +01:00
Mike J Innes
e4e9794f5e loss function gradients 2017-08-23 17:50:43 +01:00
Mike J Innes
60c3090981 broadcasting fix 2017-08-23 17:21:02 +01:00
Mike J Innes
23c5a1b163 softmax gradient 2017-08-23 02:03:17 +01:00
Mike J Innes
5eee653a64 gradient checks 2017-08-23 01:43:45 +01:00
Mike J Innes
56ed6f5680 de-broadcasting 2017-08-23 00:25:19 +01:00
Mike J Innes
bafecfede1 sgd 2017-08-22 22:25:18 +01:00
Mike J Innes
f2dd7b0e90 fix include case 2017-08-22 17:18:27 +01:00
Mike J Innes
0ce8c0cee4 param collection 2017-08-22 17:13:03 +01:00
Mike J Innes
1179269355 remove old params 2017-08-22 15:21:08 +01:00
Mike J Innes
f43eef8d7d consistent case 2017-08-22 15:13:24 +01:00
Mike J Innes
cd45df1eca vector sum 2017-08-22 15:12:12 +01:00
Mike J Innes
97af9db181 scalar sum 2017-08-22 12:24:08 +01:00
Mike J Innes
ef681f16ea use nnlib for activations 2017-08-21 17:53:04 +01:00
Mike J Innes
3e0f45046c nicer show 2017-08-21 17:20:09 +01:00
Mike J Innes
227e41c37b gpu-friendly dualify 2017-08-21 16:35:39 +01:00
Mike J Innes
65a49188e6 mat-vec mul 2017-08-20 13:48:43 +01:00
Mike J Innes
09974caba0 extend affine with activation 2017-08-20 13:35:35 +01:00
Mike J Innes
8e59160df6 inferable broadcast 2017-08-20 13:35:20 +01:00
Mike J Innes
18e69b33c9 forwarddiff does these 2017-08-19 22:05:50 +01:00
Mike J Innes
ad0e0ea5a7 explicitly broadcast sigmoid 2017-08-19 22:04:47 +01:00
Mike J Innes
4a9dc40e7c simplify organisation 2017-08-19 20:52:29 +01:00
Mike J Innes
318e503d9b remove rnns 2017-08-19 20:52:17 +01:00
Mike J Innes
a581856954 remove params from compiler 2017-08-19 20:38:45 +01:00
Mike J Innes
b18d8cd08f rename chain.jl 2017-08-19 20:26:07 +01:00
Mike J Innes
0222103c7f fix these later 2017-08-19 20:05:22 +01:00
Mike J Innes
bd6bffde48 silo the compiler 2017-08-19 20:04:21 +01:00
Mike J Innes
8ed4d569b3 export from flux 2017-08-19 18:23:21 +01:00
Mike J Innes
f12b1d0ca1 avoid back! fallback 2017-08-19 17:40:07 +01:00
Mike J Innes
c9eb58f146 printing tweaks 2017-08-19 16:38:40 +01:00
Mike J Innes
de76472d99 exports 2017-08-19 16:20:53 +01:00
Mike J Innes
6c1a38e157 some more derivatives 2017-08-19 16:02:19 +01:00
Mike J Innes
380d32dac9 remove Core for now 2017-08-19 16:01:32 +01:00
Mike J Innes
70393138bc submodule 2017-08-19 11:11:25 +01:00
Mike J Innes
1889ccd316 tracker submodule 2017-08-19 11:06:19 +01:00
Mike J Innes
15c4f38130 printing 2017-08-19 11:00:55 +01:00
Mike J Innes
f1dda12a54 matmul 2017-08-19 10:14:50 +01:00
Mike J Innes
d9c30db2e3 AD skeleton 2017-08-18 16:50:27 +01:00
Mike J Innes
f8482ff80c remove back!, update! 2017-08-18 10:18:45 +01:00
Mike J Innes
21089fea9c remove some explicit back! 2017-08-18 01:19:10 +01:00
Mike J Innes
052cc52ada remove inference, for now 2017-08-18 01:19:10 +01:00
Mike J Innes
e79a1657d4 remove batching and training 2017-08-18 01:19:06 +01:00
Mike J Innes
5f9d8702a4 remove old cb macro 2017-08-18 01:07:58 +01:00
Mike J Innes
bfe1f8eeee shouldn't do this 2017-08-18 01:05:46 +01:00
Mike J Innes
4ed0929ac1 to be rewritten 2017-08-18 01:05:46 +01:00
Mike J Innes
8869f1f953 these are now useless 2017-08-18 01:05:46 +01:00
Mike J Innes
536949891d remove backends 2017-08-18 01:05:42 +01:00
Mike J Innes
cd9521a762 Merge branch 'master' into tf-train 2017-08-17 23:52:12 +01:00
Mike J Innes
f5b41d2e89 Merge pull request #50 from alha02/add-more-tf-ops
Add more tf ops
2017-08-17 21:47:47 +01:00
ylxdzsw
74a4a48162 fix 2017-08-01 13:28:14 +08:00
Mike J Innes
3000c7bbcb style tweak 2017-07-27 20:43:53 +01:00
ylxdzsw
88fa163c95 throttle 2017-07-21 16:31:12 +08:00
ylxdzsw
dd82edd818 fix training on the tensorflow backend 2017-07-21 11:22:40 +08:00
Ali Hamdi
ec7a0bd8f7 add expand_dims 2017-07-08 11:59:17 +02:00
Ali Hamdi
6d106c914d add randn 2017-07-08 11:59:17 +02:00
Ali Hamdi
cd8c2c2f83 add randu 2017-07-08 11:59:17 +02:00
Ali Hamdi
c8d320233a add triangular_solve 2017-07-08 11:59:17 +02:00
Ali Hamdi
015e373a34 add solve 2017-07-08 11:59:17 +02:00
Ali Hamdi
edbf698ed9 add cast 2017-07-08 11:59:17 +02:00
Ali Hamdi
f25f985109 add fill 2017-07-08 11:59:17 +02:00
Ali Hamdi
2fce3b195e add tile 2017-07-08 11:59:17 +02:00
Ali Hamdi
7aad224206 add reshape 2017-07-08 11:58:31 +02:00
Mike J Innes
e915f614aa remove confusing comments 2017-07-03 20:47:41 +01:00
Mike J Innes
e3b432165c Merge pull request #43 from alha02/add-more-tf-ops
Add tf ops: diag, diagm, size, chol
2017-07-03 18:48:32 +01:00
Mike J Innes
7e4801832b Merge pull request #44 from ylxdzsw/train-naive
training julia models
2017-07-03 18:41:39 +01:00
Mike J Innes
ea5d43ed77 show methods 2017-06-19 16:49:29 -07:00
Mike J Innes
0a6b6e1d6f cleaner chunk util 2017-06-19 16:49:17 -07:00
Mike J Innes
b3ef370456 inner constructor depwarn 2017-06-19 16:49:04 -07:00
Mike J Innes
5ee6e8f504 overload correct repeated 2017-06-19 16:48:52 -07:00
Mike J Innes
7d2a34b55d unroll multiple inputs 2017-06-17 19:21:39 -07:00
Mike J Innes
8306ed2ed7 don't force params here 2017-06-17 19:18:09 -07:00
ylxdzsw
c9ae219613 simplify back! of Chain 2017-06-14 21:58:43 +08:00
ylxdzsw
cca21a617c training julia models 2017-06-12 19:39:34 +08:00
Ali Hamdi
422b3bd5d8 Merge remote-tracking branch 'origin/add-more-tf-ops-2' into add-more-tf-ops 2017-06-10 11:45:30 +02:00
Ali Hamdi
5143410313 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-10 11:43:27 +02:00
Mike J Innes
e4a7e7b335 more general seq/tuple support 2017-06-10 08:33:17 +01:00
Ali Hamdi
0b9436cfcd add chol 2017-06-09 21:50:25 +02:00
Ali Hamdi
7559196ab9 add size 2017-06-09 21:46:29 +02:00
Ali Hamdi
697a300ec3 add diagm 2017-06-09 21:35:31 +02:00
Ali Hamdi
6dfb554e95 add diag 2017-06-09 21:22:02 +02:00
Ali Hamdi
e6db3b0e89 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-09 20:53:17 +02:00
Mike J Innes
358ba650ad more robust batches 2017-06-09 18:57:18 +01:00
Mike J Innes
65400f20ab nested batch tweaks 2017-06-09 18:55:21 +01:00
Mike J Innes
9c8dbb6b4b feedforward fix 2017-06-09 18:54:35 +01:00
Ali Hamdi
d80c1abf5e Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-09 17:55:00 +02:00
Mike J Innes
9b18fd639a mxarray ctx 2017-06-09 14:29:47 +01:00
Ali Hamdi
6b4db60f05 add length 2017-06-09 12:13:30 +02:00
Ali Hamdi
74b36b0094 add cat 2017-06-09 12:09:20 +02:00
Ali Hamdi
57c733af67 add permutedims 2017-06-09 11:55:26 +02:00
Ali Hamdi
d78963ab27 add transpose 2017-06-09 07:38:07 +02:00
Ali Hamdi
a098158ff5 Merge branch 'master' of https://github.com/MikeInnes/Flux.jl into add-more-tf-ops 2017-06-09 07:15:20 +02:00
Mike J Innes
1cc8100456 ctx methods for seq models 2017-06-09 01:35:07 +01:00
Mike J Innes
020ae616cc custom mxnet context 2017-06-09 00:56:52 +01:00
Mike J Innes
fe0bddd98d pass args correctly 2017-06-09 00:55:54 +01:00
Ali Hamdi
c350bfb672 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops 2017-06-08 11:59:59 +02:00
Mike J Innes
4ccbbbb284 dup params fix 2017-06-08 10:49:39 +01:00
Mike J Innes
ab0f184d29 old import 2017-06-08 10:47:58 +01:00
Ali Hamdi
547ca64206 add det 2017-06-08 08:16:12 +02:00
Ali Hamdi
92616a8c3d add inv 2017-06-08 07:31:23 +02:00
Ali Hamdi
ee2e388228 add svd 2017-06-08 07:29:17 +02:00
Mike J Innes
dac80081ba these are probably the wrong abstraction 2017-06-06 18:03:36 +01:00
Mike J Innes
72e8e5c7f9 remove iterators 2017-06-06 18:03:14 +01:00
Mike J Innes
66c0403ffd seq rpad 2017-06-06 17:15:29 +01:00
Mike J Innes
e5df1c31dd batch vcat 2017-06-06 17:15:22 +01:00
Mike J Innes
3c52622896 more constructors 2017-06-06 16:58:57 +01:00
Mike J Innes
5186aa392f refactor 2017-06-06 15:59:10 +01:00
Mike J Innes
21fd79da16 fix 2017-06-05 22:51:08 +01:00
Mike J Innes
13f4af2492 generic and consistent conversions 2017-06-05 22:49:31 +01:00