Commit Graph

971 Commits

Author SHA1 Message Date
Mike J Innes
efa51f02e7 basic batch type 2017-11-02 11:49:42 +00:00
Mike J Innes
21ea93ffcd rename treelike 2017-11-02 11:47:34 +00:00
Iblis Lin
6c7613e02b batchnorm: leverage TrackedArray mean 2017-11-02 14:20:34 +08:00
Iblis Lin
88bd8a8fbd batchnorm: make CuArrays happy 2017-11-02 14:02:41 +08:00
Iblis Lin
477da75428 batchnorm: fix mapchildren 2017-11-02 13:32:12 +08:00
Iblis Lin
5253841acc batchnorm: update docs 2017-11-02 13:32:12 +08:00
Iblis Lin
b3356cc6bb batchnorm: batch σ correct coefficient 2017-11-02 13:32:12 +08:00
Iblis Lin
e0201be770 batchnorm: parameterize momentum and epsilon 2017-11-02 13:32:12 +08:00
Iblis Lin
669273b008 layer: implement BatchNorm layer
See [Batch Normalization: Accelerating Deep Network Training by Reducing
Internal Covariate Shift](https://arxiv.org/pdf/1502.03167.pdf)
2017-11-02 13:32:12 +08:00
Mike J Innes
e7a510da9a add cmudict dataset 2017-11-01 16:01:55 +00:00
Mike J Innes
0f8ba87dc6 treelike tuples 2017-10-31 16:37:41 +00:00
Mike J Innes
e943a39ee7 combine special cases 2017-10-31 16:37:33 +00:00
Iblis Lin
3d8b7250ae add scalar mean 2017-10-31 10:42:32 +00:00
Iblis Lin
c43bda019b TrackedArray: implement mean
```julia
julia> p
Tracked 2×3 Array{Float64,2}:
 1.0  3.0  5.0
 2.0  4.0  6.0
```

Before
```julia
julia> @benchmark Flux.Tracker.back!(sum($p, 2) ./ size($p, 2), ones(2, 1))
BenchmarkTools.Trial:
  memory estimate:  3.44 KiB
  allocs estimate:  75
  --------------
  minimum time:     20.438 μs (0.00% GC)
  median time:      21.239 μs (0.00% GC)
  mean time:        22.354 μs (1.68% GC)
  maximum time:     3.811 ms (98.51% GC)
  --------------
  samples:          10000
  evals/sample:     1
```

After
```julia
julia> @benchmark Flux.Tracker.back!(mean($p, 2), ones(2, 1))
BenchmarkTools.Trial:
  memory estimate:  1008 bytes
  allocs estimate:  21
  --------------
  minimum time:     5.973 μs (0.00% GC)
  median time:      6.310 μs (0.00% GC)
  mean time:        6.630 μs (1.96% GC)
  maximum time:     680.709 μs (97.28% GC)
  --------------
  samples:          10000
  evals/sample:     6
```
2017-10-30 16:21:02 +08:00
Mike J Innes
4c1b1eb18c Merge pull request #92 from CarloLucibello/drop
add Dropout layer
2017-10-26 12:07:28 +01:00
Mike J Innes
84efbbcc84 tracker predicate tweaks 2017-10-26 12:06:29 +01:00
Mike J Innes
cf6b930f63 reorganise 2017-10-26 11:46:12 +01:00
Mike J Innes
0df300299f clearer error message, fixes #93 2017-10-26 11:15:14 +01:00
GenaBitu
df06c3351d
Merge branch 'master' into cat-fix 2017-10-26 00:52:29 +02:00
CarloLucibello
711ea09d99 address comments 2017-10-25 02:35:27 +02:00
CarloLucibello
536ab3861d setmode! -> testmode! 2017-10-23 16:23:29 +02:00
CarloLucibello
00a9e5f01f construct TrackedScalar with params(1) 2017-10-23 10:49:45 +01:00
CarloLucibello
86c7c9246e add == and < for tracked arrays 2017-10-23 11:41:08 +02:00
CarloLucibello
2e1ed4c3fc add dropout 2017-10-23 10:12:53 +02:00
Mike J Innes
2a66545ef8 rnn state reset 2017-10-19 17:21:08 +01:00
Mike J Innes
99a7697d13 adam eta default arg 2017-10-19 14:31:34 +01:00
Mike J Innes
e5c8f6d835 only export known good optimisers 2017-10-19 11:26:11 +01:00
Mike J Innes
5b6a5667ed tracked array restructure 2017-10-18 22:54:58 +01:00
Mike J Innes
c8d4844da4 chunk util 2017-10-18 17:07:58 +01:00
Mike J Innes
07ad7cfa40 learning rate as default arg 2017-10-18 17:07:49 +01:00
Mike J Innes
e82428bb83 batching docs 2017-10-18 16:40:14 +01:00
Mike J Innes
b817ce632c syntax highlighting 2017-10-18 15:44:06 +01:00
Mike J Innes
fd249b773e rnn docs 2017-10-18 15:30:05 +01:00
Mike J Innes
190f48a709 nnlib docs 2017-10-18 14:40:58 +01:00
Mike J Innes
12944ae125 nnlib exports 2017-10-18 12:56:58 +01:00
Mike J Innes
0fbc8dff61 typoe 2017-10-18 12:48:58 +01:00
Mike J Innes
d6dd27dae5 dense layer example 2017-10-18 12:47:45 +01:00
Mike J Innes
7426faf37d optimiser docs 2017-10-18 12:09:48 +01:00
CarloLucibello
041079237e add docsting to train! 2017-10-17 21:04:18 +01:00
CarloLucibello
6d3a2a2210 change argument name for better clarity 2017-10-17 21:04:18 +01:00
Mike J Innes
23674b2555 logitcrossentropy tweaks 2017-10-17 17:58:32 +01:00
pevnak
4aa7741ba9 logit cross entropy 2017-10-17 17:57:46 +01:00
Mike J Innes
6dff8ca8d3 rename crossentropy loss 2017-10-17 17:36:18 +01:00
Mike J Innes
1800c8f523 deprecate mapparams 2017-10-17 17:35:30 +01:00
Mike J Innes
949fd9ba97 loss function tweaks 2017-10-17 17:30:11 +01:00
Mike J Innes
c764b74eba rename and fix mapleaves 2017-10-17 01:08:15 +01:00
Mike J Innes
7aa0b43ceb onehot sanity check 2017-10-17 00:07:58 +01:00
Mike J Innes
e02e320008 more general fmap 2017-10-17 00:07:15 +01:00
Mike J Innes
64e242e96c export param 2017-10-16 08:53:46 +01:00
Mike J Innes
d3db051ca0 flip 2017-10-16 08:53:39 +01:00
Mike J Innes
9a155abecd batch and batchseq apis 2017-10-15 23:44:40 +01:00
Mike J Innes
646720cd05 fix 2017-10-15 23:44:16 +01:00
Mike J Innes
c6556a29e6 order-stable params 2017-10-10 12:16:32 +01:00
GenaBitu
ef6d10886d Exposed all optimisers 2017-10-06 14:20:09 +01:00
GenaBitu
2084df96ae
Merge branch 'master' into cat-fix 2017-10-06 15:00:26 +02:00
pevnak
bfcc1ac25d exposing optimisers 2017-10-05 12:36:18 +01:00
Mike J Innes
1abc4febe6 more general adaptors 2017-10-04 18:55:56 +01:00
Dave Kleinschmidt
2b95aff158 actually use init argument in LSTMCell 2017-10-03 19:26:42 +01:00
Mike J Innes
5fd1b7d9a2 remove gc hack 2017-10-02 20:50:18 +01:00
Mike J Innes
1b91e6b38d store onehotmatrix height 2017-10-02 20:50:11 +01:00
Mike J Innes
7c8dba0b85 gc in training loop 2017-09-27 23:14:58 +01:00
Mike J Innes
a32ae4914c onehotmatrix cuda support 2017-09-27 22:51:00 +01:00
Mike J Innes
a60a754d68 beginnings of gpu support 2017-09-27 21:58:34 +01:00
Mike J Innes
120a6db2bb Merge branch 'master' of github.com:MikeInnes/Flux.jl 2017-09-27 21:16:23 +01:00
Mike J Innes
4bafa2b374 generic tree functions 2017-09-27 21:11:21 +01:00
Mike J Innes
2ec8401d2c remove compiler 2017-09-27 20:48:39 +01:00
Mike J Innes
94e38c05b8 more informative 2017-09-27 18:33:23 +01:00
GenaBitu
136f9bbf74
Hack which doesn't break backprop 2017-09-22 11:47:04 +02:00
GenaBitu
a5fe5b6e65
Added multi-variable vcat for TrackedVector 2017-09-22 11:22:21 +02:00
Mike J Innes
f2052739c1 tweaks 2017-09-12 14:11:03 +01:00
Mike J Innes
a3fe89e348 rnn tanh by default 2017-09-12 13:12:25 +01:00
Mike J Innes
6728295355 Merge pull request #63 from JobJob/rnncell-args
Enables passing an activation function to RNN/RNNCell
2017-09-12 13:10:43 +01:00
Mike J Innes
28bbef81b9 f 2017-09-12 13:06:32 +01:00
Mike J Innes
972ecab9f9 rm Over Seq 2017-09-12 13:03:16 +01:00
Joel Mason
00439555d1 Enables passing an activation function to RNN/RNNCell
Also, fixes it not using the `init` function provided
2017-09-12 20:54:56 +10:00
Mike J Innes
b9652f1812 typo, fixes #61 2017-09-12 10:45:07 +01:00
Mike J Innes
5f24d61ba3 important 2017-09-11 14:10:12 +01:00
Mike J Innes
7041ab9960 rm chainseq 2017-09-11 14:02:43 +01:00
Mike J Innes
c80fb999ff one hot docs 2017-09-11 13:40:11 +01:00
Mike J Innes
3f83be7bb7 more flexible training loop 2017-09-11 13:11:55 +01:00
Mike J Innes
33a5d26e57 chain utility note 2017-09-09 20:02:48 -04:00
Mike J Innes
fedee95b14 docs updates 2017-09-09 19:58:32 -04:00
Mike J Innes
a36d6d2af3 layer docs 2017-09-08 17:52:41 -04:00
Mike J Innes
f55b8cd20e track -> param 2017-09-07 15:13:04 -04:00
Mike J Innes
085d3aa9b4 handle epoch elsewhere 2017-09-07 00:29:55 -04:00
Mike J Innes
aeaa138b6d cb convenience 2017-09-07 00:27:16 -04:00
Mike J Innes
e837bb0745 rnn stuff 2017-09-07 00:27:04 -04:00
Mike J Innes
a93c440c1e style 2017-09-06 23:09:39 -04:00
Mike J Innes
cca4d25a10 efficient traversal 2017-09-06 23:09:32 -04:00
Mike J Innes
7cfc42d166 grad refactor 2017-09-06 21:21:35 -04:00
Mike J Innes
3ef72a9d7b utils updates 2017-09-06 18:59:43 -04:00
Mike J Innes
4083c34547 seq stuff 2017-09-06 18:59:36 -04:00
Mike J Innes
1855a37319 onehot 2017-09-06 18:58:55 -04:00
Mike J Innes
d7e3f7d6e1 fix stack/squeeze usage 2017-09-06 16:02:38 -04:00
Mike J Innes
1946c46e29 basic seq functionality 2017-09-06 14:03:25 -04:00
Mike J Innes
2c8b7bc64b remove these for now 2017-09-06 14:03:12 -04:00
Mike J Innes
4c12d18033 yet another vcat method 2017-09-05 19:25:42 -04:00
Mike J Innes
c95e9376a5 constructors 2017-09-05 19:25:34 -04:00
Mike J Innes
b023da1b7d lstm initialisation 2017-09-05 02:42:32 -04:00
Mike J Innes
61de692b50 lstm nonzero hidden state 2017-09-05 02:37:48 -04:00
Mike J Innes
ec02f1fabd batching in rnns 2017-09-05 02:29:31 -04:00
Mike J Innes
830d7fa611 vcat fix 2017-09-05 02:28:11 -04:00
Mike J Innes
363caeddc6 repmat forward 2017-09-05 02:12:53 -04:00
Mike J Innes
a322c07fc8 vcat back 2017-09-05 02:11:28 -04:00
Mike J Innes
788d7d35f0 better numeric grads 2017-09-03 17:10:35 -04:00
Mike J Innes
8f4ccdd5ba scalar getindex backprop 2017-09-03 17:10:23 -04:00
Mike J Innes
47ba702747 tweak optimiser interface 2017-09-03 17:10:04 -04:00
Mike J Innes
d4211b1f23 sgd export 2017-09-03 17:09:53 -04:00
Mike J Innes
f33a8edd25 meh 2017-09-03 02:45:46 -04:00
Mike J Innes
e57ae77bbb juno progress 2017-09-03 02:44:32 -04:00
Mike J Innes
bd5822fd71 cleaner lstm 2017-09-03 02:24:47 -04:00
Mike J Innes
cf58748680 nicer trackedarray type printing 2017-09-03 02:12:54 -04:00
Mike J Innes
9642ae8cd6 basic recurrence 2017-09-03 02:12:44 -04:00
Mike J Innes
f6771b98cd clearer name for dense 2017-09-02 16:50:11 -04:00
Mike J Innes
fe2b35facc add callbacks back 2017-09-01 23:59:44 -04:00
Mike J Innes
bf098d551c fuck 2017-09-01 23:41:44 -04:00
Mike J Innes
107d9daa8f add some non-differentiable functions 2017-09-01 23:33:05 -04:00
Mike J Innes
387686eb41 optimisers rework 2017-09-01 17:06:51 -04:00
Mike J Innes
892a779ed1 tracked transpose 2017-09-01 11:42:18 -04:00
Mike J Innes
b95dae1868 opt refactor 2017-08-31 14:55:23 -04:00
Mike J Innes
7cd13789dd fix removed import 2017-08-29 17:14:01 -04:00
ylxdzsw
97ecb26003 wip optimisers 2017-08-29 17:00:24 -04:00
Mike J Innes
7bba38274b Merge branch 'master' of github.com:MikeInnes/Flux.jl 2017-08-28 01:41:11 +01:00
Mike J Innes
0b89e1374c gpu-friendly 2017-08-28 01:40:59 +01:00
Mike J Innes
73166c52a0 cleaner broadcasting fix 2017-08-27 09:49:42 +01:00
Mike J Innes
12dc6b66c5 whoops 2017-08-24 22:23:05 +01:00
Mike J Innes
52f5f4a4c0 initial cuarrays integration 2017-08-24 17:00:48 +01:00
Mike J Innes
e7f26370d7 training tweaks 2017-08-24 16:10:04 +01:00
Mike J Innes
1526b13691 basic training loop 2017-08-24 11:42:29 +01:00
Mike J Innes
9ce0439943 better mse 2017-08-24 11:40:51 +01:00
Mike J Innes
23690e0083 not useful enough 2017-08-24 11:40:19 +01:00
Mike J Innes
d162e028bb utility method 2017-08-23 17:50:49 +01:00
Mike J Innes
e4e9794f5e loss function gradients 2017-08-23 17:50:43 +01:00
Mike J Innes
60c3090981 broadcasting fix 2017-08-23 17:21:02 +01:00
Mike J Innes
23c5a1b163 softmax gradient 2017-08-23 02:03:17 +01:00
Mike J Innes
5eee653a64 gradient checks 2017-08-23 01:43:45 +01:00
Mike J Innes
56ed6f5680 de-broadcasting 2017-08-23 00:25:19 +01:00
Mike J Innes
bafecfede1 sgd 2017-08-22 22:25:18 +01:00
Mike J Innes
f2dd7b0e90 fix include case 2017-08-22 17:18:27 +01:00
Mike J Innes
0ce8c0cee4 param collection 2017-08-22 17:13:03 +01:00
Mike J Innes
1179269355 remove old params 2017-08-22 15:21:08 +01:00
Mike J Innes
f43eef8d7d consistent case 2017-08-22 15:13:24 +01:00
Mike J Innes
cd45df1eca vector sum 2017-08-22 15:12:12 +01:00
Mike J Innes
97af9db181 scalar sum 2017-08-22 12:24:08 +01:00
Mike J Innes
ef681f16ea use nnlib for activations 2017-08-21 17:53:04 +01:00
Mike J Innes
3e0f45046c nicer show 2017-08-21 17:20:09 +01:00
Mike J Innes
227e41c37b gpu-friendly dualify 2017-08-21 16:35:39 +01:00
Mike J Innes
65a49188e6 mat-vec mul 2017-08-20 13:48:43 +01:00
Mike J Innes
09974caba0 extend affine with activation 2017-08-20 13:35:35 +01:00
Mike J Innes
8e59160df6 inferable broadcast 2017-08-20 13:35:20 +01:00
Mike J Innes
18e69b33c9 forwarddiff does these 2017-08-19 22:05:50 +01:00
Mike J Innes
ad0e0ea5a7 explicitly broadcast sigmoid 2017-08-19 22:04:47 +01:00
Mike J Innes
4a9dc40e7c simplify organisation 2017-08-19 20:52:29 +01:00
Mike J Innes
318e503d9b remove rnns 2017-08-19 20:52:17 +01:00
Mike J Innes
a581856954 remove params from compiler 2017-08-19 20:38:45 +01:00
Mike J Innes
b18d8cd08f rename chain.jl 2017-08-19 20:26:07 +01:00
Mike J Innes
0222103c7f fix these later 2017-08-19 20:05:22 +01:00
Mike J Innes
bd6bffde48 silo the compiler 2017-08-19 20:04:21 +01:00
Mike J Innes
8ed4d569b3 export from flux 2017-08-19 18:23:21 +01:00
Mike J Innes
f12b1d0ca1 avoid back! fallback 2017-08-19 17:40:07 +01:00
Mike J Innes
c9eb58f146 printing tweaks 2017-08-19 16:38:40 +01:00
Mike J Innes
de76472d99 exports 2017-08-19 16:20:53 +01:00
Mike J Innes
6c1a38e157 some more derivatives 2017-08-19 16:02:19 +01:00
Mike J Innes
380d32dac9 remove Core for now 2017-08-19 16:01:32 +01:00
Mike J Innes
70393138bc submodule 2017-08-19 11:11:25 +01:00
Mike J Innes
1889ccd316 tracker submodule 2017-08-19 11:06:19 +01:00
Mike J Innes
15c4f38130 printing 2017-08-19 11:00:55 +01:00
Mike J Innes
f1dda12a54 matmul 2017-08-19 10:14:50 +01:00
Mike J Innes
d9c30db2e3 AD skeleton 2017-08-18 16:50:27 +01:00
Mike J Innes
f8482ff80c remove back!, update! 2017-08-18 10:18:45 +01:00
Mike J Innes
21089fea9c remove some explicit back! 2017-08-18 01:19:10 +01:00
Mike J Innes
052cc52ada remove inference, for now 2017-08-18 01:19:10 +01:00
Mike J Innes
e79a1657d4 remove batching and training 2017-08-18 01:19:06 +01:00
Mike J Innes
5f9d8702a4 remove old cb macro 2017-08-18 01:07:58 +01:00
Mike J Innes
bfe1f8eeee shouldn't do this 2017-08-18 01:05:46 +01:00
Mike J Innes
4ed0929ac1 to be rewritten 2017-08-18 01:05:46 +01:00
Mike J Innes
8869f1f953 these are now useless 2017-08-18 01:05:46 +01:00
Mike J Innes
536949891d remove backends 2017-08-18 01:05:42 +01:00
Mike J Innes
cd9521a762 Merge branch 'master' into tf-train 2017-08-17 23:52:12 +01:00
Mike J Innes
f5b41d2e89 Merge pull request #50 from alha02/add-more-tf-ops
Add more tf ops
2017-08-17 21:47:47 +01:00
ylxdzsw
74a4a48162 fix 2017-08-01 13:28:14 +08:00
Mike J Innes
3000c7bbcb style tweak 2017-07-27 20:43:53 +01:00
ylxdzsw
88fa163c95 throttle 2017-07-21 16:31:12 +08:00
ylxdzsw
dd82edd818 fix training on the tensorflow backend 2017-07-21 11:22:40 +08:00
Ali Hamdi
ec7a0bd8f7 add expand_dims 2017-07-08 11:59:17 +02:00
Ali Hamdi
6d106c914d add randn 2017-07-08 11:59:17 +02:00
Ali Hamdi
cd8c2c2f83 add randu 2017-07-08 11:59:17 +02:00
Ali Hamdi
c8d320233a add triangular_solve 2017-07-08 11:59:17 +02:00
Ali Hamdi
015e373a34 add solve 2017-07-08 11:59:17 +02:00
Ali Hamdi
edbf698ed9 add cast 2017-07-08 11:59:17 +02:00
Ali Hamdi
f25f985109 add fill 2017-07-08 11:59:17 +02:00
Ali Hamdi
2fce3b195e add tile 2017-07-08 11:59:17 +02:00
Ali Hamdi
7aad224206 add reshape 2017-07-08 11:58:31 +02:00
Mike J Innes
e915f614aa remove confusing comments 2017-07-03 20:47:41 +01:00
Mike J Innes
e3b432165c Merge pull request #43 from alha02/add-more-tf-ops
Add tf ops: diag, diagm, size, chol
2017-07-03 18:48:32 +01:00
Mike J Innes
7e4801832b Merge pull request #44 from ylxdzsw/train-naive
training julia models
2017-07-03 18:41:39 +01:00
Mike J Innes
ea5d43ed77 show methods 2017-06-19 16:49:29 -07:00
Mike J Innes
0a6b6e1d6f cleaner chunk util 2017-06-19 16:49:17 -07:00
Mike J Innes
b3ef370456 inner constructor depwarn 2017-06-19 16:49:04 -07:00
Mike J Innes
5ee6e8f504 overload correct repeated 2017-06-19 16:48:52 -07:00
Mike J Innes
7d2a34b55d unroll multiple inputs 2017-06-17 19:21:39 -07:00
Mike J Innes
8306ed2ed7 don't force params here 2017-06-17 19:18:09 -07:00
ylxdzsw
c9ae219613 simplify back! of Chain 2017-06-14 21:58:43 +08:00
ylxdzsw
cca21a617c training julia models 2017-06-12 19:39:34 +08:00
Ali Hamdi
422b3bd5d8 Merge remote-tracking branch 'origin/add-more-tf-ops-2' into add-more-tf-ops 2017-06-10 11:45:30 +02:00
Ali Hamdi
5143410313 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-10 11:43:27 +02:00
Mike J Innes
e4a7e7b335 more general seq/tuple support 2017-06-10 08:33:17 +01:00
Ali Hamdi
0b9436cfcd add chol 2017-06-09 21:50:25 +02:00
Ali Hamdi
7559196ab9 add size 2017-06-09 21:46:29 +02:00
Ali Hamdi
697a300ec3 add diagm 2017-06-09 21:35:31 +02:00
Ali Hamdi
6dfb554e95 add diag 2017-06-09 21:22:02 +02:00
Ali Hamdi
e6db3b0e89 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-09 20:53:17 +02:00
Mike J Innes
358ba650ad more robust batches 2017-06-09 18:57:18 +01:00
Mike J Innes
65400f20ab nested batch tweaks 2017-06-09 18:55:21 +01:00
Mike J Innes
9c8dbb6b4b feedforward fix 2017-06-09 18:54:35 +01:00
Ali Hamdi
d80c1abf5e Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-09 17:55:00 +02:00
Mike J Innes
9b18fd639a mxarray ctx 2017-06-09 14:29:47 +01:00
Ali Hamdi
6b4db60f05 add length 2017-06-09 12:13:30 +02:00
Ali Hamdi
74b36b0094 add cat 2017-06-09 12:09:20 +02:00
Ali Hamdi
57c733af67 add permutedims 2017-06-09 11:55:26 +02:00
Ali Hamdi
d78963ab27 add transpose 2017-06-09 07:38:07 +02:00
Ali Hamdi
a098158ff5 Merge branch 'master' of https://github.com/MikeInnes/Flux.jl into add-more-tf-ops 2017-06-09 07:15:20 +02:00
Mike J Innes
1cc8100456 ctx methods for seq models 2017-06-09 01:35:07 +01:00
Mike J Innes
020ae616cc custom mxnet context 2017-06-09 00:56:52 +01:00
Mike J Innes
fe0bddd98d pass args correctly 2017-06-09 00:55:54 +01:00
Ali Hamdi
c350bfb672 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops 2017-06-08 11:59:59 +02:00
Mike J Innes
4ccbbbb284 dup params fix 2017-06-08 10:49:39 +01:00
Mike J Innes
ab0f184d29 old import 2017-06-08 10:47:58 +01:00
Ali Hamdi
547ca64206 add det 2017-06-08 08:16:12 +02:00
Ali Hamdi
92616a8c3d add inv 2017-06-08 07:31:23 +02:00
Ali Hamdi
ee2e388228 add svd 2017-06-08 07:29:17 +02:00
Mike J Innes
dac80081ba these are probably the wrong abstraction 2017-06-06 18:03:36 +01:00
Mike J Innes
72e8e5c7f9 remove iterators 2017-06-06 18:03:14 +01:00
Mike J Innes
66c0403ffd seq rpad 2017-06-06 17:15:29 +01:00
Mike J Innes
e5df1c31dd batch vcat 2017-06-06 17:15:22 +01:00
Mike J Innes
3c52622896 more constructors 2017-06-06 16:58:57 +01:00
Mike J Innes
5186aa392f refactor 2017-06-06 15:59:10 +01:00
Mike J Innes
21fd79da16 fix 2017-06-05 22:51:08 +01:00
Mike J Innes
13f4af2492 generic and consistent conversions 2017-06-05 22:49:31 +01:00
Mike J Innes
bf3db99bc1 shared conversions 2017-06-05 19:04:14 +01:00
Mike J Innes
b2baf512e1 more code sharing 2017-06-05 18:46:36 +01:00
Mike J Innes
7505ee3a16 nicer display, share more code 2017-06-05 18:10:43 +01:00
Mike J Innes
ce6c96c2be rename 2017-06-05 18:00:44 +01:00
Mike J Innes
8e019e0f30 move batch and seq together 2017-06-05 17:51:30 +01:00
Mike J Innes
a7a46e3a82 that was what I wanted 2017-06-05 17:50:00 +01:00
Mike J Innes
957e23eb6b abstract batch type 2017-06-05 17:45:11 +01:00
Mike J Innes
da1697a257 fix juno import 2017-06-05 17:31:43 +01:00
Mike J Innes
1d7ac3ef3c tweak descriptions 2017-06-05 17:16:38 +01:00
Mike J Innes
2717ace397 split out core 2017-06-05 17:16:28 +01:00
Mike J Innes
9a460e12f2 make Batches submodule 2017-06-05 16:56:44 +01:00
Mike J Innes
cf8227c02f move @ Chain 2017-06-05 16:56:24 +01:00
Mike J Innes
099b9fbeb7 collect exports 2017-06-05 16:37:06 +01:00
Mike J Innes
415c5f6963 fix backend imports 2017-06-05 16:32:16 +01:00
Mike J Innes
ddcd576a74 give up and use AbstractArray 2017-06-05 16:09:06 +01:00
Mike J Innes
837173d65b model abstract is pretty useless 2017-06-05 16:08:23 +01:00
Mike J Innes
4685d2e672 strip down non-obvious exports 2017-06-05 15:47:26 +01:00
Mike J Innes
eafba7a55f graph for seqmodel 2017-06-03 15:52:43 +01:00
Mike J Innes
10abb64f4b fix stateful in backends 2017-06-02 16:28:31 +01:00
Mike J Innes
5232a3300c don't return hidden states 2017-06-02 16:22:55 +01:00
Mike J Innes
baaf9f2682 unrolled graphs only have one output 2017-06-02 16:22:39 +01:00
Mike J Innes
50bbb87674 stateless 2017-06-02 16:11:09 +01:00
Mike J Innes
d1f370a2f1 move stateful 2017-06-02 16:02:47 +01:00
Mike J Innes
2b85c76785 preserve original param states 2017-06-02 15:46:24 +01:00
Mike J Innes
af65b9200c mxnet repeated 2017-06-02 15:34:30 +01:00
Mike J Innes
4698cc0156 this might be a bit dirty 2017-06-02 15:22:23 +01:00
Mike J Innes
215e997540 broadcastto 2017-06-02 15:02:30 +01:00
Mike J Innes
edb1099cec more mxnet ops 2017-06-02 14:42:15 +01:00
Mike J Innes
e076bee00a mxnet broadcastto 2017-06-02 13:44:29 +01:00
Mike J Innes
4712568ac2 group tuple functions 2017-06-02 13:43:58 +01:00
Mike J Innes
4245f860e0 mxnet exp. 2017-06-02 13:42:59 +01:00
Mike J Innes
011449aa24 tuple sum 2017-06-02 13:40:38 +01:00
Mike J Innes
07c523a13d fix flatten/softmax batching behaviour 2017-06-01 19:28:02 +01:00
Mike J Innes
b54281bdea hadamard product 2017-06-01 19:27:46 +01:00
Mike J Innes
8cf5afdd79 add note 2017-06-01 18:37:20 +01:00
Mike J Innes
c7f8d86f9e fix mxnet 2017-06-01 16:57:39 +01:00
Mike J Innes
7a2a72a74a fix tensorflow 2017-06-01 16:56:48 +01:00
Mike J Innes
f7eb5179b1 fix basic interpreters 2017-06-01 16:55:48 +01:00
Mike J Innes
3532c7174f early throw on cyclic graphs 2017-06-01 16:52:49 +01:00
Mike J Innes
45c5502f90 obviate mapconst 2017-06-01 16:52:49 +01:00
Mike J Innes
c969d13306 mxnet vec 2017-05-31 18:23:33 +01:00
Mike J Innes
673abbcc1d mxnet hcat 2017-05-31 18:08:11 +01:00
Mike J Innes
033ed13f69 Merge pull request #38 from staticfloat/sf/act_on_your_feelings_young_one
Fix activation functions
2017-05-31 14:22:52 +01:00
Mike J Innes
b3e9dfa264 fix exec check 2017-05-30 18:04:31 +01:00
Mike J Innes
5654e15167 this is gone 2017-05-30 17:47:21 +01:00
Mike J Innes
790a58be1d support maps over tuples 2017-05-30 17:23:34 +01:00
Mike J Innes
d788cc8c54 remove old imap 2017-05-30 16:59:12 +01:00
Mike J Innes
5fa1eb4034 allow non-param states 2017-05-30 16:53:21 +01:00
Mike J Innes
b145b46cbb fix for recurrent anon models 2017-05-30 16:37:44 +01:00
Elliot Saba
dfa4dcccb5 Fix activation functions 2017-05-25 18:12:58 -07:00
Mike J Innes
e37973c3d5 Merge pull request #35 from oxinabox/patch-1
Replace deprecated `unpack` with `unstack`
2017-05-25 15:48:14 +01:00
Mike J Innes
fb8872b0b9 Merge pull request #24 from tkelman/nopiracy
Don't extend base functions on base types
2017-05-25 15:47:24 +01:00
Mike J Innes
9909af72a4 updates for new broadcasting approach 2017-05-24 12:02:03 +01:00
Lyndon White
d0fd9cd622 Replace deprecated unpack with unstack 2017-05-24 10:35:17 +08:00
Tony Kelman
41ea071f3a Use a package-local squeeze function instead of extending Base 2017-05-22 04:08:46 -04:00
Tony Kelman
5cbb47a13d Don't extend base functions on base types
better broadcast syntax
2017-05-22 04:05:57 -04:00
Mike J Innes
0ed4e65d2f refactor 2017-05-11 19:04:08 +01:00
ylxdzsw
1bd0a43b7d batch iterator 2017-05-11 19:03:53 +01:00
ylxdzsw
019e34194d catmat fix 2017-05-11 15:49:13 +01:00
Ali Hamdi
92d39f9ff7 dim defaults to nothing for reduction ops 2017-05-09 16:50:32 +02:00
Ali Hamdi
d5b073db74 add more basic math ops to tf backend 2017-05-09 14:39:10 +02:00
Ali Hamdi
eda5c2c776 add .^ and reduction ops for tf backend 2017-05-09 01:29:15 +02:00
Mike J Innes
bd97c0f93b strip away params 2017-05-04 17:01:10 +01:00
Mike J Innes
5be9ce45d8 support constant arrays in MXNet 2017-05-04 15:09:18 +01:00
Mike J Innes
a2db4b5319 factor out common tests 2017-05-04 13:52:31 +01:00
Mike J Innes
70286c0bf3 src docs 2017-05-04 13:17:21 +01:00
Mike J Innes
e04dcbd460 reorganise recurrent stuff 2017-05-04 10:45:44 +01:00
Mike J Innes
c025cddc73 runmodel no longer needed 2017-05-04 10:32:53 +01:00
Mike J Innes
51bf904179 better return value 2017-05-03 19:13:33 +01:00
Mike J Innes
c9dcc815dc add this for convenience 2017-05-03 18:33:23 +01:00
Mike J Innes
8c10a95bf4 more appropriate naming 2017-05-03 18:33:14 +01:00
Mike J Innes
d7ff193ad6 basic back pass 2017-05-01 19:44:58 +01:00
Mike J Innes
af30ee78d2 remove old code 2017-05-01 19:44:26 +01:00
Mike J Innes
b35f50571c use params instead of vars 2017-05-01 18:27:52 +01:00
Mike J Innes
796d7d7e99 recurrent models in tf 2017-05-01 18:05:17 +01:00
Mike J Innes
3998be2244 tuple i/o tests 2017-05-01 17:44:20 +01:00
Mike J Innes
2467ca4187 tuple inputs in tensorflow 2017-05-01 17:41:42 +01:00
Mike J Innes
357f989de5 pull out tuple utils 2017-05-01 16:57:51 +01:00
Mike J Innes
2934607115 multi-tuple output 2017-05-01 16:30:24 +01:00
Mike J Innes
f5b0f3e530 identity function 2017-05-01 16:28:39 +01:00
Mike J Innes
ba54552be5 remove special-cased training code
splits model and exec to allow multi inputs
2017-05-01 16:12:46 +01:00
Mike J Innes
d4ee8a6a2a use juno's info function 2017-05-01 15:14:29 +01:00
Mike J Innes
b19e31714d allow access to loop vars 2017-05-01 14:00:39 +01:00
Mike J Innes
33c6781c99 remove from training.jl 2017-05-01 13:47:08 +01:00
Mike J Innes
6778d00dbf this is no longer specific to training 2017-05-01 13:46:23 +01:00
Mike J Innes
d095c640c9 this should be a general util 2017-05-01 13:45:24 +01:00
Mike J Innes
eae13c533f training callbacks 2017-05-01 13:43:07 +01:00
Mike J Innes
5dce8df678 fix accuracy for batches 2017-05-01 13:40:11 +01:00
Mike J Innes
dba6bef245 this should be progress'd too 2017-05-01 12:46:02 +01:00
Mike J Innes
38852964f6 organise training and utils 2017-05-01 12:41:54 +01:00
Mike J Innes
0e6bb17709 import this in compiler 2017-05-01 12:31:00 +01:00
Mike J Innes
36f4103d7d cost is really a layer 2017-05-01 12:30:28 +01:00
Mike J Innes
7425584c91 more correct 2017-05-01 11:33:24 +01:00
Mike J Innes
7415e61287 remove mse!, export logloss 2017-05-01 11:28:41 +01:00
Mike J Innes
360d128b1a logloss 2017-04-28 17:14:27 +01:00
Mike J Innes
ef4ec5be4b customisable loss 2017-04-28 17:14:21 +01:00
Mike J Innes
63b328142a print epochs again 2017-04-27 17:43:38 +01:00
Mike J Innes
121af0579a update for rnn wrappers 2017-04-27 17:27:08 +01:00
Mike J Innes
f4acd286c8 fix tensorflow dep 2017-04-27 12:48:11 +01:00
Mike J Innes
52a7199d10 gradients for recurrent models 2017-04-26 17:42:47 +01:00
Mike J Innes
19cf3e2b62 split out runseq 2017-04-19 17:33:55 +01:00
Mike J Innes
edfb0211e6 better for nested batches 2017-04-19 17:18:40 +01:00
Mike J Innes
14afe54143 fixes for recurrent networks 2017-04-19 17:17:37 +01:00
Mike J Innes
358334a893 mxnet back! for multi outputs 2017-04-19 17:13:57 +01:00
Mike J Innes
42a8117704 seqmodel with raw arrays 2017-04-19 15:13:17 +01:00
Mike J Innes
f8a3b02c1d fix tf train 2017-04-19 14:48:10 +01:00
Mike J Innes
42688f8aa8 update training process, mnist example 2017-04-19 14:23:48 +01:00
Mike J Innes
f6334e7e96 still need batchone 2017-04-19 13:26:37 +01:00
Mike J Innes
5357b1e9f9 remove fake batching vestiges 2017-04-19 13:19:18 +01:00
Mike J Innes
228f7d487c remove fake batch semantics 2017-04-18 21:04:21 +01:00
Mike J Innes
88cf6d9e61 sensible default for unsqueeze 2017-04-18 20:55:59 +01:00