Commit Graph

1356 Commits

Author SHA1 Message Date
Mike J Innes
e3b31b9b87
Merge pull request #169 from jessebett/jessechanges
Reshape with Tuple Dimensions and Kronecker Product
2018-02-16 14:16:42 +00:00
Mike J Innes
60f21d3ff2 don't override base method 2018-02-16 14:15:40 +00:00
Mike J Innes
5e861101f3 epochs util 2018-02-16 11:17:57 +00:00
Mike J Innes
7aa6854c64 more correct 2018-02-16 00:06:15 +00:00
Mike J Innes
ee3784964e fix for external modules 2018-02-15 22:27:00 +00:00
Mike J Innes
63862c2324 easier initialisation with weights 2018-02-15 20:52:29 +00:00
Mike J Innes
01c31e7fcc conv bias 2018-02-15 20:15:41 +00:00
Mike J Innes
bdd07a8bc6 fix 2018-02-14 22:34:11 +00:00
Mike J Innes
1b8b1cd7b1 check params by identity 2018-02-14 21:00:50 +00:00
Mike J Innes
5ea0ef6764 tracker fix 2018-02-13 16:15:36 +00:00
Mike J Innes
1baa7227e3 reorganise batches 2018-02-13 16:05:07 +00:00
Mike J Innes
34217b1fa2 Merge branch 'treebank' 2018-02-13 15:44:27 +00:00
Mike J Innes
49584fb72b rm logsigmoid 2018-02-13 14:52:29 +00:00
Mike J Innes
2f29733888 Merge branch 'master' into HEAD 2018-02-13 14:45:37 +00:00
Mike J Innes
8432d8db06 batchnorm fix 2018-02-13 14:02:35 +00:00
Mike J Innes
820cd3ae42 fixes #164 2018-02-13 13:31:35 +00:00
Mike J Innes
066cb45a38 remove old accuracy fn 2018-02-13 11:12:21 +00:00
Mike J Innes
236edbffec fixes #111 2018-02-13 10:20:38 +00:00
Mike J Innes
f22cfb5b43 re-enable printf 2018-02-12 15:05:09 +00:00
Mike J Innes
334ae9e1cb fixes #171 2018-02-12 12:31:15 +00:00
Mike J Innes
0b3c02fe8d document regularisation, fixes #160 2018-02-09 19:00:26 +00:00
Mike J Innes
0e0057b0c4 basics 2018-02-09 13:51:07 +00:00
jessebett
f84ee8eab0 reshape with tupled dimensions and kronecker product 2018-02-08 14:27:57 -05:00
Mike J Innes
70fbbf48fa humble beginnings of compiler 2018-02-08 18:11:26 +00:00
Mike J Innes
fc157a8c59 TrackedNumber -> TrackedReal 2018-02-08 17:18:40 +00:00
Mike J Innes
d1c56ca768 number fix 2018-02-08 17:04:48 +00:00
Mike J Innes
0f7a1ec022 test params funct 2018-02-08 16:13:20 +00:00
Mike J Innes
961de2ba44
Merge pull request #161 from FluxML/curnn
WIP: CUDNN RNNs
2018-02-08 13:06:52 +00:00
Iblis Lin
f7fdfbe3a9 fix params 2018-02-08 12:56:10 +00:00
Mike J Innes
fcbdc49d6b fix reserve usage 2018-02-08 10:27:26 +00:00
Mike J Innes
bc452fcd81 rewrite tests 2018-02-08 02:37:55 +00:00
Mike J Innes
d592f4e327 batch support 2018-02-08 01:45:48 +00:00
Mike J Innes
b8f148b012 hook up backward passes 2018-02-08 00:49:39 +00:00
Mike J Innes
a1d1930097 Merge branch 'master' into curnn 2018-02-07 23:23:02 +00:00
Mike J Innes
4511936a87 fixes #116 2018-02-07 23:21:04 +00:00
Mike J Innes
0ac924e8e1 fixups 2018-02-07 22:52:46 +00:00
Mike J Innes
39f7f8fdf3 tracked tuples 2018-02-07 22:21:42 +00:00
Mike J Innes
79e4e25fea seperate number type 2018-02-07 20:39:36 +00:00
Mike J Innes
282889970d seperate tracking infrastructure from array wrapper 2018-02-07 17:43:25 +00:00
Mike J Innes
30b3437c56 backward passes 2018-02-06 18:56:17 +00:00
Mike J Innes
f866fbe575 nullable c refactor 2018-02-06 15:01:48 +00:00
Mike J Innes
07e1b1e0a9 avoid val 2018-02-06 12:44:18 +00:00
boathit
7e37a96c6f Register back! for logsigmoid and implement (logit)binarycrossentropy 2018-02-06 19:36:16 +08:00
boathit
6e65789828 Register back! for logsigmoid and implement (logit)binarycrossentropy 2018-02-06 19:32:46 +08:00
Mike J Innes
a4bf5936b0 diagm 2018-02-05 18:29:35 +00:00
Mike J Innes
2fec75005d
Merge pull request #123 from GenaBitu/cat-fix
Added vcat for multiple TrackedVectors
2018-02-05 18:10:48 +00:00
Mike J Innes
47cebab26e test multiple inputs/dims 2018-02-05 18:09:54 +00:00
Mike J Innes
2a2475a9c2 get tracker graph 2018-02-05 17:40:07 +00:00
Mike J Innes
14086b8c2d train forward pass 2018-02-02 17:48:08 +00:00
Mike J Innes
9a6fcf057b hook up interface 2018-02-02 16:42:18 +00:00
Mike J Innes
b1c5786012 Merge branch 'master' into curnn 2018-02-02 15:56:44 +00:00
Mike J Innes
49e1e78f67 make data/value available 2018-02-02 15:56:04 +00:00
Mike J Innes
0f1e7b5578 update rnn structure 2018-02-01 20:57:39 +00:00
Mike J Innes
106502a75d typo 2018-01-31 21:57:04 +00:00
Mike J Innes
af3ccf85ff coagulate gates 2018-01-31 16:56:27 +00:00
Mike J Innes
4bfb603da6 gru forward 2018-01-31 13:46:55 +00:00
Mike J Innes
b1bb05403c basic forward pass 2018-01-30 18:18:37 +00:00
Mike J Innes
0b886507dc param offsets 2018-01-30 14:43:39 +00:00
Mike J Innes
af0c5523ff rnnTrainingReserveSize 2018-01-30 14:43:39 +00:00
Mike J Innes
3fb83d642d rnnWorkspaceSize 2018-01-30 14:43:39 +00:00
Mike J Innes
6b4e114d5d rnnParamSize 2018-01-30 14:43:39 +00:00
Mike J Innes
ee6c3e18a9 basic RNNDesc 2018-01-30 14:43:39 +00:00
Mike J Innes
842bf03051 typo 2018-01-30 14:43:05 +00:00
Mike J Innes
0c9549c469 rm lazy 2018-01-24 13:28:52 +00:00
Mike J Innes
5118ef9163 remove batching work for now 2018-01-24 13:12:38 +00:00
boathit
374d7a5f1e Registering backward function for logsoftmax 2018-01-21 15:20:59 +08:00
Mike J Innes
72eabde373 load data 2018-01-17 16:39:55 +00:00
Mike J Innes
bd57359535 docstrings 2018-01-17 16:12:12 +00:00
Mike J Innes
8cca7accf2 mnist 2018-01-17 15:55:37 +00:00
Mike J Innes
4207fb98f2 basic GPU tests 2018-01-16 17:58:14 +00:00
GenaBitu
bc8a32bc56
Merge branch 'master' into cat-fix 2018-01-16 11:01:31 +01:00
Mike J Innes
1beb30e19a closes #118 2018-01-15 17:00:47 +00:00
Mike J Innes
8f8589a7f4 fix initialisation 2018-01-10 14:11:52 +00:00
Mike J Innes
b44237468e Merge branch 'master' into gru 2018-01-10 13:59:33 +00:00
Mike J Innes
805cb9178f fixes #146 2018-01-10 12:48:50 +00:00
Mehul Tikekar
2fef799109 fix typo in conv.jl (fixes #133) 2018-01-08 16:46:58 -05:00
Mike J Innes
468f641f66 use Adapt 2018-01-08 16:34:22 +00:00
Mike J Innes
98b362729d pool padding 2017-12-18 18:18:14 +00:00
Mike J Innes
e3577d759c conv docs 2017-12-18 18:05:48 +00:00
Mike J Innes
269d8f36b9 conv padding 2017-12-18 18:05:38 +00:00
Mike J Innes
51f93d9f0e conv polish 2017-12-15 16:24:45 +00:00
Mike J Innes
386eafc443 reshape 2017-12-15 16:18:16 +00:00
Mike J Innes
73ae25289d remove old util 2017-12-15 16:18:01 +00:00
Mike J Innes
6890a61587 todo 2017-12-15 16:17:45 +00:00
Mike J Innes
9b833a4345 more onehot indexing 2017-12-15 16:17:39 +00:00
Mike J Innes
9d0dd9fb7e layer wip 2017-12-15 13:22:57 +00:00
Mike J Innes
0bf22dfb8e pool gradients 2017-12-15 02:29:14 +00:00
Mike J Innes
d949b31aa5 conv gradient 2017-12-15 02:24:32 +00:00
Mike J Innes
5b97d2ba04 closes #127 2017-12-13 18:24:56 +00:00
Mike J Innes
95d1287455 Merge branch 'master' into jacobian 2017-12-13 17:06:23 +00:00
Mike J Innes
27d896943e
Merge pull request #120 from staticfloat/sf/dense_initialization
Better default initialization for Dense layers
2017-12-13 16:18:02 +00:00
Mike J Innes
e3a688e706 use kwarg 2017-12-13 15:27:15 +00:00
Mike J Innes
128725cefd Merge branch 'master' into sf/weighted_crossentropy 2017-12-13 15:14:47 +00:00
Mike J Innes
29787eba45 fixes #114 2017-12-12 17:23:15 +00:00
Mike J Innes
b7b6c975bc fixes #110 2017-12-12 17:07:39 +00:00
Mike J Innes
403cc26327 Merge branch 'master' into gru 2017-12-12 16:54:00 +00:00
Mike J Innes
86097e76fd tweak batchnorm example 2017-12-08 19:34:34 +00:00
Mike J Innes
6f997e798a Merge branch 'master' into batchnorm 2017-12-08 19:31:50 +00:00
Mike J Innes
1d916c81b5 Merge branch 'master' into HEAD 2017-12-08 18:31:55 +00:00
Mike J Innes
24a6569589 Merge branch 'master' into amsgrad 2017-12-08 18:20:53 +00:00
Mike J Innes
f82dbf4798 Merge branch 'master' into HEAD 2017-12-08 17:00:31 +00:00
Mike J Innes
951c21366a fix regex 2017-12-08 16:42:30 +00:00
GenaBitu
7e51418679
Added back for multi-parameter vcat 2017-12-08 16:10:09 +01:00
baggepinnen
385dee9d16 Add jacobian function 2017-12-08 14:46:12 +01:00
GenaBitu
41f3eedc39
Proper multi-variable vcat 2017-12-07 17:50:18 +01:00
Elliot Saba
41446d547f Add weighted_crossentropy for imbalanced classification problems 2017-12-05 17:09:05 -08:00
Elliot Saba
c59b820bed Add glorot (Xavier) initialization
Set default `Dense` and `RNN` inits to `glorot_uniform()` for `W`, `zeros` for `b`.
2017-12-05 14:24:48 -08:00
GenaBitu
62b3600eca
Merge branch 'master' into cat-fix 2017-12-05 11:13:29 +01:00
baggepinnen
41febee9c1 Export and indent 2017-12-04 09:34:27 +01:00
baggepinnen
36001d085a Implement AMSGrad optimiser 2017-12-04 09:17:05 +01:00
Mike J Innes
cab235a578 gpu compat 2017-11-30 13:51:31 +00:00
Mike J Innes
19039f4881 export sigmoid 2017-11-30 13:37:38 +00:00
Mike J Innes
2d33f19346 onehot unk arg 2017-11-29 16:45:50 +00:00
baggepinnen
fa718c7475 Implement Gated Recurrent Unit 2017-11-24 14:33:06 +01:00
CarloLucibello
13b934c250 improve optimizers 2017-11-24 12:12:20 +01:00
Mike J Innes
dc1f08a709
Merge pull request #98 from FluxML/log
GPU-ready log function
2017-11-23 17:17:39 +00:00
Mike J Innes
9f5c4dd3e9
Merge pull request #104 from baggepinnen/patch-1
Allow array of optimisers to train!
2017-11-21 17:16:35 +01:00
Mike J Innes
351d3d4771 std derivative 2017-11-21 17:04:04 +01:00
Mike J Innes
b06884b912 LayerNorm tweaks 2017-11-21 16:32:36 +01:00
skariel
11d53781b2 adding layer normalization 2017-11-21 16:30:24 +01:00
Mike J Innes
979949d01a style 2017-11-21 15:25:09 +01:00
Fredrik Bagge Carlson
8991ce028c
Fix bug in rmsprop and adadelta
`@. p.Δ = η * p.Δ / √acc` parses correctly while `@. p.Δ /= √acc*η` seems to parse like `@. p.Δ /= (√acc*η)`, hence the step size was de facto interpreted as `1/η`
2017-11-14 17:32:16 +01:00
Mike J Innes
e0657d93ec mv numeric.jl to nnlib 2017-11-09 15:06:29 +00:00
Mike J Innes
2cb94981a0 gpu-ready log 2017-11-09 15:04:01 +00:00
Mike J Innes
e5d99d784e fixes #79 2017-11-09 14:53:26 +00:00
Mike J Innes
ccdc046546 fixes #79 2017-11-09 14:52:28 +00:00
Mike J Innes
752a9e2808 tree utilities 2017-11-08 22:19:01 +00:00
Mike J Innes
6eb2ec154b sentiment treebank loader 2017-11-08 22:19:01 +00:00
Mike J Innes
8777362eee exports 2017-11-08 22:19:01 +00:00
Mike J Innes
8b05317895 basic tree 2017-11-08 22:19:01 +00:00
Mike J Innes
7e9468d8f8 treebank skeleton 2017-11-08 22:19:01 +00:00
Mike J Innes
fcd091e8f0 Ac_mul_B derivatives 2017-11-08 22:18:45 +00:00
Mike J Innes
d4229c4815 useful params method 2017-11-08 22:18:45 +00:00
Mike J Innes
d6423eefe5 matrix-vector fast path 2017-11-08 22:18:45 +00:00
Fredrik Bagge Carlson
97244e0a68
Allow array of optimisers to train!
This allows an array of optimisers to be sent to `train!`
2017-11-04 13:27:32 +01:00
Mike J Innes
efa51f02e7 basic batch type 2017-11-02 11:49:42 +00:00
Mike J Innes
21ea93ffcd rename treelike 2017-11-02 11:47:34 +00:00
Iblis Lin
6c7613e02b batchnorm: leverage TrackedArray mean 2017-11-02 14:20:34 +08:00
Iblis Lin
88bd8a8fbd batchnorm: make CuArrays happy 2017-11-02 14:02:41 +08:00
Iblis Lin
477da75428 batchnorm: fix mapchildren 2017-11-02 13:32:12 +08:00
Iblis Lin
5253841acc batchnorm: update docs 2017-11-02 13:32:12 +08:00
Iblis Lin
b3356cc6bb batchnorm: batch σ correct coefficient 2017-11-02 13:32:12 +08:00
Iblis Lin
e0201be770 batchnorm: parameterize momentum and epsilon 2017-11-02 13:32:12 +08:00
Iblis Lin
669273b008 layer: implement BatchNorm layer
See [Batch Normalization: Accelerating Deep Network Training by Reducing
Internal Covariate Shift](https://arxiv.org/pdf/1502.03167.pdf)
2017-11-02 13:32:12 +08:00
Mike J Innes
e7a510da9a add cmudict dataset 2017-11-01 16:01:55 +00:00
Mike J Innes
0f8ba87dc6 treelike tuples 2017-10-31 16:37:41 +00:00
Mike J Innes
e943a39ee7 combine special cases 2017-10-31 16:37:33 +00:00
Iblis Lin
3d8b7250ae add scalar mean 2017-10-31 10:42:32 +00:00
Iblis Lin
c43bda019b TrackedArray: implement mean
```julia
julia> p
Tracked 2×3 Array{Float64,2}:
 1.0  3.0  5.0
 2.0  4.0  6.0
```

Before
```julia
julia> @benchmark Flux.Tracker.back!(sum($p, 2) ./ size($p, 2), ones(2, 1))
BenchmarkTools.Trial:
  memory estimate:  3.44 KiB
  allocs estimate:  75
  --------------
  minimum time:     20.438 μs (0.00% GC)
  median time:      21.239 μs (0.00% GC)
  mean time:        22.354 μs (1.68% GC)
  maximum time:     3.811 ms (98.51% GC)
  --------------
  samples:          10000
  evals/sample:     1
```

After
```julia
julia> @benchmark Flux.Tracker.back!(mean($p, 2), ones(2, 1))
BenchmarkTools.Trial:
  memory estimate:  1008 bytes
  allocs estimate:  21
  --------------
  minimum time:     5.973 μs (0.00% GC)
  median time:      6.310 μs (0.00% GC)
  mean time:        6.630 μs (1.96% GC)
  maximum time:     680.709 μs (97.28% GC)
  --------------
  samples:          10000
  evals/sample:     6
```
2017-10-30 16:21:02 +08:00
Mike J Innes
4c1b1eb18c Merge pull request #92 from CarloLucibello/drop
add Dropout layer
2017-10-26 12:07:28 +01:00
Mike J Innes
84efbbcc84 tracker predicate tweaks 2017-10-26 12:06:29 +01:00
Mike J Innes
cf6b930f63 reorganise 2017-10-26 11:46:12 +01:00
Mike J Innes
0df300299f clearer error message, fixes #93 2017-10-26 11:15:14 +01:00
GenaBitu
df06c3351d
Merge branch 'master' into cat-fix 2017-10-26 00:52:29 +02:00
CarloLucibello
711ea09d99 address comments 2017-10-25 02:35:27 +02:00
CarloLucibello
536ab3861d setmode! -> testmode! 2017-10-23 16:23:29 +02:00
CarloLucibello
00a9e5f01f construct TrackedScalar with params(1) 2017-10-23 10:49:45 +01:00
CarloLucibello
86c7c9246e add == and < for tracked arrays 2017-10-23 11:41:08 +02:00
CarloLucibello
2e1ed4c3fc add dropout 2017-10-23 10:12:53 +02:00
Mike J Innes
2a66545ef8 rnn state reset 2017-10-19 17:21:08 +01:00
Mike J Innes
99a7697d13 adam eta default arg 2017-10-19 14:31:34 +01:00
Mike J Innes
e5c8f6d835 only export known good optimisers 2017-10-19 11:26:11 +01:00
Mike J Innes
5b6a5667ed tracked array restructure 2017-10-18 22:54:58 +01:00
Mike J Innes
c8d4844da4 chunk util 2017-10-18 17:07:58 +01:00
Mike J Innes
07ad7cfa40 learning rate as default arg 2017-10-18 17:07:49 +01:00
Mike J Innes
e82428bb83 batching docs 2017-10-18 16:40:14 +01:00
Mike J Innes
b817ce632c syntax highlighting 2017-10-18 15:44:06 +01:00
Mike J Innes
fd249b773e rnn docs 2017-10-18 15:30:05 +01:00
Mike J Innes
190f48a709 nnlib docs 2017-10-18 14:40:58 +01:00
Mike J Innes
12944ae125 nnlib exports 2017-10-18 12:56:58 +01:00
Mike J Innes
0fbc8dff61 typoe 2017-10-18 12:48:58 +01:00
Mike J Innes
d6dd27dae5 dense layer example 2017-10-18 12:47:45 +01:00
Mike J Innes
7426faf37d optimiser docs 2017-10-18 12:09:48 +01:00
CarloLucibello
041079237e add docsting to train! 2017-10-17 21:04:18 +01:00
CarloLucibello
6d3a2a2210 change argument name for better clarity 2017-10-17 21:04:18 +01:00
Mike J Innes
23674b2555 logitcrossentropy tweaks 2017-10-17 17:58:32 +01:00
pevnak
4aa7741ba9 logit cross entropy 2017-10-17 17:57:46 +01:00
Mike J Innes
6dff8ca8d3 rename crossentropy loss 2017-10-17 17:36:18 +01:00
Mike J Innes
1800c8f523 deprecate mapparams 2017-10-17 17:35:30 +01:00
Mike J Innes
949fd9ba97 loss function tweaks 2017-10-17 17:30:11 +01:00
Mike J Innes
c764b74eba rename and fix mapleaves 2017-10-17 01:08:15 +01:00
Mike J Innes
7aa0b43ceb onehot sanity check 2017-10-17 00:07:58 +01:00
Mike J Innes
e02e320008 more general fmap 2017-10-17 00:07:15 +01:00
Mike J Innes
64e242e96c export param 2017-10-16 08:53:46 +01:00
Mike J Innes
d3db051ca0 flip 2017-10-16 08:53:39 +01:00
Mike J Innes
9a155abecd batch and batchseq apis 2017-10-15 23:44:40 +01:00
Mike J Innes
646720cd05 fix 2017-10-15 23:44:16 +01:00
Mike J Innes
c6556a29e6 order-stable params 2017-10-10 12:16:32 +01:00
GenaBitu
ef6d10886d Exposed all optimisers 2017-10-06 14:20:09 +01:00
GenaBitu
2084df96ae
Merge branch 'master' into cat-fix 2017-10-06 15:00:26 +02:00
pevnak
bfcc1ac25d exposing optimisers 2017-10-05 12:36:18 +01:00
Mike J Innes
1abc4febe6 more general adaptors 2017-10-04 18:55:56 +01:00
Dave Kleinschmidt
2b95aff158 actually use init argument in LSTMCell 2017-10-03 19:26:42 +01:00
Mike J Innes
5fd1b7d9a2 remove gc hack 2017-10-02 20:50:18 +01:00
Mike J Innes
1b91e6b38d store onehotmatrix height 2017-10-02 20:50:11 +01:00
Mike J Innes
7c8dba0b85 gc in training loop 2017-09-27 23:14:58 +01:00
Mike J Innes
a32ae4914c onehotmatrix cuda support 2017-09-27 22:51:00 +01:00
Mike J Innes
a60a754d68 beginnings of gpu support 2017-09-27 21:58:34 +01:00
Mike J Innes
120a6db2bb Merge branch 'master' of github.com:MikeInnes/Flux.jl 2017-09-27 21:16:23 +01:00
Mike J Innes
4bafa2b374 generic tree functions 2017-09-27 21:11:21 +01:00
Mike J Innes
2ec8401d2c remove compiler 2017-09-27 20:48:39 +01:00
Mike J Innes
94e38c05b8 more informative 2017-09-27 18:33:23 +01:00
GenaBitu
136f9bbf74
Hack which doesn't break backprop 2017-09-22 11:47:04 +02:00
GenaBitu
a5fe5b6e65
Added multi-variable vcat for TrackedVector 2017-09-22 11:22:21 +02:00
Mike J Innes
f2052739c1 tweaks 2017-09-12 14:11:03 +01:00
Mike J Innes
a3fe89e348 rnn tanh by default 2017-09-12 13:12:25 +01:00
Mike J Innes
6728295355 Merge pull request #63 from JobJob/rnncell-args
Enables passing an activation function to RNN/RNNCell
2017-09-12 13:10:43 +01:00
Mike J Innes
28bbef81b9 f 2017-09-12 13:06:32 +01:00
Mike J Innes
972ecab9f9 rm Over Seq 2017-09-12 13:03:16 +01:00
Joel Mason
00439555d1 Enables passing an activation function to RNN/RNNCell
Also, fixes it not using the `init` function provided
2017-09-12 20:54:56 +10:00
Mike J Innes
b9652f1812 typo, fixes #61 2017-09-12 10:45:07 +01:00
Mike J Innes
5f24d61ba3 important 2017-09-11 14:10:12 +01:00
Mike J Innes
7041ab9960 rm chainseq 2017-09-11 14:02:43 +01:00
Mike J Innes
c80fb999ff one hot docs 2017-09-11 13:40:11 +01:00
Mike J Innes
3f83be7bb7 more flexible training loop 2017-09-11 13:11:55 +01:00
Mike J Innes
33a5d26e57 chain utility note 2017-09-09 20:02:48 -04:00
Mike J Innes
fedee95b14 docs updates 2017-09-09 19:58:32 -04:00
Mike J Innes
a36d6d2af3 layer docs 2017-09-08 17:52:41 -04:00
Mike J Innes
f55b8cd20e track -> param 2017-09-07 15:13:04 -04:00
Mike J Innes
085d3aa9b4 handle epoch elsewhere 2017-09-07 00:29:55 -04:00
Mike J Innes
aeaa138b6d cb convenience 2017-09-07 00:27:16 -04:00
Mike J Innes
e837bb0745 rnn stuff 2017-09-07 00:27:04 -04:00
Mike J Innes
a93c440c1e style 2017-09-06 23:09:39 -04:00
Mike J Innes
cca4d25a10 efficient traversal 2017-09-06 23:09:32 -04:00
Mike J Innes
7cfc42d166 grad refactor 2017-09-06 21:21:35 -04:00
Mike J Innes
3ef72a9d7b utils updates 2017-09-06 18:59:43 -04:00
Mike J Innes
4083c34547 seq stuff 2017-09-06 18:59:36 -04:00
Mike J Innes
1855a37319 onehot 2017-09-06 18:58:55 -04:00
Mike J Innes
d7e3f7d6e1 fix stack/squeeze usage 2017-09-06 16:02:38 -04:00
Mike J Innes
1946c46e29 basic seq functionality 2017-09-06 14:03:25 -04:00
Mike J Innes
2c8b7bc64b remove these for now 2017-09-06 14:03:12 -04:00
Mike J Innes
4c12d18033 yet another vcat method 2017-09-05 19:25:42 -04:00
Mike J Innes
c95e9376a5 constructors 2017-09-05 19:25:34 -04:00
Mike J Innes
b023da1b7d lstm initialisation 2017-09-05 02:42:32 -04:00
Mike J Innes
61de692b50 lstm nonzero hidden state 2017-09-05 02:37:48 -04:00
Mike J Innes
ec02f1fabd batching in rnns 2017-09-05 02:29:31 -04:00
Mike J Innes
830d7fa611 vcat fix 2017-09-05 02:28:11 -04:00
Mike J Innes
363caeddc6 repmat forward 2017-09-05 02:12:53 -04:00
Mike J Innes
a322c07fc8 vcat back 2017-09-05 02:11:28 -04:00
Mike J Innes
788d7d35f0 better numeric grads 2017-09-03 17:10:35 -04:00
Mike J Innes
8f4ccdd5ba scalar getindex backprop 2017-09-03 17:10:23 -04:00
Mike J Innes
47ba702747 tweak optimiser interface 2017-09-03 17:10:04 -04:00
Mike J Innes
d4211b1f23 sgd export 2017-09-03 17:09:53 -04:00
Mike J Innes
f33a8edd25 meh 2017-09-03 02:45:46 -04:00
Mike J Innes
e57ae77bbb juno progress 2017-09-03 02:44:32 -04:00
Mike J Innes
bd5822fd71 cleaner lstm 2017-09-03 02:24:47 -04:00
Mike J Innes
cf58748680 nicer trackedarray type printing 2017-09-03 02:12:54 -04:00
Mike J Innes
9642ae8cd6 basic recurrence 2017-09-03 02:12:44 -04:00
Mike J Innes
f6771b98cd clearer name for dense 2017-09-02 16:50:11 -04:00
Mike J Innes
fe2b35facc add callbacks back 2017-09-01 23:59:44 -04:00
Mike J Innes
bf098d551c fuck 2017-09-01 23:41:44 -04:00
Mike J Innes
107d9daa8f add some non-differentiable functions 2017-09-01 23:33:05 -04:00
Mike J Innes
387686eb41 optimisers rework 2017-09-01 17:06:51 -04:00
Mike J Innes
892a779ed1 tracked transpose 2017-09-01 11:42:18 -04:00
Mike J Innes
b95dae1868 opt refactor 2017-08-31 14:55:23 -04:00
Mike J Innes
7cd13789dd fix removed import 2017-08-29 17:14:01 -04:00
ylxdzsw
97ecb26003 wip optimisers 2017-08-29 17:00:24 -04:00
Mike J Innes
7bba38274b Merge branch 'master' of github.com:MikeInnes/Flux.jl 2017-08-28 01:41:11 +01:00
Mike J Innes
0b89e1374c gpu-friendly 2017-08-28 01:40:59 +01:00
Mike J Innes
73166c52a0 cleaner broadcasting fix 2017-08-27 09:49:42 +01:00
Mike J Innes
12dc6b66c5 whoops 2017-08-24 22:23:05 +01:00
Mike J Innes
52f5f4a4c0 initial cuarrays integration 2017-08-24 17:00:48 +01:00
Mike J Innes
e7f26370d7 training tweaks 2017-08-24 16:10:04 +01:00
Mike J Innes
1526b13691 basic training loop 2017-08-24 11:42:29 +01:00
Mike J Innes
9ce0439943 better mse 2017-08-24 11:40:51 +01:00
Mike J Innes
23690e0083 not useful enough 2017-08-24 11:40:19 +01:00
Mike J Innes
d162e028bb utility method 2017-08-23 17:50:49 +01:00
Mike J Innes
e4e9794f5e loss function gradients 2017-08-23 17:50:43 +01:00
Mike J Innes
60c3090981 broadcasting fix 2017-08-23 17:21:02 +01:00
Mike J Innes
23c5a1b163 softmax gradient 2017-08-23 02:03:17 +01:00
Mike J Innes
5eee653a64 gradient checks 2017-08-23 01:43:45 +01:00
Mike J Innes
56ed6f5680 de-broadcasting 2017-08-23 00:25:19 +01:00
Mike J Innes
bafecfede1 sgd 2017-08-22 22:25:18 +01:00
Mike J Innes
f2dd7b0e90 fix include case 2017-08-22 17:18:27 +01:00
Mike J Innes
0ce8c0cee4 param collection 2017-08-22 17:13:03 +01:00
Mike J Innes
1179269355 remove old params 2017-08-22 15:21:08 +01:00
Mike J Innes
f43eef8d7d consistent case 2017-08-22 15:13:24 +01:00
Mike J Innes
cd45df1eca vector sum 2017-08-22 15:12:12 +01:00
Mike J Innes
97af9db181 scalar sum 2017-08-22 12:24:08 +01:00
Mike J Innes
ef681f16ea use nnlib for activations 2017-08-21 17:53:04 +01:00
Mike J Innes
3e0f45046c nicer show 2017-08-21 17:20:09 +01:00
Mike J Innes
227e41c37b gpu-friendly dualify 2017-08-21 16:35:39 +01:00
Mike J Innes
65a49188e6 mat-vec mul 2017-08-20 13:48:43 +01:00
Mike J Innes
09974caba0 extend affine with activation 2017-08-20 13:35:35 +01:00
Mike J Innes
8e59160df6 inferable broadcast 2017-08-20 13:35:20 +01:00
Mike J Innes
18e69b33c9 forwarddiff does these 2017-08-19 22:05:50 +01:00
Mike J Innes
ad0e0ea5a7 explicitly broadcast sigmoid 2017-08-19 22:04:47 +01:00
Mike J Innes
4a9dc40e7c simplify organisation 2017-08-19 20:52:29 +01:00
Mike J Innes
318e503d9b remove rnns 2017-08-19 20:52:17 +01:00
Mike J Innes
a581856954 remove params from compiler 2017-08-19 20:38:45 +01:00
Mike J Innes
b18d8cd08f rename chain.jl 2017-08-19 20:26:07 +01:00
Mike J Innes
0222103c7f fix these later 2017-08-19 20:05:22 +01:00
Mike J Innes
bd6bffde48 silo the compiler 2017-08-19 20:04:21 +01:00
Mike J Innes
8ed4d569b3 export from flux 2017-08-19 18:23:21 +01:00
Mike J Innes
f12b1d0ca1 avoid back! fallback 2017-08-19 17:40:07 +01:00
Mike J Innes
c9eb58f146 printing tweaks 2017-08-19 16:38:40 +01:00
Mike J Innes
de76472d99 exports 2017-08-19 16:20:53 +01:00
Mike J Innes
6c1a38e157 some more derivatives 2017-08-19 16:02:19 +01:00
Mike J Innes
380d32dac9 remove Core for now 2017-08-19 16:01:32 +01:00
Mike J Innes
70393138bc submodule 2017-08-19 11:11:25 +01:00
Mike J Innes
1889ccd316 tracker submodule 2017-08-19 11:06:19 +01:00
Mike J Innes
15c4f38130 printing 2017-08-19 11:00:55 +01:00
Mike J Innes
f1dda12a54 matmul 2017-08-19 10:14:50 +01:00
Mike J Innes
d9c30db2e3 AD skeleton 2017-08-18 16:50:27 +01:00
Mike J Innes
f8482ff80c remove back!, update! 2017-08-18 10:18:45 +01:00
Mike J Innes
21089fea9c remove some explicit back! 2017-08-18 01:19:10 +01:00
Mike J Innes
052cc52ada remove inference, for now 2017-08-18 01:19:10 +01:00
Mike J Innes
e79a1657d4 remove batching and training 2017-08-18 01:19:06 +01:00
Mike J Innes
5f9d8702a4 remove old cb macro 2017-08-18 01:07:58 +01:00
Mike J Innes
bfe1f8eeee shouldn't do this 2017-08-18 01:05:46 +01:00
Mike J Innes
4ed0929ac1 to be rewritten 2017-08-18 01:05:46 +01:00
Mike J Innes
8869f1f953 these are now useless 2017-08-18 01:05:46 +01:00
Mike J Innes
536949891d remove backends 2017-08-18 01:05:42 +01:00
Mike J Innes
cd9521a762 Merge branch 'master' into tf-train 2017-08-17 23:52:12 +01:00
Mike J Innes
f5b41d2e89 Merge pull request #50 from alha02/add-more-tf-ops
Add more tf ops
2017-08-17 21:47:47 +01:00
ylxdzsw
74a4a48162 fix 2017-08-01 13:28:14 +08:00
Mike J Innes
3000c7bbcb style tweak 2017-07-27 20:43:53 +01:00
ylxdzsw
88fa163c95 throttle 2017-07-21 16:31:12 +08:00
ylxdzsw
dd82edd818 fix training on the tensorflow backend 2017-07-21 11:22:40 +08:00
Ali Hamdi
ec7a0bd8f7 add expand_dims 2017-07-08 11:59:17 +02:00
Ali Hamdi
6d106c914d add randn 2017-07-08 11:59:17 +02:00
Ali Hamdi
cd8c2c2f83 add randu 2017-07-08 11:59:17 +02:00
Ali Hamdi
c8d320233a add triangular_solve 2017-07-08 11:59:17 +02:00
Ali Hamdi
015e373a34 add solve 2017-07-08 11:59:17 +02:00
Ali Hamdi
edbf698ed9 add cast 2017-07-08 11:59:17 +02:00
Ali Hamdi
f25f985109 add fill 2017-07-08 11:59:17 +02:00
Ali Hamdi
2fce3b195e add tile 2017-07-08 11:59:17 +02:00
Ali Hamdi
7aad224206 add reshape 2017-07-08 11:58:31 +02:00
Mike J Innes
e915f614aa remove confusing comments 2017-07-03 20:47:41 +01:00
Mike J Innes
e3b432165c Merge pull request #43 from alha02/add-more-tf-ops
Add tf ops: diag, diagm, size, chol
2017-07-03 18:48:32 +01:00
Mike J Innes
7e4801832b Merge pull request #44 from ylxdzsw/train-naive
training julia models
2017-07-03 18:41:39 +01:00
Mike J Innes
ea5d43ed77 show methods 2017-06-19 16:49:29 -07:00
Mike J Innes
0a6b6e1d6f cleaner chunk util 2017-06-19 16:49:17 -07:00
Mike J Innes
b3ef370456 inner constructor depwarn 2017-06-19 16:49:04 -07:00
Mike J Innes
5ee6e8f504 overload correct repeated 2017-06-19 16:48:52 -07:00
Mike J Innes
7d2a34b55d unroll multiple inputs 2017-06-17 19:21:39 -07:00
Mike J Innes
8306ed2ed7 don't force params here 2017-06-17 19:18:09 -07:00
ylxdzsw
c9ae219613 simplify back! of Chain 2017-06-14 21:58:43 +08:00
ylxdzsw
cca21a617c training julia models 2017-06-12 19:39:34 +08:00
Ali Hamdi
422b3bd5d8 Merge remote-tracking branch 'origin/add-more-tf-ops-2' into add-more-tf-ops 2017-06-10 11:45:30 +02:00
Ali Hamdi
5143410313 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-10 11:43:27 +02:00
Mike J Innes
e4a7e7b335 more general seq/tuple support 2017-06-10 08:33:17 +01:00
Ali Hamdi
0b9436cfcd add chol 2017-06-09 21:50:25 +02:00
Ali Hamdi
7559196ab9 add size 2017-06-09 21:46:29 +02:00
Ali Hamdi
697a300ec3 add diagm 2017-06-09 21:35:31 +02:00
Ali Hamdi
6dfb554e95 add diag 2017-06-09 21:22:02 +02:00
Ali Hamdi
e6db3b0e89 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-09 20:53:17 +02:00
Mike J Innes
358ba650ad more robust batches 2017-06-09 18:57:18 +01:00
Mike J Innes
65400f20ab nested batch tweaks 2017-06-09 18:55:21 +01:00
Mike J Innes
9c8dbb6b4b feedforward fix 2017-06-09 18:54:35 +01:00
Ali Hamdi
d80c1abf5e Merge remote-tracking branch 'upstream/master' into add-more-tf-ops-2 2017-06-09 17:55:00 +02:00
Mike J Innes
9b18fd639a mxarray ctx 2017-06-09 14:29:47 +01:00
Ali Hamdi
6b4db60f05 add length 2017-06-09 12:13:30 +02:00
Ali Hamdi
74b36b0094 add cat 2017-06-09 12:09:20 +02:00
Ali Hamdi
57c733af67 add permutedims 2017-06-09 11:55:26 +02:00
Ali Hamdi
d78963ab27 add transpose 2017-06-09 07:38:07 +02:00
Ali Hamdi
a098158ff5 Merge branch 'master' of https://github.com/MikeInnes/Flux.jl into add-more-tf-ops 2017-06-09 07:15:20 +02:00
Mike J Innes
1cc8100456 ctx methods for seq models 2017-06-09 01:35:07 +01:00
Mike J Innes
020ae616cc custom mxnet context 2017-06-09 00:56:52 +01:00
Mike J Innes
fe0bddd98d pass args correctly 2017-06-09 00:55:54 +01:00
Ali Hamdi
c350bfb672 Merge remote-tracking branch 'upstream/master' into add-more-tf-ops 2017-06-08 11:59:59 +02:00
Mike J Innes
4ccbbbb284 dup params fix 2017-06-08 10:49:39 +01:00
Mike J Innes
ab0f184d29 old import 2017-06-08 10:47:58 +01:00
Ali Hamdi
547ca64206 add det 2017-06-08 08:16:12 +02:00
Ali Hamdi
92616a8c3d add inv 2017-06-08 07:31:23 +02:00
Ali Hamdi
ee2e388228 add svd 2017-06-08 07:29:17 +02:00
Mike J Innes
dac80081ba these are probably the wrong abstraction 2017-06-06 18:03:36 +01:00
Mike J Innes
72e8e5c7f9 remove iterators 2017-06-06 18:03:14 +01:00
Mike J Innes
66c0403ffd seq rpad 2017-06-06 17:15:29 +01:00
Mike J Innes
e5df1c31dd batch vcat 2017-06-06 17:15:22 +01:00
Mike J Innes
3c52622896 more constructors 2017-06-06 16:58:57 +01:00
Mike J Innes
5186aa392f refactor 2017-06-06 15:59:10 +01:00
Mike J Innes
21fd79da16 fix 2017-06-05 22:51:08 +01:00
Mike J Innes
13f4af2492 generic and consistent conversions 2017-06-05 22:49:31 +01:00
Mike J Innes
bf3db99bc1 shared conversions 2017-06-05 19:04:14 +01:00
Mike J Innes
b2baf512e1 more code sharing 2017-06-05 18:46:36 +01:00
Mike J Innes
7505ee3a16 nicer display, share more code 2017-06-05 18:10:43 +01:00
Mike J Innes
ce6c96c2be rename 2017-06-05 18:00:44 +01:00
Mike J Innes
8e019e0f30 move batch and seq together 2017-06-05 17:51:30 +01:00
Mike J Innes
a7a46e3a82 that was what I wanted 2017-06-05 17:50:00 +01:00
Mike J Innes
957e23eb6b abstract batch type 2017-06-05 17:45:11 +01:00
Mike J Innes
da1697a257 fix juno import 2017-06-05 17:31:43 +01:00
Mike J Innes
1d7ac3ef3c tweak descriptions 2017-06-05 17:16:38 +01:00
Mike J Innes
2717ace397 split out core 2017-06-05 17:16:28 +01:00
Mike J Innes
9a460e12f2 make Batches submodule 2017-06-05 16:56:44 +01:00
Mike J Innes
cf8227c02f move @ Chain 2017-06-05 16:56:24 +01:00
Mike J Innes
099b9fbeb7 collect exports 2017-06-05 16:37:06 +01:00
Mike J Innes
415c5f6963 fix backend imports 2017-06-05 16:32:16 +01:00
Mike J Innes
ddcd576a74 give up and use AbstractArray 2017-06-05 16:09:06 +01:00
Mike J Innes
837173d65b model abstract is pretty useless 2017-06-05 16:08:23 +01:00
Mike J Innes
4685d2e672 strip down non-obvious exports 2017-06-05 15:47:26 +01:00
Mike J Innes
eafba7a55f graph for seqmodel 2017-06-03 15:52:43 +01:00
Mike J Innes
10abb64f4b fix stateful in backends 2017-06-02 16:28:31 +01:00
Mike J Innes
5232a3300c don't return hidden states 2017-06-02 16:22:55 +01:00
Mike J Innes
baaf9f2682 unrolled graphs only have one output 2017-06-02 16:22:39 +01:00
Mike J Innes
50bbb87674 stateless 2017-06-02 16:11:09 +01:00
Mike J Innes
d1f370a2f1 move stateful 2017-06-02 16:02:47 +01:00
Mike J Innes
2b85c76785 preserve original param states 2017-06-02 15:46:24 +01:00
Mike J Innes
af65b9200c mxnet repeated 2017-06-02 15:34:30 +01:00
Mike J Innes
4698cc0156 this might be a bit dirty 2017-06-02 15:22:23 +01:00
Mike J Innes
215e997540 broadcastto 2017-06-02 15:02:30 +01:00
Mike J Innes
edb1099cec more mxnet ops 2017-06-02 14:42:15 +01:00
Mike J Innes
e076bee00a mxnet broadcastto 2017-06-02 13:44:29 +01:00
Mike J Innes
4712568ac2 group tuple functions 2017-06-02 13:43:58 +01:00
Mike J Innes
4245f860e0 mxnet exp. 2017-06-02 13:42:59 +01:00
Mike J Innes
011449aa24 tuple sum 2017-06-02 13:40:38 +01:00
Mike J Innes
07c523a13d fix flatten/softmax batching behaviour 2017-06-01 19:28:02 +01:00
Mike J Innes
b54281bdea hadamard product 2017-06-01 19:27:46 +01:00
Mike J Innes
8cf5afdd79 add note 2017-06-01 18:37:20 +01:00
Mike J Innes
c7f8d86f9e fix mxnet 2017-06-01 16:57:39 +01:00
Mike J Innes
7a2a72a74a fix tensorflow 2017-06-01 16:56:48 +01:00
Mike J Innes
f7eb5179b1 fix basic interpreters 2017-06-01 16:55:48 +01:00
Mike J Innes
3532c7174f early throw on cyclic graphs 2017-06-01 16:52:49 +01:00
Mike J Innes
45c5502f90 obviate mapconst 2017-06-01 16:52:49 +01:00
Mike J Innes
c969d13306 mxnet vec 2017-05-31 18:23:33 +01:00
Mike J Innes
673abbcc1d mxnet hcat 2017-05-31 18:08:11 +01:00
Mike J Innes
033ed13f69 Merge pull request #38 from staticfloat/sf/act_on_your_feelings_young_one
Fix activation functions
2017-05-31 14:22:52 +01:00
Mike J Innes
b3e9dfa264 fix exec check 2017-05-30 18:04:31 +01:00
Mike J Innes
5654e15167 this is gone 2017-05-30 17:47:21 +01:00
Mike J Innes
790a58be1d support maps over tuples 2017-05-30 17:23:34 +01:00
Mike J Innes
d788cc8c54 remove old imap 2017-05-30 16:59:12 +01:00
Mike J Innes
5fa1eb4034 allow non-param states 2017-05-30 16:53:21 +01:00
Mike J Innes
b145b46cbb fix for recurrent anon models 2017-05-30 16:37:44 +01:00
Elliot Saba
dfa4dcccb5 Fix activation functions 2017-05-25 18:12:58 -07:00
Mike J Innes
e37973c3d5 Merge pull request #35 from oxinabox/patch-1
Replace deprecated `unpack` with `unstack`
2017-05-25 15:48:14 +01:00
Mike J Innes
fb8872b0b9 Merge pull request #24 from tkelman/nopiracy
Don't extend base functions on base types
2017-05-25 15:47:24 +01:00
Mike J Innes
9909af72a4 updates for new broadcasting approach 2017-05-24 12:02:03 +01:00
Lyndon White
d0fd9cd622 Replace deprecated unpack with unstack 2017-05-24 10:35:17 +08:00
Tony Kelman
41ea071f3a Use a package-local squeeze function instead of extending Base 2017-05-22 04:08:46 -04:00
Tony Kelman
5cbb47a13d Don't extend base functions on base types
better broadcast syntax
2017-05-22 04:05:57 -04:00
Mike J Innes
0ed4e65d2f refactor 2017-05-11 19:04:08 +01:00
ylxdzsw
1bd0a43b7d batch iterator 2017-05-11 19:03:53 +01:00
ylxdzsw
019e34194d catmat fix 2017-05-11 15:49:13 +01:00
Ali Hamdi
92d39f9ff7 dim defaults to nothing for reduction ops 2017-05-09 16:50:32 +02:00
Ali Hamdi
d5b073db74 add more basic math ops to tf backend 2017-05-09 14:39:10 +02:00
Ali Hamdi
eda5c2c776 add .^ and reduction ops for tf backend 2017-05-09 01:29:15 +02:00
Mike J Innes
bd97c0f93b strip away params 2017-05-04 17:01:10 +01:00
Mike J Innes
5be9ce45d8 support constant arrays in MXNet 2017-05-04 15:09:18 +01:00
Mike J Innes
a2db4b5319 factor out common tests 2017-05-04 13:52:31 +01:00
Mike J Innes
70286c0bf3 src docs 2017-05-04 13:17:21 +01:00
Mike J Innes
e04dcbd460 reorganise recurrent stuff 2017-05-04 10:45:44 +01:00
Mike J Innes
c025cddc73 runmodel no longer needed 2017-05-04 10:32:53 +01:00
Mike J Innes
51bf904179 better return value 2017-05-03 19:13:33 +01:00
Mike J Innes
c9dcc815dc add this for convenience 2017-05-03 18:33:23 +01:00
Mike J Innes
8c10a95bf4 more appropriate naming 2017-05-03 18:33:14 +01:00
Mike J Innes
d7ff193ad6 basic back pass 2017-05-01 19:44:58 +01:00
Mike J Innes
af30ee78d2 remove old code 2017-05-01 19:44:26 +01:00
Mike J Innes
b35f50571c use params instead of vars 2017-05-01 18:27:52 +01:00
Mike J Innes
796d7d7e99 recurrent models in tf 2017-05-01 18:05:17 +01:00
Mike J Innes
3998be2244 tuple i/o tests 2017-05-01 17:44:20 +01:00
Mike J Innes
2467ca4187 tuple inputs in tensorflow 2017-05-01 17:41:42 +01:00
Mike J Innes
357f989de5 pull out tuple utils 2017-05-01 16:57:51 +01:00
Mike J Innes
2934607115 multi-tuple output 2017-05-01 16:30:24 +01:00
Mike J Innes
f5b0f3e530 identity function 2017-05-01 16:28:39 +01:00
Mike J Innes
ba54552be5 remove special-cased training code
splits model and exec to allow multi inputs
2017-05-01 16:12:46 +01:00
Mike J Innes
d4ee8a6a2a use juno's info function 2017-05-01 15:14:29 +01:00
Mike J Innes
b19e31714d allow access to loop vars 2017-05-01 14:00:39 +01:00
Mike J Innes
33c6781c99 remove from training.jl 2017-05-01 13:47:08 +01:00
Mike J Innes
6778d00dbf this is no longer specific to training 2017-05-01 13:46:23 +01:00
Mike J Innes
d095c640c9 this should be a general util 2017-05-01 13:45:24 +01:00
Mike J Innes
eae13c533f training callbacks 2017-05-01 13:43:07 +01:00
Mike J Innes
5dce8df678 fix accuracy for batches 2017-05-01 13:40:11 +01:00
Mike J Innes
dba6bef245 this should be progress'd too 2017-05-01 12:46:02 +01:00
Mike J Innes
38852964f6 organise training and utils 2017-05-01 12:41:54 +01:00
Mike J Innes
0e6bb17709 import this in compiler 2017-05-01 12:31:00 +01:00
Mike J Innes
36f4103d7d cost is really a layer 2017-05-01 12:30:28 +01:00
Mike J Innes
7425584c91 more correct 2017-05-01 11:33:24 +01:00
Mike J Innes
7415e61287 remove mse!, export logloss 2017-05-01 11:28:41 +01:00
Mike J Innes
360d128b1a logloss 2017-04-28 17:14:27 +01:00
Mike J Innes
ef4ec5be4b customisable loss 2017-04-28 17:14:21 +01:00
Mike J Innes
63b328142a print epochs again 2017-04-27 17:43:38 +01:00
Mike J Innes
121af0579a update for rnn wrappers 2017-04-27 17:27:08 +01:00
Mike J Innes
f4acd286c8 fix tensorflow dep 2017-04-27 12:48:11 +01:00
Mike J Innes
52a7199d10 gradients for recurrent models 2017-04-26 17:42:47 +01:00
Mike J Innes
19cf3e2b62 split out runseq 2017-04-19 17:33:55 +01:00
Mike J Innes
edfb0211e6 better for nested batches 2017-04-19 17:18:40 +01:00
Mike J Innes
14afe54143 fixes for recurrent networks 2017-04-19 17:17:37 +01:00
Mike J Innes
358334a893 mxnet back! for multi outputs 2017-04-19 17:13:57 +01:00
Mike J Innes
42a8117704 seqmodel with raw arrays 2017-04-19 15:13:17 +01:00
Mike J Innes
f8a3b02c1d fix tf train 2017-04-19 14:48:10 +01:00
Mike J Innes
42688f8aa8 update training process, mnist example 2017-04-19 14:23:48 +01:00
Mike J Innes
f6334e7e96 still need batchone 2017-04-19 13:26:37 +01:00
Mike J Innes
5357b1e9f9 remove fake batching vestiges 2017-04-19 13:19:18 +01:00
Mike J Innes
228f7d487c remove fake batch semantics 2017-04-18 21:04:21 +01:00
Mike J Innes
88cf6d9e61 sensible default for unsqueeze 2017-04-18 20:55:59 +01:00
Mike J Innes
63f3f3b569 no longer needed 2017-04-18 19:27:24 +01:00
Mike J Innes
a3acf11a60 remove ml export 2017-04-18 19:02:55 +01:00
Mike J Innes
5cb30cecfe nicer convertel 2017-04-17 18:15:01 +01:00
Mike J Innes
e39c24fd86 lazy loading note 2017-04-17 15:56:59 +01:00
Mike J Innes
c22561c7eb Merge pull request #15 from ranjanan/RA/mse
Fix tiny bug in mse cost function
2017-04-04 19:23:44 +01:00
Mike J Innes
efa8d75ea3 neater 2017-04-04 19:19:08 +01:00
Mike J Innes
f8e1f20728 mxnet recurrence test 2017-03-31 12:39:23 +01:00
Mike J Innes
b4221f6ea6 recurrence working 2017-03-30 20:05:18 +01:00
Mike J Innes
4113d4d476 mimo working in mxnet 2017-03-30 19:50:03 +01:00
Mike J Innes
94e384930d fix back pass 2017-03-30 19:36:59 +01:00
Mike J Innes
4de61fc377 fix exec forward pass 2017-03-30 19:25:54 +01:00
Mike J Innes
4df97bf607 build executor correctly 2017-03-30 19:16:24 +01:00
Mike J Innes
5df56b6073 build graphs lazily 2017-03-30 18:14:08 +01:00
Mike J Innes
acbc4ea071 mimo models 2017-03-30 15:54:42 +01:00
ranjanan
c773229136 Fix tiny bug in mse cost function 2017-03-30 12:09:40 +05:30
Mike J Innes
ca0e20ed7a make seqmodel a model 2017-03-29 19:25:50 +01:00
Mike J Innes
5111db4037 unrolling test 2017-03-29 18:30:28 +01:00
Mike J Innes
7e983c74cb make this immutable again 2017-03-29 17:47:43 +01:00
Mike J Innes
2082d9db5c working seqmodel 2017-03-28 19:54:32 +01:00
Mike J Innes
abe66d398f remove old recurrence code 2017-03-27 18:24:02 +01:00
Mike J Innes
05ac3855f4 fixes 2017-03-27 18:23:22 +01:00
Mike J Innes
1a5e050a88 recurrence overhaul mostly working 2017-03-21 01:32:12 +00:00
Mike J Innes
90edefe072 use broadcasting plus 2017-03-21 01:18:00 +00:00
Mike J Innes
a5bd72753e fix nested mapt 2017-03-20 23:19:03 +00:00
Mike J Innes
eadf96605c fix recurrent layer batching 2017-03-20 23:11:08 +00:00
Mike J Innes
6237aa6739 shape: handle tuples better 2017-03-20 23:10:38 +00:00
Mike J Innes
6756ce7528 relu inferrence 2017-03-20 19:57:11 +00:00
Mike J Innes
498a66e7b6 update capacitors 2017-03-20 19:57:00 +00:00
Mike J Innes
7af64398d5 split out unseqout 2017-03-19 22:12:30 +00:00
Mike J Innes
8f911cc31e so can unseq 2017-03-19 22:12:30 +00:00
Mike J Innes
69b24bfa9b stateless can be a postprocess 2017-03-19 22:12:30 +00:00
Mike J Innes
93ed44b31a note about unroll options 2017-03-19 22:12:29 +00:00
Mike J Innes
d73e962da9 replace old shape inference system 2017-03-17 16:34:54 +00:00
Mike J Innes
7ef6e295cb reuse mapt 2017-03-17 01:04:41 +00:00
Mike J Innes
d27d59b071 softmax on batches 2017-03-16 11:52:00 +00:00
Mike J Innes
2adc3cd18e new struct syntax 2017-03-14 17:56:03 +00:00
Mike J Innes
0cf99dbbdf 0.6 dep 2017-03-14 17:20:39 +00:00
Mike J Innes
b0ae0849d0 invoke dep 2017-03-14 16:52:49 +00:00
Mike J Innes
6751657dfa typealias / abstract deps 2017-03-14 16:51:31 +00:00
Mike J Innes
fa56f78781 isa syntax 2017-03-14 15:21:18 +00:00
Mike J Innes
daf962a831 tensorflow exception handling 2017-03-12 18:34:11 +00:00
Mike J Innes
4b82c57f88 strip more of the trace 2017-03-12 18:33:47 +00:00
Mike J Innes
cde74337c8 should've moved this 2017-03-12 18:33:29 +00:00
Mike J Innes
06e9626c0b strip macro line numbers 2017-03-12 18:33:02 +00:00
Mike J Innes
418f5bfe28 restore alterparam 2017-03-12 14:51:55 +00:00
Mike J Innes
3f919aa81e avoid line numbers in runmodel 2017-03-09 01:10:44 +00:00
Mike J Innes
15b3ce1ada factor out fake batching semantics 2017-03-09 00:13:26 +00:00
Mike J Innes
c4d815b5fc move todo 2017-03-09 00:12:49 +00:00
Mike J Innes
854a1e1886 nicer mxnet api 2017-03-08 21:41:13 +00:00
Mike J Innes
9d1d176749 transpose everything going into mxnet 2017-03-08 17:35:15 +00:00
Mike J Innes
3b004bac7d fix for latest mxnet api 2017-03-08 16:20:33 +00:00
Mike J Innes
6a3bed1e61 this is pretty useless 2017-03-08 15:38:55 +00:00
Mike J Innes
9f62227db3 mxarray setindex 2017-03-08 15:36:51 +00:00
Mike J Innes
d713c67f02 redundant method 2017-03-08 15:36:44 +00:00
Mike J Innes
39bb625246 better mxarray dims api 2017-03-08 15:36:38 +00:00
Mike J Innes
9ecb1311a5 copy methods for param 2017-03-08 15:36:25 +00:00
Mike J Innes
d35191595d mxarray 2017-03-08 01:19:51 +00:00
Mike J Innes
06f2ee2284 iterate over chain 2017-03-07 14:37:37 +00:00
Mike J Innes
2a57150bce AvgPool shim 2017-03-06 17:21:35 +00:00
Mike J Innes
5d919175fc mxnet multi output 2017-03-06 17:20:15 +00:00
Mike J Innes
d9910070b4 fix/test native multi outputs 2017-03-06 16:12:03 +00:00
Mike J Innes
a03898d24d this was wrong 2017-03-03 14:06:51 +00:00
Mike J Innes
4d4979b401 better alternative to basemodel 2017-02-28 16:42:48 +00:00
Mike J Innes
5f1f2ebaa2 model storage notes 2017-02-28 16:41:33 +00:00
Mike J Innes
4c0e915c00 remove c++ stacktrace 2017-02-27 19:49:32 +00:00
Mike J Innes
c33f695c51 precompile 2017-02-24 15:49:03 +00:00
Mike J Innes
f3d0c21952 quote was the right thing here 2017-02-24 15:48:52 +00:00
Mike J Innes
bad6b2d1ae compile the forward pass again 2017-02-24 14:38:17 +00:00
Mike J Innes
9921881d47 fixes apparently needed for 0.6 2017-02-23 23:40:22 +00:00
Mike J Innes
06fd5adddc parameter storage 2017-02-23 21:42:34 +00:00
Mike J Innes
a4812579e9 fix back pass 2017-02-23 21:06:46 +00:00
Mike J Innes
2d77220d60 tweak param loading 2017-02-23 18:48:46 +00:00
Mike J Innes
2f2ff0b03b graph struct 2017-02-23 17:32:06 +00:00
Mike J Innes
f230b7cebf make mxerr more robust 2017-02-23 16:58:29 +00:00
Mike J Innes
21775f8c89 fix for error message change 2017-02-23 16:58:17 +00:00
Mike J Innes
616425554d update for mxnet api 2017-02-23 16:58:10 +00:00
Mike J Innes
a794f068a5 try to get biases working somewhat naturally 2017-02-21 16:07:58 +00:00
Mike J Innes
092f2038b3 use affine only for compat 2017-02-21 14:12:11 +00:00
Mike J Innes
0bb44f5ace affine error messages 2017-02-21 13:31:41 +00:00
Mike J Innes
8d63bf8053 get this somewhat working 2017-02-21 12:58:31 +00:00
Mike J Innes
417a70713b julia's map is shitty 2017-02-21 09:05:08 +00:00
Mike J Innes
521258eb08 more intuitive naming 2017-02-21 08:56:05 +00:00
Mike J Innes
4fb51d1752 factor out var creation 2017-02-21 08:52:56 +00:00
Mike J Innes
12cde694b3 conv/affine hacks 2017-02-21 08:52:40 +00:00
Mike J Innes
96c0e76b92 style 2017-02-21 08:50:54 +00:00
Mike J Innes
14cbdc5e9c these aren't working 2017-02-21 08:50:47 +00:00
Mike J Innes
b49daa367f fix for empty dicts 2017-02-20 23:45:07 +00:00
Mike J Innes
54011045e7 fix 2017-02-20 23:15:27 +00:00
Mike J Innes
3fdffea37d fix 2017-02-20 21:50:01 +00:00
Mike J Innes
b92252cd5c more interp catches 2017-02-20 21:49:47 +00:00
Mike J Innes
242b4d6e23 fixes 2017-02-20 21:49:02 +00:00
Mike J Innes
ad4d60f90d fix mx.FeedForward 2017-02-20 19:35:32 +00:00
Mike J Innes
f3a9934858 update mnist example 2017-02-02 10:09:41 +05:30
Mike J Innes
a041da3fdc shapes macro 2017-02-01 21:06:02 +05:30
Mike J Innes
cd86dfdf07 fix hintify 2017-02-01 20:26:38 +05:30
Mike J Innes
a1e35ea2c8 stacktrace fixes 2017-02-01 19:51:08 +05:30
Mike J Innes
00a5acde46 fixes 2017-02-01 17:06:42 +05:30
Mike J Innes
7c450c3200 clean up forward pass interp errors 2017-02-01 16:56:53 +05:30
Mike J Innes
abe4ab94aa clean up interpreter imports 2017-02-01 12:27:02 +05:30
Mike J Innes
7750f5375b handle interpreter errors 2017-02-01 11:56:20 +05:30
Mike J Innes
1f3587e9dc mx batch semantics 2017-01-30 23:35:15 +05:30
Mike J Innes
16d6c9aed9 MXModel -> MX.Model 2017-01-30 23:35:05 +05:30
Mike J Innes
3cb3aea825 pretty sure we don't need this 2017-01-30 23:34:36 +05:30
Mike J Innes
cd0aa26b0e simplify graph 2017-01-30 23:19:18 +05:30
Mike J Innes
de72d83f7c factor out node registration 2017-01-30 23:12:01 +05:30
Mike J Innes
508cd730c9 use ituple properly 2017-01-30 22:55:12 +05:30
Mike J Innes
a73b53e05e use iargs 2017-01-30 22:51:49 +05:30
Mike J Innes
7a01605aab add this back later 2017-01-30 22:38:38 +05:30
Mike J Innes
5f1d560e38 265 2017-01-30 22:26:46 +05:30
Mike J Innes
349e81172d throw graph-aware exceptions 2017-01-29 23:35:03 +05:30
Mike J Innes
3981485500 show compile error trace 2017-01-29 17:29:37 +05:30
Mike J Innes
c2d6059d73 redundant 2017-01-29 16:58:22 +05:30
Mike J Innes
12d05a2db1 record stack traces 2017-01-29 16:09:30 +05:30
Mike J Innes
8bf5d91605 symbolname utility 2017-01-28 23:36:58 +05:30
Mike J Innes
70168319eb remove tond 2017-01-28 23:07:22 +05:30