Mike J Innes
951c21366a
fix regex
2017-12-08 16:42:30 +00:00
GenaBitu
7e51418679
Added back for multi-parameter vcat
2017-12-08 16:10:09 +01:00
baggepinnen
385dee9d16
Add jacobian function
2017-12-08 14:46:12 +01:00
GenaBitu
41f3eedc39
Proper multi-variable vcat
2017-12-07 17:50:18 +01:00
Elliot Saba
41446d547f
Add weighted_crossentropy
for imbalanced classification problems
2017-12-05 17:09:05 -08:00
Elliot Saba
c59b820bed
Add glorot (Xavier) initialization
Set default `Dense` and `RNN` inits to `glorot_uniform()` for `W`, `zeros` for `b`.
2017-12-05 14:24:48 -08:00
GenaBitu
62b3600eca
Merge branch 'master' into cat-fix
2017-12-05 11:13:29 +01:00
baggepinnen
41febee9c1
Export and indent
2017-12-04 09:34:27 +01:00
baggepinnen
36001d085a
Implement AMSGrad optimiser
2017-12-04 09:17:05 +01:00
Mike J Innes
cab235a578
gpu compat
2017-11-30 13:51:31 +00:00
Mike J Innes
19039f4881
export sigmoid
2017-11-30 13:37:38 +00:00
Mike J Innes
2d33f19346
onehot unk arg
2017-11-29 16:45:50 +00:00
baggepinnen
fa718c7475
Implement Gated Recurrent Unit
2017-11-24 14:33:06 +01:00
CarloLucibello
13b934c250
improve optimizers
2017-11-24 12:12:20 +01:00
Mike J Innes
dc1f08a709
Merge pull request #98 from FluxML/log
GPU-ready log function
2017-11-23 17:17:39 +00:00
Mike J Innes
9f5c4dd3e9
Merge pull request #104 from baggepinnen/patch-1
Allow array of optimisers to train!
2017-11-21 17:16:35 +01:00
Mike J Innes
feb35783e6
Merge pull request #95 from FluxML/layernorm
Layer Normalisation
2017-11-21 17:12:49 +01:00
Mike J Innes
351d3d4771
std derivative
2017-11-21 17:04:04 +01:00
Mike J Innes
b06884b912
LayerNorm tweaks
2017-11-21 16:32:36 +01:00
skariel
11d53781b2
adding layer normalization
2017-11-21 16:30:24 +01:00
Mike J Innes
979949d01a
style
2017-11-21 15:25:09 +01:00
Mike J Innes
785fbcf68e
Merge pull request #107 from baggepinnen/patch-2
Fix bug in rmsprop and adadelta
2017-11-21 15:24:11 +01:00
Mike J Innes
e51268caf5
mention treelike
2017-11-21 12:59:39 +01:00
Mike J Innes
187fddc11c
doc fixes
2017-11-21 12:29:02 +01:00
Fredrik Bagge Carlson
8991ce028c
Fix bug in rmsprop and adadelta
`@. p.Δ = η * p.Δ / √acc` parses correctly while `@. p.Δ /= √acc*η` seems to parse like `@. p.Δ /= (√acc*η)`, hence the step size was de facto interpreted as `1/η`
2017-11-14 17:32:16 +01:00
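The precedence issue described in the commit above can be reproduced outside Julia. The following is a minimal sketch in Python, not the Flux code itself: the function names and the sample values are hypothetical, and it assumes only that an augmented assignment such as `x /= a*b` evaluates the whole right-hand side before dividing, which matches the behaviour the commit message reports.

```python
import math

def step_intended(delta, acc, eta):
    # Intended update: the learning rate eta scales the step.
    return eta * delta / math.sqrt(acc)

def step_buggy(delta, acc, eta):
    # The right-hand side is evaluated first, so this computes
    # delta / (sqrt(acc) * eta): the step is scaled by 1/eta instead of eta.
    delta /= math.sqrt(acc) * eta
    return delta

# Hypothetical sample values, chosen only for illustration.
delta, acc, eta = 1.0, 4.0, 0.1
intended = step_intended(delta, acc, eta)
buggy = step_buggy(delta, acc, eta)
```

With a small `eta`, the two results differ by a factor of `1/eta**2`, which is why the original form behaved as if the step size were `1/η`.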
Mike J Innes
e0657d93ec
mv numeric.jl to nnlib
2017-11-09 15:06:29 +00:00
Mike J Innes
2cb94981a0
gpu-ready log
2017-11-09 15:04:01 +00:00
Mike J Innes
e5d99d784e
fixes #79
2017-11-09 14:53:26 +00:00
Mike J Innes
ccdc046546
fixes #79
2017-11-09 14:52:28 +00:00
Mike J Innes
752a9e2808
tree utilities
2017-11-08 22:19:01 +00:00
Mike J Innes
6eb2ec154b
sentiment treebank loader
2017-11-08 22:19:01 +00:00
Mike J Innes
8777362eee
exports
2017-11-08 22:19:01 +00:00
Mike J Innes
8b05317895
basic tree
2017-11-08 22:19:01 +00:00
Mike J Innes
7e9468d8f8
treebank skeleton
2017-11-08 22:19:01 +00:00
Mike J Innes
bdf02e42ae
test tweaks
2017-11-08 22:18:45 +00:00
Mike J Innes
fcd091e8f0
Ac_mul_B derivatives
2017-11-08 22:18:45 +00:00
Mike J Innes
d4229c4815
useful params method
2017-11-08 22:18:45 +00:00
Mike J Innes
d6423eefe5
matrix-vector fast path
2017-11-08 22:18:45 +00:00
Fredrik Bagge Carlson
97244e0a68
Allow array of optimisers to train!
This allows an array of optimisers to be sent to `train!`
2017-11-04 13:27:32 +01:00
Mike J Innes
efa51f02e7
basic batch type
2017-11-02 11:49:42 +00:00
Mike J Innes
21ea93ffcd
rename treelike
2017-11-02 11:47:34 +00:00
Iblis Lin
6c7613e02b
batchnorm: leverage TrackedArray mean
2017-11-02 14:20:34 +08:00
Iblis Lin
88bd8a8fbd
batchnorm: make CuArrays happy
2017-11-02 14:02:41 +08:00
Iblis Lin
477da75428
batchnorm: fix mapchildren
2017-11-02 13:32:12 +08:00
Iblis Lin
7f5ba594a9
batchnorm: more test cases
2017-11-02 13:32:12 +08:00
Iblis Lin
5253841acc
batchnorm: update docs
2017-11-02 13:32:12 +08:00
Iblis Lin
ce46843459
batchnorm: add test cases
2017-11-02 13:32:12 +08:00
Iblis Lin
b3356cc6bb
batchnorm: batch σ correct coefficient
2017-11-02 13:32:12 +08:00
Iblis Lin
e0201be770
batchnorm: parameterize momentum and epsilon
2017-11-02 13:32:12 +08:00
Iblis Lin
669273b008
layer: implement BatchNorm layer
See [Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift](https://arxiv.org/pdf/1502.03167.pdf)
2017-11-02 13:32:12 +08:00