batchnorm: update docs
parent ce46843459
commit 5253841acc
@@ -36,5 +36,6 @@ swish
 These layers don't affect the structure of the network but may improve training times or reduce overfitting.
 
 ```@docs
+BatchNorm
 Dropout
 ```
@@ -2,8 +2,8 @@
     testmode!(m)
     testmode!(m, false)
 
-Put layers like [`Dropout`](@ref) and `BatchNorm` into testing mode (or back to
-training mode with `false`).
+Put layers like [`Dropout`](@ref) and [`BatchNorm`](@ref) into testing mode
+(or back to training mode with `false`).
 """
 function testmode!(m, val::Bool=true)
   prefor(x -> _testmode!(x, val), m)
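The docstring above is the whole `testmode!` story: flip stochastic/stateful layers into evaluation mode, then back. A minimal usage sketch (not part of this commit; the model and layer sizes are made up for illustration):

```julia
using Flux

# Toy model containing a Dropout layer, which behaves differently in
# training vs. testing mode (illustrative sizes only).
m = Chain(Dense(10, 5, relu), Dropout(0.5), Dense(5, 2), softmax)

testmode!(m)          # put Dropout (and any BatchNorm) into testing mode
ŷ = m(rand(10))       # deterministic forward pass for evaluation
testmode!(m, false)   # switch back to training mode
```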
@@ -48,7 +48,7 @@ _testmode!(a::Dropout, test) = (a.active = !test)
     BatchNorm(dims...; λ = identity,
               initβ = zeros, initγ = ones, ϵ = 1e-8, momentum = .1)
 
-Batch Normalization Layer
+Batch Normalization Layer for [`Dense`](@ref) layer.
 
 See [Batch Normalization: Accelerating Deep Network Training by Reducing
 Internal Covariate Shift](https://arxiv.org/pdf/1502.03167.pdf)
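For readers of the docstring, the parameters named in the signature (γ via `initγ`, β via `initβ`, `ϵ`, and the activation `λ`) map onto the transform from the linked paper. A standalone sketch of that per-feature computation, not Flux's internal implementation:

```julia
using Statistics  # mean

# Normalise a batch of activations for a single feature, then scale, shift,
# and apply the activation; parameter names follow the docstring (sketch only).
function batchnorm_sketch(x; γ = 1.0, β = 0.0, λ = identity, ϵ = 1e-8)
    μ  = mean(x)
    σ² = mean(abs2, x .- μ)                  # biased batch variance
    λ.(γ .* (x .- μ) ./ sqrt(σ² + ϵ) .+ β)
end
```

A batch-norm layer typically also keeps running estimates of μ and σ² (hence the `momentum` keyword) for use once `testmode!` is active, instead of the per-batch statistics above.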
@@ -65,6 +65,8 @@ julia> m = Chain(
          BatchNorm(10),
          softmax)
+Chain(Dense(784, 64), BatchNorm(64, λ = NNlib.relu), Dense(64, 10), BatchNorm(10), NNlib.softmax)
 
+julia> opt = SGD(params(m), 10) # a crazy learning rate
 ```
 """
 mutable struct BatchNorm{F,V,N}