From 80e378d6c969547a9afd6221d0d52bfeee7cd97a Mon Sep 17 00:00:00 2001
From: Mike Innes <mike.j.innes@gmail.com>
Date: Fri, 16 Mar 2018 12:22:30 +0000
Subject: [PATCH] update readme

---
 README.md | 72 +++++++++++++++++++++++++++++++++++++++++++++++++++++--
 1 file changed, 70 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 72c68bd7..b7c901d9 100644
--- a/README.md
+++ b/README.md
@@ -2,10 +2,78 @@
 
 [![Build Status](https://travis-ci.org/FluxML/Flux.jl.svg?branch=master)](https://travis-ci.org/FluxML/Flux.jl) [![](https://img.shields.io/badge/docs-stable-blue.svg)](https://fluxml.github.io/Flux.jl/stable/) [![](https://img.shields.io/badge/chat-on%20slack-yellow.svg)](https://slackinvite.julialang.org/)
 
-Flux is a refreshing approach to machine learning. It provides lightweight abstractions on top of Julia's native GPU and AD support, while remaining fully hackable (right down to the [GPU kernels](https://github.com/FluxML/CuArrays.jl)).
+Flux is an elegant approach to machine learning. It's a 100% pure-Julia stack, and provides lightweight abstractions on top of Julia's native GPU and AD support. Flux makes the easy things easy while remaining fully hackable.
 
 ```julia
 julia> Pkg.add("Flux")
 ```
 
-See the [documentation](http://fluxml.github.io/Flux.jl/stable/) or the [model zoo](https://github.com/FluxML/model-zoo/) for examples.
+See the [documentation](http://fluxml.github.io/Flux.jl/) or the [model zoo](https://github.com/FluxML/model-zoo/) for examples.
+
+## Features
+
+Flux has powerful high-level features, and common architectures can be defined in a few lines.
+
+```julia
+model = Chain(
+  Dense(768, 128),
+  LSTM(128, 256)
+  LSTM(256, 128)
+  Dense(128, 10),
+  softmax)
+
+loss(x, y) = crossentropy(model(x), y)
+
+Flux.train!(loss, data, ADAM(...))
+```
+
+Yet you can easily strip away the layers, and directly write the mathematics for your problem. Flux will seamlessly take gradients of any Julia code, so your model looks just like the paper.
+
+```julia
+W = param(randn(2, 10))
+b = param(randn(2))
+
+y(x) = σ.(W * x .+ b)
+```
+
+If that's *still* not enough, you can go as deep as you want, even writing your own CUDA kernels! All this can be freely mixed-and-matched in a single model or script, and it all runs interactively via Jupyter or Juno.
+
+```julia
+function gpu_add(a, b, c)
+  i = (blockIdx().x-1) * blockDim().x + threadIdx().x
+  c[i] = a[i] + b[i]
+  return nothing
+end
+```
+
+Unusual architectures are no problem in Flux, as you can use all the loops, control flow and even macros that you're used to. Here's a Tree RNN in 4 lines.
+
+```julia
+tree() = rand() < 0.5 ? rand(10) : (tree(), tree()) # dummy data
+
+shrink = Dense(20, 10)
+combine(a, b) = shrink([a; b])
+
+model(x) = x
+model(x::Tuple) = combine(model(x[1]), model(x[2]))
+
+model(tree()) # Sample output
+```
+
+Despite this flexibility, Julia's advanced compiler lets us do some powerful optimisations. For example, this definition of `sigmoid` automatically gets fused into a *single* GPU kernel – so it's really fast.
+
+```julia
+sigmoid(xs) = 1 ./ (1 .+ exp.(.-xs))
+```
+
+Similarly, Flux is the first dynamic framework to support [compiling to the browser](https://fluxml.github.io/experiments/) and model import via [formats like ONNX](https://github.com/FluxML/ONNX.jl/), both of which are thinly-veiled compiler problems.
+
+For more on our philosophy on machine learning, check out our article [On Machine Learning & Programming Languages](https://julialang.org/blog/2017/12/ml&pl).
+
+## Contributing & Help
+
+For general questions and help, check out Julia's [community forum](https://discourse.julialang.org/c/domain/ML).
+
+Flux development is carried out via our [GitHub issues](https://github.com/FluxML/Flux.jl/issues), so feel free to open feature requests or PRs here.
+
+For more informal discussions we'd love to have you on the [Julia slack](https://slackinvite.julialang.org/), where we hang out on the #machine-learning channel.