1221: DataLoader with NamedTuple r=CarloLucibello a=cossio
Just a couple of small changes so that `DataLoader` can be created with a `NamedTuple` of tensors instead of a `Tuple`. This way the tensors can be referred to by name. For example:
```
train_loader = DataLoader((images = Xtrain, labels = Ytrain), batchsize=16)
batch = first(train_loader)
y = model(batch.images)
logitcrossentropy(y, batch.labels)
```
If we only use tuples, then with datasets containing multiple tensors one has to be careful about the order in which the tensors are fed into the `DataLoader` constructor and stay consistent with that order elsewhere. With `NamedTuple`s one just has to be consistent about the names used, which I think is a minor improvement.
CC @CarloLucibello
### PR Checklist
- [x] Tests are added
- [x] Entry in NEWS.md
- [x] Documentation, if applicable
I don't think this qualifies as an API change; it's just a minor feature addition, so a final review is probably not required.
- [ ] Final review from `@MikeInnes` or `@dhairyagandhi96` (for API changes).
Co-authored-by: cossio <j.cossio.diaz@gmail.com>
Co-authored-by: cossio <cossio@users.noreply.github.com>
1231: use `ntuple` in conv r=MikeInnes a=MikeInnes
This is the right abstraction over `map`, and in particular is a bit easier to compile away in some cases.
As this is a trivial change from Flux's perspective, it's not easy to test here, but there are downstream tests in XLA.jl.
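For illustration (a hypothetical sketch, not the exact diff), the pattern is building per-dimension tuples with `ntuple` rather than `map`:
```julia
# Illustrative pattern: expand a scalar stride/pad/dilation into an N-tuple.
# With `ntuple` the length comes straight from the call site, so the compiler
# can often unroll and constant-fold the result, unlike a generic `map`.
expand(N, x) = ntuple(_ -> x, N)

expand(2, 1)  # (1, 1)
expand(4, 0)  # (0, 0, 0, 0)
```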
Co-authored-by: Mike J Innes <mike.j.innes@gmail.com>
1218: Require weight and bias to be AbstractArrays r=CarloLucibello a=oxinabox
closes #1199
While in theory someone could be using `Dense` with weights and biases that are not abstract arrays, I would be surprised.
So allowing it just leaves a foot-gun lying around.
If it turns out to be common, we can instead close #1199 by adding a special constructor for `Number` subtypes that errors if they are not integers, or something along those lines.
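As a hedged sketch of what the restriction could look like (field names and constructors here are illustrative, not the actual diff):
```julia
# Illustrative: constraining the type parameters makes construction with a
# non-array weight or bias fail immediately with a MethodError.
struct Dense{F,S<:AbstractArray,T<:AbstractArray}
  W::S
  b::T
  σ::F
end

Dense(W, b) = Dense(W, b, identity)

Dense(rand(5, 10), rand(5))   # ok
# Dense(rand(5, 10), 0.0)     # MethodError: bias is not an AbstractArray
```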
### PR Checklist
- [x] Tests are added
- [x] Entry in NEWS.md
I think this is a bug fix, so the following are not required:
- [ ] Documentation, if applicable
- [ ] Final review from `@MikeInnes` or `@dhairyagandhi96` (for API changes).
Co-authored-by: Lyndon White <lyndon.white@invenialabs.co.uk>
Co-authored-by: Lyndon White <oxinabox@ucc.asn.au>
1220: CompatHelper: bump compat for "Adapt" to "2.0" r=CarloLucibello a=github-actions[bot]
This pull request changes the compat entry for the `Adapt` package from `1` to `1, 2.0`.
This keeps the compat entries for earlier versions.
Note: I have not tested your package with this new compat entry. It is your responsibility to make sure that your package tests pass before you merge this pull request.
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
1213: Fixing indentation in train! docstring r=CarloLucibello a=natema
One code block is not correctly displayed in the doc of [Flux.Optimise.train!](https://fluxml.ai/Flux.jl/stable/training/training/#Flux.Optimise.train!).
Based on the previous code block, I guess it's an indentation problem.
Co-authored-by: natema <natema@users.noreply.github.com>
1152: extend dataloader r=CarloLucibello a=CarloLucibello
cf. the discussion in #1149. Currently the `DataLoader` interface supports:
1. `for x in DataLoader(X)`
2. `for (x, y) in DataLoader(X, Y)`
This PR adds
3. `for (x,) in DataLoader((X,))`
4. `for (x, y) in DataLoader((X, Y))`
Edit: the constructor in 2. is removed in this PR.
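A minimal usage sketch of the tuple forms, assuming the post-PR interface (variable names are made up):
```julia
using Flux.Data: DataLoader

X = rand(Float32, 10, 100)   # 100 observations of 10 features
Y = rand(Float32, 1, 100)

# form 3: a single-tensor tuple, each batch is a 1-tuple
for (x,) in DataLoader((X,), batchsize=16)
    @assert size(x, 2) <= 16
end

# form 4: multiple tensors batched jointly along the last dimension
for (x, y) in DataLoader((X, Y), batchsize=16)
    @assert size(x, 2) == size(y, 2)
end
```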
Co-authored-by: CarloLucibello <carlo.lucibello@gmail.com>
1129: Added dropgrad in huber_loss r=CarloLucibello a=HenriDeh
Workaround to prevent `iterate(::Nothing)` when working with CuArrays. See issue #1128.
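A hedged sketch of the pattern (not necessarily the exact diff): the boolean mask selecting the quadratic vs. linear branch is wrapped in `Zygote.dropgrad`, so no gradient is taken through it.
```julia
using Zygote

# Illustrative huber loss: dropgrad keeps Zygote from differentiating through
# the branch mask, avoiding the `iterate(::Nothing)` failure seen with CuArrays.
function huber_loss_sketch(ŷ, y; δ = one(eltype(ŷ)))
    abs_error = abs.(ŷ .- y)
    mask = Zygote.dropgrad(abs_error .<= δ)      # no gradient through the mask
    quadratic = 0.5f0 .* abs_error .^ 2
    linear = δ .* (abs_error .- 0.5f0 .* δ)
    sum(quadratic .* mask .+ linear .* (1 .- mask)) / length(y)
end
```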
Co-authored-by: HenriDeh <47037088+HenriDeh@users.noreply.github.com>
1141: Speedup matmul of CuMatrix and OneHotMatrix r=CarloLucibello a=AStupidBear
This solves #189.
```julia
julia> using Flux
julia> using Flux: CuArrays
julia> A = zeros(300, 10000) |> gpu;
julia> B = Flux.onehotbatch(rand(1:10000, 256), 1:10000) |> gpu;
julia> A * B; CuArrays.@time A * B;
┌ Warning: Performing scalar operations on GPU arrays: This is very slow, consider disallowing these operations with `allowscalar(false)`
└ @ GPUArrays ~/shared/.julia/packages/GPUArrays/OXvxB/src/host/indexing.jl:43
0.002824 seconds (951 CPU allocations: 38.156 KiB) (2 GPU allocations: 301.000 KiB, 2.32% gc time of which 46.42% spent allocating)
julia> import Base: *
julia> A::AbstractMatrix * B::Flux.OneHotMatrix = @inbounds A[:, map(x->x.ix, B.data)]
* (generic function with 522 methods)
julia> A * B; CuArrays.@time A * B;
0.000343 seconds (169 CPU allocations: 5.000 KiB) (2 GPU allocations: 301.000 KiB, 15.53% gc time of which 65.97% spent allocating)
```
Co-authored-by: Yao Lu <luyaocns@gmail.com>
1211: Fixing syntax in onehot docstring r=CarloLucibello a=natema
`otherwise, it will error` -> `otherwise, it will raise an error`
Co-authored-by: natema <natema@users.noreply.github.com>
1208: Fixing output format for `onehot` r=dhairyagandhi96 a=natema
Currently `Flux.OneHotVector` is displayed as a binary vector (0/1) rather than a boolean one (true/false). This also shows up in successive examples on the same page.
I fixed the `onehot(:b, [:a, :b, :c])` and `onehot(:c, [:a, :b, :c])` outputs in the first example of the page accordingly.
Co-authored-by: natema <natema@users.noreply.github.com>
1206: Fixing ambiguous remark in Preserve inputs' types r=dhairyagandhi96 a=natema
This PR is based on the [discussion in the forum](https://discourse.julialang.org/t/not-clear-what-0-01f0x-is-in-the-flux-docs/40553?u=mathematics) on the ambiguity of `0.01f0x` in the line
> While one could change the activation function (e.g. to use `0.01f0x`)
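For context, `0.01f0x` is juxtaposition multiplication of the `Float32` literal `0.01f0` with `x`, i.e. `0.01f0 * x`. A minimal illustration (the function name is made up):
```julia
# 0.01f0x parses as 0.01f0 * x, so the slope is a Float32 literal and
# Float32 inputs are not promoted to Float64.
leaky(x) = max(0.01f0x, x)

leaky(-2f0)          # -0.02f0
typeof(leaky(-2f0))  # Float32
```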
Co-authored-by: natema <natema@users.noreply.github.com>
1191: Pull Request Template r=MikeInnes a=MikeInnes
Hopefully this makes it a little clearer what the requirements are, which should lead to easier reviews and encourage things like NEWS.md entries that we want to keep better in sync.
cc @dhairyagandhi96 and @CarloLucibello for thoughts.
Co-authored-by: Mike J Innes <mike.j.innes@gmail.com>
1190: Correcting advanced.md r=dhairyagandhi96 a=Sleort
To make the example consistent, it should be
```
julia> Flux.trainable(a::Affine) = (a.W,)
```
not
```
julia> Flux.trainable(a::Affine) = (a.W, a.b)
```
Co-authored-by: Troels Arnfred Bojesen <tr-ab@online.no>
1185: Add some news r=dhairyagandhi96 a=dhairyagandhi96
cc @CarloLucibello, please add to this list as well.
Co-authored-by: Dhairya Gandhi <dhairya@juliacopmuting.com>
957: Add some gradient checking tests on GPUs r=dhairyagandhi96 a=dhairyagandhi96
Good to add generic tests for tracking gradients through the various layers on the GPU.
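A hedged sketch of the kind of generic check these tests add (layer and data choices are illustrative):
```julia
using Flux, Test

# Move a layer and a batch to the GPU, take gradients w.r.t. its parameters,
# and check that none of them are missing.
m = Dense(10, 5, relu) |> gpu
x = rand(Float32, 10, 4) |> gpu

gs = gradient(() -> sum(m(x)), Flux.params(m))
@test all(p -> gs[p] !== nothing, Flux.params(m))
```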
Co-authored-by: Dhairya Gandhi <dhairya@juliacopmuting.com>
Co-authored-by: Dhairya Gandhi <dhairya@juliacomputing.com>
1165: Fix docstring of logitcrossentropy r=dhairyagandhi96 a=cossio
Since `y` is a logit, there is no log (see the diff).
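For reference, `logitcrossentropy` takes raw scores and applies the log-softmax internally, so it should agree with `crossentropy` on the softmax of those scores; a quick check (not part of the diff):
```julia
using Flux
using Flux: logitcrossentropy, crossentropy, onehotbatch

ŷ = randn(Float32, 3, 5)             # raw scores (logits)
y = onehotbatch(rand(1:3, 5), 1:3)   # one-hot targets

logitcrossentropy(ŷ, y) ≈ crossentropy(softmax(ŷ), y)  # true
```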
Co-authored-by: cossio <cossio@users.noreply.github.com>