Support model "buffers" #52

Buffers are values that belong to a model and are serialized with it, but are not updated via gradient descent. A good example that already exists here is the running mean or running variance in batch normalisation. A buffer could also be completely fixed, i.e. never updated at all.

The current working solution is to add Variables that don't need gradients within `initialize_parameters`. We also need to make sure optimizers skip anything with `requires_grad=false`.
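As a rough illustration of the optimizer side of this, here is a minimal, library-independent sketch in Python. The `Variable` and `SGD` classes, their fields, and the `state` helper are hypothetical stand-ins invented for this example, not this project's actual API. The only idea taken from the issue is that anything with `requires_grad` turned off is skipped during the update while still being available for serialization.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Variable:
    """Hypothetical stand-in for a Variable; not the library's real type."""
    value: float
    grad: float = 0.0
    requires_grad: bool = True


class SGD:
    """Toy optimizer that leaves variables with requires_grad=False untouched."""

    def __init__(self, variables: List[Variable], lr: float = 0.1):
        self.variables = variables
        self.lr = lr

    def step(self) -> None:
        for v in self.variables:
            # Buffers (requires_grad=False) are skipped, even if a grad is set.
            if not v.requires_grad:
                continue
            v.value -= self.lr * v.grad


def state(named: Dict[str, Variable]) -> Dict[str, float]:
    """Collect every value for serialization, buffers included."""
    return {name: v.value for name, v in named.items()}


# A trainable weight and a buffer such as a batch-norm running mean.
weight = Variable(value=1.0, grad=0.5)
running_mean = Variable(value=3.0, grad=0.5, requires_grad=False)

opt = SGD([weight, running_mean], lr=0.1)
opt.step()

saved = state({"weight": weight, "running_mean": running_mean})
```

Checking the flag inside `step` rather than filtering once in the constructor means a variable that is frozen later is also skipped. Whether that behaviour is wanted here is a design choice left open by this sketch.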
