Open
Labels
new method (New algorithm)
Description
- DADVI (paper) pre-samples the Gaussian noise in `init` and is then fully deterministic in `update`, which makes for more stable training. It needs more samples, so it might require gradient accumulation support (Gradient accumulation #52).
- Last-layer deterministic VI (paper) provides a handy deterministic objective with linear last layers for regression and classification. It might be worth adding if we can generalise to exponential family losses and/or linearise the model.
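To make the DADVI idea concrete, here is a minimal sketch in plain Python, not a proposed implementation for this library. It assumes a 1-D mean-field Gaussian approximation and a toy Gaussian log posterior with a hand-written gradient. The dict-based state, `log_post`, `grad_log_post`, `neg_elbo` and all parameter names are hypothetical; only the `init`/`update` split comes from the description above.

```python
import math
import random


def log_post(theta, loc=3.0, scale=0.5):
    # Hypothetical unnormalised log posterior: a 1-D Gaussian target.
    return -0.5 * ((theta - loc) / scale) ** 2


def grad_log_post(theta, loc=3.0, scale=0.5):
    # Hand-derived gradient of log_post with respect to theta.
    return -(theta - loc) / scale**2


def init(mu, log_sigma, n_samples, seed=0):
    # All randomness lives here: the standard normal draws are sampled once
    # and stored in the state, so nothing downstream is stochastic.
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    return {"mu": mu, "log_sigma": log_sigma, "noise": noise}


def neg_elbo(state):
    # Sample-average negative ELBO over the fixed draws. The Gaussian
    # entropy equals log_sigma up to an additive constant.
    sigma = math.exp(state["log_sigma"])
    thetas = [state["mu"] + sigma * z for z in state["noise"]]
    expected_log_post = sum(log_post(t) for t in thetas) / len(thetas)
    return -(expected_log_post + state["log_sigma"])


def update(state, lr=0.05):
    # One deterministic gradient step on neg_elbo, reusing the stored noise.
    mu, log_sigma, noise = state["mu"], state["log_sigma"], state["noise"]
    sigma = math.exp(log_sigma)
    n = len(noise)
    g_mu, g_log_sigma = 0.0, 0.0
    for z in noise:
        g = grad_log_post(mu + sigma * z)
        g_mu += g / n
        g_log_sigma += g * sigma * z / n  # chain rule through exp(log_sigma)
    g_log_sigma += 1.0  # gradient of the entropy term
    # Ascend the ELBO, which is the same as descending neg_elbo.
    return {
        "mu": mu + lr * g_mu,
        "log_sigma": log_sigma + lr * g_log_sigma,
        "noise": noise,
    }


def run(n_steps=200):
    state = init(mu=0.0, log_sigma=0.0, n_samples=64)
    for _ in range(n_steps):
        state = update(state)
    return state


if __name__ == "__main__":
    final = run()
    print(final["mu"], math.exp(final["log_sigma"]), neg_elbo(final))
```

Because the noise is fixed, the objective is an ordinary deterministic function of `mu` and `log_sigma`, so repeated runs from the same state give identical results. The optimum is that of the sample-average objective rather than the exact ELBO, which is why the number of stored draws matters.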