How microGPT adapts learning rates per parameter using momentum and squared gradient tracking.
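The description above matches an Adam-style update rule, so a minimal sketch of that rule may help make it concrete. This is an illustration under stated assumptions, not microGPT's actual code: the function name `adam_step`, the flat-list parameter layout, and the hyperparameter defaults (`lr`, `beta1`, `beta2`, `eps`) are all hypothetical choices made for the example.

```python
import math


def adam_step(params, grads, m, v, t, lr=0.01, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam-style update in place to flat lists of floats.

    Hypothetical helper for illustration; names and defaults are assumptions.
    m tracks a running average of gradients (momentum).
    v tracks a running average of squared gradients.
    t is the 1-based step count, used for bias correction.
    """
    for i, g in enumerate(grads):
        # Momentum: exponential moving average of the gradient.
        m[i] = beta1 * m[i] + (1 - beta1) * g
        # Squared-gradient tracking: moving average of g^2.
        v[i] = beta2 * v[i] + (1 - beta2) * g * g
        # Bias correction, since m and v both start at zero.
        m_hat = m[i] / (1 - beta1 ** t)
        v_hat = v[i] / (1 - beta2 ** t)
        # Per-parameter step: large recent gradients shrink the effective rate.
        params[i] -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return params


# Two parameters whose gradients differ by a factor of 100.
params = [1.0, 1.0]
m = [0.0, 0.0]
v = [0.0, 0.0]
adam_step(params, [2.0, 200.0], m, v, t=1)
print(params)
```

On the first step both parameters move by almost exactly `lr`, even though one gradient is 100 times larger than the other. Dividing by the square root of the tracked squared gradient is what normalizes the step size per parameter.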