How microGPT normalizes layer inputs to keep training stable, without learnable parameters.
This lesson requires an active subscription.