Causal Masking
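A mechanism that prevents each position in self-attention from attending to later (future) positions, so that every prediction depends only on the current and earlier tokens. This enforces the autoregressive property in language models.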
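As a rough illustration of the definition, the sketch below builds a lower-triangular mask and applies it before a row-wise softmax. It assumes the common convention of setting disallowed scores to negative infinity so they receive zero weight. The function names `causal_mask` and `masked_softmax` are hypothetical, chosen only for this example, and the uniform scores are placeholder data.

```python
import math

def causal_mask(n):
    """Return an n x n boolean matrix; entry [i][j] is True when
    query position i is allowed to attend to key position j (j <= i)."""
    return [[j <= i for j in range(n)] for i in range(n)]

def masked_softmax(scores, mask):
    """Row-wise softmax over raw attention scores, after setting every
    disallowed entry to -inf so that it ends up with zero weight."""
    weights = []
    for row, allowed in zip(scores, mask):
        masked = [s if ok else -math.inf for s, ok in zip(row, allowed)]
        peak = max(masked)  # always finite: the diagonal is never masked
        exps = [math.exp(s - peak) for s in masked]  # exp(-inf) is 0.0
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

# Placeholder data: uniform raw scores for a 4-token sequence.
scores = [[0.0] * 4 for _ in range(4)]
weights = masked_softmax(scores, causal_mask(4))

for row in weights:
    print([round(w, 3) for w in row])
```

With uniform scores, each row spreads its weight evenly over the current and earlier positions and assigns exactly zero to every later one, which is the property the mask is meant to guarantee.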