Try this exercise. Fill in the missing part by typing it in.
Masking future positions in the decoder preserves the model’s __________ property (it generates one token at a time, using only what’s already generated).
Write the missing line below.

Masking future positions in the decoder preserves the model’s __________ property (it generates one token at a time, using only what’s already generated).
Write the missing line below.
