Embed size 2 in time series transformer

To @kashif,
After I wrote this, I read your answer in "Time series Prediction: inference process".

You said

What are the learning parameters you refer to (mean, std, seasonal data, trend, etc.)?
And what is the probability distribution you refer to? Specifically, what is on its x-axis?

I think this might be related to my question.
Thx!