Autoregressive models use their output to arrive at predictions. In machine learning, this amounts to “training on the output”, i.e., generated data. More broadly, intelligent behavior is often accompanied by deep thought or even dreaming between actions.