Passer au contenu principal
Publication

On the Convergence of Encoder-only Shallow Transformers