Skip to main content
Publication

On the Convergence of Encoder-only Shallow Transformers