[Paper Review] ViT : An Image Is Worth 16X16 Words : Transformers For Image Recognition At Scale
·
Paper Review/CV
ViT : An Image Is Worth 16X16 Words : Transformers For Image Recognition At Scale (2022.12.V11)Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn 외 8명https://arxiv.org/abs/2010.11929 오늘은 이미지 분류와 다양한 시각 인식 작업에서 뛰어난 성능을 보인 비전 트랜스포머(Vision Transformer, ViT)를 소개한 논문 An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale를 리뷰해보도록 하겠습니다! 😁 📌 Abstract & Introdu..