Hacker News with Generative AI: Vision Transformers

Your ViT Is Secretly an Image Segmentation Model (arxiv.org)
Vision Transformers (ViTs) have shown remarkable performance and scalability across various computer vision tasks.

Computer Vision, Vision Transformers, Image Segmentation, Artificial Intelligence

10 points by lamename 438 days ago | 0 comments

The Speed of ViTs and CNNs (eyer.be)
It is often stated that because of the quadratic self-attention, ViTs aren't practical at higher resolution.

Computer Vision, Artificial Intelligence, Deep Learning, CNNs, Vision Transformers

74 points by jxmorris12 441 days ago | 23 comments

Vision Transformers, Computer Vision, Machine Learning, Deep Learning

171 points by cscurmudgeon 796 days ago | 23 comments