-
公开(公告)号:US20230290135A1
公开(公告)日:2023-09-14
申请号:US18119770
申请日:2023-03-09
Applicant: NVIDIA Corporation
Inventor: Daquan Zhou , Zhiding Yu , Enze Xie , Anima Anandkumar , Chaowei Xiao , Jose Manuel Alvarez Lopez
IPC: G06V10/82 , G06V10/77 , G06V10/778 , G06V10/30
CPC classification number: G06V10/82 , G06V10/7715 , G06V10/778 , G06V10/30
Abstract: Apparatuses, systems, and techniques to generate a robust representation of an image. In at least one embodiment, input tokens of an input image are received, and an inference about the input image is generated based on a vision transformer (ViT) system comprising at least one self-attention module to perform token mixing and a channel self-attention module to perform channel processing.