-
公开(公告)号:US20250157230A1
公开(公告)日:2025-05-15
申请号:US18635249
申请日:2024-04-15
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Dongwook LEE , Chanho EOM , Byung In YOO , Hyunjeong LEE
Abstract: A method for (3D) object detection includes: receiving an input image with respect to a 3D space, an input point cloud with respect to the 3D space, and an input language with respect to a target object in the 3D space; using an encoding model to generate candidate image features of partial areas of the input image, a point cloud feature of the input point cloud, and a linguistic feature of the input language; selecting a target image feature corresponding to the linguistic feature from among the candidate image features based on similarity scores of similarities between the candidate image features and the linguistic feature; generating a decoding output by executing a multi-modal decoding model based on the target image feature and the point cloud feature; and detecting a 3D bounding box corresponding to the target object by executing an object detection model based on the decoding output.