Patent search: ap:("Samsung Electronics Co., Ltd.") AND inv:"Kai WANG"

1.

Invention application
METHOD AND APPARATUS WITH IMAGE PROCESSING (in force)

Publication (announcement) No.: US20250005961A1

Publication (announcement) date: 2025-01-02

Application No.: US18756803

Filing date: 2024-06-27

Applicant: Samsung Electronics Co., Ltd.

Inventor: Jingzhi LI, Kai WANG, Zidong GUO, Jiwon BAEK, Seungju HAN

IPC: G06V40/16, G06T3/40, G06T7/50

Abstract: A processor-implemented method with image processing includes detecting facial keypoints from an input face image, determining a face area of the input face image and a facial feature area of the input face image based on the facial keypoints, and determining the input face image to be an invalid face image in response to the facial feature area satisfying a first preset condition, wherein the first preset condition comprises either one or both of a shape condition regarding a shape of the facial feature area, and a position condition regarding a relationship between a position of the facial feature area and a position of the face area.
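
Illustrative sketch (not taken from the patent): the abstract does not say how the areas or the preset condition are computed, so the Python below is only one possible reading. It assumes both areas are axis-aligned bounding boxes of keypoints, that the shape condition is an aspect-ratio range, and that the position condition is a limit on how far the feature-area center may sit from the face-area center. The function names and all thresholds (`min_aspect`, `max_aspect`, `max_center_offset`) are hypothetical.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def bounding_box(points: Sequence[Point]) -> Box:
    """Axis-aligned bounding box of a set of keypoints."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))


def is_invalid_face(
    face_keypoints: Sequence[Point],
    feature_keypoints: Sequence[Point],
    min_aspect: float = 0.5,         # hypothetical threshold
    max_aspect: float = 2.0,         # hypothetical threshold
    max_center_offset: float = 0.25  # hypothetical, fraction of face size
) -> bool:
    """Return True if the feature area meets the (assumed) shape or position condition."""
    face = bounding_box(face_keypoints)
    feat = bounding_box(feature_keypoints)
    face_w, face_h = face[2] - face[0], face[3] - face[1]
    feat_w, feat_h = feat[2] - feat[0], feat[3] - feat[1]

    # Assumption: degenerate (zero-size) areas are treated as invalid.
    if min(face_w, face_h, feat_w, feat_h) <= 0:
        return True

    # Assumed shape condition: feature-area aspect ratio outside a range.
    aspect = feat_w / feat_h
    shape_condition = not (min_aspect <= aspect <= max_aspect)

    # Assumed position condition: feature-area center too far from the
    # face-area center, measured relative to the face size.
    dx = abs((feat[0] + feat[2]) / 2 - (face[0] + face[2]) / 2) / face_w
    dy = abs((feat[1] + feat[3]) / 2 - (face[1] + face[3]) / 2) / face_h
    position_condition = dx > max_center_offset or dy > max_center_offset

    # "Either one or both" of the conditions marks the image as invalid.
    return shape_condition or position_condition
```

Under these assumptions, a frontal face with centered eyes, nose, and mouth passes, while a strongly turned face whose feature keypoints collapse into a narrow strip near one edge is flagged as invalid.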

2.

Invention application
DEVICE AND METHOD WITH TARGET SPEAKER IDENTIFICATION (in force)

Publication (announcement) No.: US20230100259A1

Publication (announcement) date: 2023-03-30

Application No.: US17951585

Filing date: 2022-09-23

Applicant: Samsung Electronics Co., Ltd.

Inventor: Kai WANG, Xiaolei ZHANG, Miao ZHANG

IPC: G10L15/02, G10L25/78, G10L25/51, G10L25/30

Abstract: A processor-implemented method includes: extracting a target speaker voice feature based on an input voice of a target speaker; determining an utterance scenario of the input voice based on the target speaker voice feature; generating a final target speaker voice feature based on the determined utterance scenario; and determining whether the target speaker corresponds to a user based on the final target speaker voice feature and a final user voice feature, wherein the determined utterance scenario comprises either one of a single-speaker scenario and a multiple-speaker scenario.
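
Illustrative sketch (not taken from the patent): the abstract gives only the order of the steps, so the Python below shows the control flow and leaves the models as caller-supplied functions. `classify_scenario` and the per-scenario `refine` functions are hypothetical placeholders, and the final comparison uses cosine similarity with a hypothetical `threshold`, which is a common choice for comparing speaker embeddings but is an assumption here.

```python
import math
from enum import Enum
from typing import Callable, Dict, Sequence

Feature = Sequence[float]


class Scenario(Enum):
    SINGLE_SPEAKER = "single"
    MULTIPLE_SPEAKER = "multiple"


def cosine_similarity(a: Feature, b: Feature) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_target_the_user(
    target_feature: Feature,
    final_user_feature: Feature,
    classify_scenario: Callable[[Feature], Scenario],       # hypothetical model
    refine: Dict[Scenario, Callable[[Feature], Feature]],   # hypothetical per-scenario step
    threshold: float = 0.7,                                 # hypothetical threshold
) -> bool:
    # Step 2 of the abstract: determine the utterance scenario from the feature.
    scenario = classify_scenario(target_feature)
    # Step 3: generate the final target speaker feature for that scenario.
    final_target_feature = refine[scenario](target_feature)
    # Step 4: compare against the final user feature (assumed: cosine similarity).
    return cosine_similarity(final_target_feature, final_user_feature) >= threshold
```

Passing the scenario classifier and refinement steps in as arguments keeps the sketch honest about what the abstract leaves unspecified: any real system would substitute its own trained models for those callables.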
