METHOD AND APPARATUS WITH IMAGE PROCESSING

    Publication No.: US20250005961A1

    Publication date: 2025-01-02

    Application No.: US18756803

    Application date: 2024-06-27

    Abstract: A processor-implemented method with image processing includes detecting facial keypoints from an input face image, determining a face area of the input face image and a facial feature area of the input face image based on the facial keypoints, and determining the input face image to be an invalid face image in response to the facial feature area satisfying a first preset condition, wherein the first preset condition comprises either one or both of a shape condition regarding a shape of the facial feature area, and a position condition regarding a relationship between a position of the facial feature area and a position of the face area.
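The validity check described in this abstract can be illustrated with a minimal sketch. It is not the claimed implementation: keypoint detection is assumed to have been done elsewhere, both areas are approximated as axis-aligned bounding boxes, and the function names, the aspect-ratio thresholds, and the "feature centre must lie inside the face box" rule are all hypothetical choices made only for illustration.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def bounding_box(points: List[Point]) -> Box:
    """Axis-aligned box around a set of keypoints (an assumed area model)."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))


def shape_condition(feature: Box, min_aspect: float = 0.2,
                    max_aspect: float = 5.0) -> bool:
    """True when the feature box is degenerate or implausibly elongated.

    The thresholds are hypothetical; the abstract does not specify them.
    """
    width = feature[2] - feature[0]
    height = feature[3] - feature[1]
    if width <= 0 or height <= 0:
        return True
    aspect = width / height
    return aspect < min_aspect or aspect > max_aspect


def position_condition(feature: Box, face: Box) -> bool:
    """True when the centre of the feature box lies outside the face box."""
    cx = (feature[0] + feature[2]) / 2
    cy = (feature[1] + feature[3]) / 2
    inside = face[0] <= cx <= face[2] and face[1] <= cy <= face[3]
    return not inside


def is_invalid_face(face_points: List[Point],
                    feature_points: List[Point]) -> bool:
    """Flag the image as invalid if either assumed condition is satisfied."""
    face = bounding_box(face_points)
    feature = bounding_box(feature_points)
    return shape_condition(feature) or position_condition(feature, face)
```

For example, with face contour keypoints spanning a 100 x 120 box, an eye whose keypoints sit well inside that box is accepted, while an eye whose keypoints fall outside it, or collapse into a thin sliver, marks the image as invalid.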

    DEVICE AND METHOD WITH TARGET SPEAKER IDENTIFICATION

    Publication No.: US20230100259A1

    Publication date: 2023-03-30

    Application No.: US17951585

    Application date: 2022-09-23

    Abstract: A processor-implemented method includes: extracting a target speaker voice feature based on an input voice of a target speaker; determining an utterance scenario of the input voice based on the target speaker voice feature; generating a final target speaker voice feature based on the determined utterance scenario; and determining whether the target speaker corresponds to a user based on the final target speaker voice feature and a final user voice feature, wherein the determined utterance scenario comprises either one of a single-speaker scenario and a multiple-speaker scenario.
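The control flow in this abstract can be sketched as follows. This is only an illustration under stated assumptions: voice features are taken to be plain numeric vectors extracted elsewhere, the scenario classifier and the per-scenario refinement functions are hypothetical callables supplied by the caller, and cosine similarity with a fixed threshold is an assumed comparison rule that the abstract does not name.

```python
import math
from enum import Enum
from typing import Callable, Dict, Sequence

Feature = Sequence[float]


class Scenario(Enum):
    SINGLE_SPEAKER = "single"
    MULTIPLE_SPEAKER = "multiple"


def cosine_similarity(a: Feature, b: Feature) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def verify_speaker(
    target_feature: Feature,
    final_user_feature: Feature,
    classify_scenario: Callable[[Feature], Scenario],   # hypothetical
    refine: Dict[Scenario, Callable[[Feature], Feature]],  # hypothetical
    threshold: float = 0.7,  # assumed value, not from the abstract
) -> bool:
    """Decide whether the target speaker corresponds to the enrolled user."""
    # Step 1: determine the utterance scenario from the target feature.
    scenario = classify_scenario(target_feature)
    # Step 2: generate the final target feature for that scenario.
    final_target_feature = refine[scenario](target_feature)
    # Step 3: compare the final target feature with the final user feature.
    score = cosine_similarity(final_target_feature, final_user_feature)
    return score >= threshold
```

Passing the classifier and refinement steps as callables keeps the sketch agnostic about how the scenario is actually detected or how the final feature is generated, since the abstract leaves both unspecified.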
