Publication No.: US20240038238A1
Publication Date: 2024-02-01
Application No.: US18020851
Filing Date: 2021-08-12
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Lei QIN, Lele ZHANG, Hao LIU, Yuewan LU
IPC: G10L15/25, G06V10/82, G10L15/02, G10L25/24, G10L15/16, G06V40/16, G06T7/50, G06V10/30, G06V10/26, G06V10/80
CPC classification number: G10L15/25, G06V10/82, G10L15/02, G10L25/24, G10L15/16, G06V40/171, G06T7/50, G06V10/30, G06V10/26, G06V10/806, G06T2207/30201, G06T2207/10028, G06T2207/20084
Abstract: Embodiments of this application provide a speech recognition method. The speech recognition method includes: obtaining a facial depth image and a to-be-recognized voice of a user, where the facial depth image is an image collected by using a depth camera; recognizing a mouth shape feature from the facial depth image, and recognizing a voice feature from the to-be-recognized voice; and fusing the voice feature and the mouth shape feature into an audio-video feature, and recognizing, based on the audio-video feature, a voice uttered by the user.
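
Illustrative sketch (not part of the published record): the data flow named in the abstract, two feature extractors followed by a fusion step, can be outlined in Python as below. The abstract does not say how either feature is computed or how the two are fused, so everything here is an assumption made for illustration: the function names `mouth_shape_feature`, `voice_feature`, and `fuse` are hypothetical, the extractors are simple placeholders (row-wise mean depth, per-frame energy), and fusion is assumed to be plain concatenation. The final recognition step is omitted because the abstract gives no detail on the recognizer.

```python
from typing import List, Sequence


def mouth_shape_feature(depth_image: Sequence[Sequence[float]]) -> List[float]:
    """Hypothetical placeholder: mean depth of each row of a mouth-region crop."""
    return [sum(row) / len(row) for row in depth_image]


def voice_feature(samples: Sequence[float], frame_len: int = 4) -> List[float]:
    """Hypothetical placeholder: mean energy of each full, non-overlapping frame."""
    n_frames = len(samples) // frame_len
    return [
        sum(s * s for s in samples[i * frame_len:(i + 1) * frame_len]) / frame_len
        for i in range(n_frames)
    ]


def fuse(voice: Sequence[float], mouth: Sequence[float]) -> List[float]:
    """Assumed fusion: concatenate the two vectors into one audio-video feature."""
    return list(voice) + list(mouth)


# Toy inputs: a 2x2 depth crop and 8 audio samples (two frames of 4).
depth = [[0.5, 1.0], [0.25, 0.75]]
audio = [0.0, 1.0, 0.0, -1.0, 0.5, 0.5, 0.5, 0.5]

fused = fuse(voice_feature(audio), mouth_shape_feature(depth))
print(fused)
```

In this sketch the fused vector simply carries the voice entries first and the mouth-shape entries second; a downstream recognizer would take that vector as its input.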