-
公开(公告)号:US20230048330A1
公开(公告)日:2023-02-16
申请号:US17976339
申请日:2022-10-28
Applicant: Huawei Technologies Co., Ltd.
Inventor: Youjia Huang , Weiran Nie , Yi Gao
IPC: G10L15/22 , G10L15/08 , B60R16/037 , G06F3/01
Abstract: An in-vehicle speech interaction method and a device are provided. The method includes: obtaining user speech information; determining a user instruction based on the user speech information; determining, based on the user instruction, whether response content to the user instruction is privacy-related; and determining, based on whether the response content is privacy-related, whether to output the response content in a privacy protection mode, to protect privacy from being leaked.
-
公开(公告)号:US20200258499A1
公开(公告)日:2020-08-13
申请号:US16861856
申请日:2020-04-29
Applicant: Huawei Technologies Co., Ltd.
Inventor: Weiran Nie , Hai Yu
IPC: G10L15/06 , G10L15/02 , G10L15/187
Abstract: A filtering model training method includes obtaining N original syllables, obtaining N recognized syllables, and obtaining N syllable distances based on the N original syllables and the N recognized syllables, where the N syllable distances are in a one-to-one correspondence with N syllable pairs, the N original syllables and the N recognized syllables form the N syllable pairs, each syllable pair includes an original syllable and a recognized syllable that correspond to each other, and each syllable distance is used to indicate a similarity between an original syllable and a recognized syllable that are included in a corresponding syllable pair.
-
公开(公告)号:US12087289B2
公开(公告)日:2024-09-10
申请号:US17539005
申请日:2021-11-30
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
Inventor: Weiran Nie , Fuliang Weng , Youjia Huang , Hai Yu , Shumang Hu
IPC: G10L15/18 , G10L15/187 , G10L15/22 , G10L15/06 , G10L15/065 , G10L15/07 , G10L15/08
CPC classification number: G10L15/1822 , G10L15/18 , G10L15/187 , G10L15/22 , G10L15/063 , G10L15/065 , G10L15/07 , G10L2015/088
Abstract: A speech recognition method, apparatus, and device, and a computer-readable storage medium provided pertain to the field of artificial intelligence technologies. The method includes: obtaining or generating a dynamic target language model based on reply information of a first intent, where the dynamic target language model includes a front-end part and a core part; obtaining a speech signal, parsing the speech signal to generate a key word; and invoking the dynamic target language model to determine a second intent and a service content. The front-end part of the dynamic target language model parses out the second intent based on the key word, and the core part of the dynamic target language model parses out the service content based on the key word. The speech recognition method prevents a provided service content from deviating from a user requirement and achieves a good recognition effect.
-
公开(公告)号:US11922935B2
公开(公告)日:2024-03-05
申请号:US17179764
申请日:2021-02-19
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zijuan Shi , Weiran Nie
CPC classification number: G10L15/22 , G10L15/1815
Abstract: A voice interaction method, where a service type set on which a user has a voice interaction intention is predicted based on a target event that can trigger voice interaction, and when a service type of a first service expressed by a voice instruction is a target service type in the service type set, the first service is executed.
-
公开(公告)号:US20220310095A1
公开(公告)日:2022-09-29
申请号:US17838500
申请日:2022-06-13
Applicant: Huawei Technologies Co., Ltd.
Inventor: Yi Gao , Weiran Nie , Youjia Huang
Abstract: A speech detection method includes performing recognition on a photographed face image using a model to predict whether a user intends to continue speaking, and to determine whether a collected audio signal is a speech end point with reference to a prediction result.
-
公开(公告)号:US20240312452A1
公开(公告)日:2024-09-19
申请号:US18673609
申请日:2024-05-24
Applicant: Huawei Technologies Co., Ltd.
Inventor: Yi Gao , Weiran Nie
IPC: G10L15/18 , G10L15/183
CPC classification number: G10L15/1815 , G10L15/183
Abstract: A speech recognition method includes obtaining audio data, where the audio data includes a plurality of audio frames; extracting sound categories of the plurality of audio frames and semantics; and obtaining a speech ending point of the audio data based on the sound categories and the semantics.
-
公开(公告)号:US11211052B2
公开(公告)日:2021-12-28
申请号:US16861856
申请日:2020-04-29
Applicant: Huawei Technologies Co., Ltd.
Inventor: Weiran Nie , Hai Yu
IPC: G10L15/02 , G10L15/04 , G10L15/08 , G10L15/06 , G10L15/187
Abstract: A filtering model training method includes obtaining N original syllables, obtaining N recognized syllables, and obtaining N syllable distances based on the N original syllables and the N recognized syllables, where the N syllable distances are in a one-to-one correspondence with N syllable pairs, the N original syllables and the N recognized syllables form the N syllable pairs, each syllable pair includes an original syllable and a recognized syllable that correspond to each other, and each syllable distance is used to indicate a similarity between an original syllable and a recognized syllable that are included in a corresponding syllable pair.
-
公开(公告)号:US12094468B2
公开(公告)日:2024-09-17
申请号:US17838500
申请日:2022-06-13
Applicant: Huawei Technologies Co., Ltd.
Inventor: Yi Gao , Weiran Nie , Youjia Huang
CPC classification number: G10L15/25 , G06V10/82 , G06V40/172 , G10L15/05 , G10L15/26
Abstract: A speech detection method includes performing recognition on a photographed face image using a model to predict whether a user intends to continue speaking, and to determine whether a collected audio signal is a speech end point with reference to a prediction result.
-
公开(公告)号:US20240126503A1
公开(公告)日:2024-04-18
申请号:US18397864
申请日:2023-12-27
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yubo Xiao , Weiran Nie , Dongxing Xu
IPC: G06F3/16
CPC classification number: G06F3/167
Abstract: This application provides an interface control method. The method includes: obtaining a speech instruction of a user and a sound source location of the user; obtaining line-of-sight information of the user; determining a target window on an interface based on the sound source location and the line-of-sight information; and controlling the target window based on the speech instruction. According to the interface control method in this application, collaborative decision-making is performed with reference to multimode information such as sound source information, line-of-sight tracking information, speech semantic information, and priorities thereof, so that page content in a plurality of windows on the interface is quickly and accurately controlled, to improve user experience.
-
公开(公告)号:US20220093087A1
公开(公告)日:2022-03-24
申请号:US17539005
申请日:2021-11-30
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
Inventor: Weiran Nie , Fuliang Weng , Youjia Huang , Hai Yu , Shumang Hu
IPC: G10L15/18 , G10L15/187
Abstract: A speech recognition method, apparatus, and device, and a computer-readable storage medium provided pertain to the field of artificial intelligence technologies. The method includes: obtaining or generating a dynamic target language model based on reply information of a first intent, where the dynamic target language model includes a front-end part and a core part; obtaining a speech signal, parsing the speech signal to generate a key word; and invoking the dynamic target language model to determine a second intent and a service content. The front-end part of the dynamic target language model parses out the second intent based on the key word, and the core part of the dynamic target language model parses out the service content based on the key word. The speech recognition method prevents a provided service content from deviating from a user requirement and achieves a good recognition effect.
-
-
-
-
-
-
-
-
-