-
Publication No.: US11908456B2
Publication date: 2024-02-20
Application No.: US17006440
Filing date: 2020-08-28
Inventors: Jimeng Zheng, Yi Gao, Meng Yu, Ian Ernan Liu
CPC classification: G10L15/08, G10L15/22, G10L25/18, G10L25/21, G10L2015/088
Abstract: Embodiments of this application disclose an azimuth estimation method performed at a computing device, the method including: obtaining, in real time, multi-channel sampling signals and buffering the multi-channel sampling signals; performing wakeup word detection on one or more sampling signals of the multi-channel sampling signals, and determining a wakeup word detection score for each channel of the one or more sampling signals; performing a spatial spectrum estimation on the buffered multi-channel sampling signals to obtain a spatial spectrum estimation result, when the wakeup word detection scores of the one or more sampling signals indicate that a wakeup word exists in the one or more sampling signals; and determining an azimuth of a target voice associated with the multi-channel sampling signals according to the spatial spectrum estimation result and a highest wakeup word detection score, thereby improving the accuracy of the azimuth estimation in a voice interaction process.
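Illustrative sketch (not taken from the patent): one possible reading of the final step, in which the spatial spectrum is only consulted once a wakeup word is detected, and the highest-scoring channel then decides which spectrum peak is reported. The per-channel look directions (`beam_directions`), the `threshold` value, and the "nearest peak" rule are all assumptions made for this example; the spectrum itself is taken as given rather than computed.

```python
def angular_distance(a, b):
    """Smallest absolute difference between two angles in degrees."""
    d = abs(a - b) % 360
    return min(d, 360 - d)


def estimate_azimuth(scores, beam_directions, spectrum, threshold=0.5):
    """Return an azimuth in degrees, or None if no wakeup word is detected.

    scores          -- wakeup word detection score per channel
    beam_directions -- assumed look direction (degrees) of each channel
    spectrum        -- dict mapping angle (degrees) to spatial spectrum power,
                       assumed to be computed from the buffered signals
    """
    best = max(range(len(scores)), key=scores.__getitem__)
    if scores[best] < threshold:
        return None  # no wakeup word: skip the spatial spectrum step

    # Circular local maxima of the spatial spectrum.
    angles = sorted(spectrum)
    peaks = [
        a for i, a in enumerate(angles)
        if spectrum[a] >= spectrum[angles[i - 1]]
        and spectrum[a] >= spectrum[angles[(i + 1) % len(angles)]]
    ]
    # Assumed rule: report the peak nearest the highest-scoring channel.
    return min(peaks, key=lambda a: angular_distance(a, beam_directions[best]))
```

In this sketch a louder interfering source can own the global spectrum peak while the reported azimuth still follows the channel in which the wakeup word scored highest.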
-
Publication No.: US11856149B2
Publication date: 2023-12-26
Application No.: US17367268
Filing date: 2021-07-02
Inventors: Yi Gao, Chi Xi, Jingcong Chen, Cheng Luo, Bin Li
IPC classification: H04M7/00, H04M1/72403, H04M1/72454, H04M1/253, H04M1/72406, H04L65/1069
CPC classification: H04M7/0072, H04L65/1069, H04M1/253, H04M1/72403, H04M1/72406, H04M1/72454, H04M2207/20
Abstract: This application discloses a method for establishing a call connection, a first terminal, a server, and a storage medium. The method includes: obtaining, by a first terminal, a second vocoder list of a second terminal, the second vocoder list including vocoders supported by the second terminal and their corresponding priorities; determining, by the first terminal, a first vocoder with the highest priority among vocoders that exist in both a first vocoder list of the first terminal and the second vocoder list, the first vocoder list including vocoders supported by the first terminal and their corresponding priorities, and the priorities of the vocoders being positively correlated with audio frequencies in encoding and decoding of the vocoders; and requesting, by the first terminal by using the first vocoder, to establish a first call connection to the second terminal.
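Illustrative sketch (not taken from the patent): the selection step can be pictured as intersecting the two vocoder lists and keeping the highest-priority entry. The vocoder names are hypothetical, and using the sampling rate in Hz as the priority is an assumption that merely mirrors the stated positive correlation between priority and audio frequency.

```python
def pick_vocoder(first_list, second_list):
    """Return the highest-priority vocoder present in both lists, or None.

    Each list is a dict mapping a vocoder name to its priority
    (larger means preferred). Priorities are read from the first
    terminal's list, which is an assumption of this sketch.
    """
    common = set(first_list) & set(second_list)
    if not common:
        return None
    return max(common, key=lambda name: first_list[name])


# Hypothetical vocoder names; priority is the sampling rate in Hz.
first_terminal = {"codec_48k": 48000, "codec_16k": 16000, "codec_8k": 8000}
second_terminal = {"codec_16k": 16000, "codec_8k": 8000}
chosen = pick_vocoder(first_terminal, second_terminal)
```

Here `chosen` is the widest-band vocoder that both terminals support, which the first terminal would then use when requesting the call connection.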
-
Publication No.: US11341957B2
Publication date: 2022-05-24
Application No.: US16933446
Filing date: 2020-07-20
IPC classification: G10L15/08
Abstract: A method for detecting a keyword, applied to a terminal, includes: extracting a speech eigenvector of a speech signal; obtaining, according to the speech eigenvector, a posterior probability of each target character being a key character in any keyword in an acquisition time period of the speech signal; obtaining confidences of at least two target character combinations according to the posterior probability of each target character; and determining that the speech signal includes the keyword upon determining that all the confidences of the at least two target character combinations meet a preset condition. The target character is a character in the speech signal whose pronunciation matches a pronunciation of the key character. Each target character combination includes at least one target character, and a confidence of a target character combination represents a probability of the target character combination being the keyword or a part of the keyword.
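Illustrative sketch (not taken from the patent): the decision rule requires every target character combination to pass before the keyword is accepted. The confidence formula used here (geometric mean of each character's peak posterior over the frames) and the fixed `threshold` standing in for the preset condition are assumptions chosen for the example.

```python
import math


def combination_confidence(posteriors, chars):
    """Geometric mean of the peak posterior of each character (assumed formula).

    posteriors -- list of per-frame dicts mapping character to posterior
    chars      -- the characters that make up one combination
    """
    peaks = [max(frame[c] for frame in posteriors) for c in chars]
    return math.prod(peaks) ** (1.0 / len(peaks))


def keyword_detected(posteriors, combinations, threshold=0.5):
    """True only if every combination's confidence meets the threshold."""
    return all(
        combination_confidence(posteriors, combo) >= threshold
        for combo in combinations
    )
```

For a two-character keyword, `combinations` might contain the full keyword `("a", "b")` and a partial combination `("a",)`, so that a strong match on only part of the keyword is not enough on its own.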
-
Publication No.: US10425368B2
Publication date: 2019-09-24
Application No.: US15656236
Filing date: 2017-07-21
Inventors: Siyu Xiao, Xiaoyu Yu, Libin Ren, Yongjie Li, Wei Mao, Yi Gao, Mengsha Zhou, Zhenzhen Xu
IPC classification: H04L12/58, G06F17/30, G06F16/951, G06F3/0482
Abstract: A first selectable command is displayed on a User Interface (UI) of the UE, and a first request is sent to a server via the UI, such that the server returns a first message of a non-text type; the first message returned by the server is received, and it is determined that the first message supports display of specified information; a first operation applied to a result displayed by the first message is received, and a second request is sent, via the first operation, to the server to draw random information; in response to the first operation, display of the specified information is triggered, and the random information drawn from the server is received.
-
Publication No.: US20170329565A1
Publication date: 2017-11-16
Application No.: US15664263
Filing date: 2017-07-31
Inventors: Siyu Xiao, Xiaoyu Yu, Mengsha Zhou, Jiongchao Lin, Libin Ren, Yongjie Li, Zheng Dai, Yi Gao, Duokai Huang
IPC classification: G06F3/14, H04L29/08, G06F17/27, G06F3/0484, G06F17/21
Abstract: After a user logs in to a client, a first request of the user is sent to a server, and after the first request is authenticated, a communication connection between the client and the server is established; a system message sent by the server is received in a user login interface to which the user has logged in; the system message is generated by the server to contain at least text-format information capable of being displayed at the client.
-
Publication No.: US11842751B2
Publication date: 2023-12-12
Application No.: US17507761
Filing date: 2021-10-21
Inventors: Yi Gao
IPC classification: G10L25/78, G10L25/21, G10L25/51, H04L65/4038, H04L65/75, H04L65/1069, H04M3/56, H04L65/80
CPC classification: G10L25/78, G10L25/21, G10L25/51, H04L65/1069, H04L65/4038, H04L65/765, H04L65/80, H04M3/568
Abstract: A call method is provided. The method includes: obtaining at least three paths of voice data transmitted by at least three first terminals, the voice data carrying indication information; selecting at least two paths of target voice data from the at least three paths of voice data according to the indication information of the obtained at least three paths of voice data; and transmitting the at least two paths of target voice data to a second terminal, the second terminal being configured to decode the at least two paths of target voice data, mix the decoded at least two paths of target voice data, and play the mixed voice data.
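Illustrative sketch (not taken from the patent): the selection step can be read as ranking the incoming paths by their indication information and forwarding only the top few. Treating the indication as a single numeric value (for example a voice-activity or energy measure) where larger means "more worth forwarding" is an assumption of this example, as is the tuple layout.

```python
def select_target_paths(paths, keep=2):
    """Pick the `keep` paths with the strongest indication value.

    paths -- list of (terminal_id, indication, encoded_payload) tuples;
             `indication` is assumed to be a number where larger wins
    keep  -- number of paths to forward (at least two)
    """
    if len(paths) < 3:
        raise ValueError("expected at least three paths of voice data")
    if keep < 2:
        raise ValueError("at least two target paths must be selected")
    ranked = sorted(paths, key=lambda p: p[1], reverse=True)
    return ranked[:keep]


incoming = [("t1", 0.1, b"..."), ("t2", 0.9, b"..."), ("t3", 0.5, b"...")]
forwarded = select_target_paths(incoming)
```

The payloads are forwarded still encoded; decoding and mixing are left to the second terminal, as the abstract describes.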
-
Publication No.: US11749262B2
Publication date: 2023-09-05
Application No.: US17343746
Filing date: 2021-06-10
Inventors: Yi Gao, Ian Ernan Liu, Min Luo
IPC classification: G10L15/08, G10L21/0208, G10L21/043, G10L15/22
CPC classification: G10L15/08, G10L15/22, G10L21/0208, G10L21/043, G10L2015/088, G10L2021/02082
Abstract: A keyword detection method includes: obtaining an enhanced speech signal of a to-be-detected speech signal, the enhanced speech signal corresponding to a target speech speed; performing speed adjustment on the enhanced speech signal to obtain a first speed-adjusted speech signal having a first speech speed, the first speech speed being different from the target speech speed; obtaining a first speech feature signal according to the first speed-adjusted speech signal; obtaining a detection result according to a first keyword detection result corresponding to the first speech feature signal, the detection result indicating whether a target keyword exists in the to-be-detected speech signal; and performing an operation corresponding to the target keyword in response to determining that the target keyword exists according to the detection result.
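Illustrative sketch (not taken from the patent): the idea of running detection on a speed-adjusted copy of the signal can be shown with a deliberately naive speed change. Linear-interpolation resampling is used here only because it is short; it also shifts pitch, which a real time-scale modification method would avoid. The `detector` callable, the `rates` values, and the "any variant fires" combination rule are assumptions.

```python
def change_speed(samples, rate):
    """Naively play `samples` `rate` times faster via linear interpolation."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    if not samples:
        return []
    n_out = max(1, int(len(samples) / rate))
    out = []
    for i in range(n_out):
        pos = i * rate                       # fractional read position
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


def detect_keyword(samples, detector, rates=(0.8, 1.25)):
    """Report a keyword if the assumed `detector` fires on any adjusted copy."""
    return any(detector(change_speed(samples, r)) for r in rates)
```

A caller would pass its own keyword model as `detector`; speakers who talk faster or slower than the model expects then get a second chance at a speed closer to the training data.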
-
Publication No.: US20210266664A1
Publication date: 2021-08-26
Application No.: US17319024
Filing date: 2021-05-12
Inventors: Jimeng Zheng, Yi Gao, Xuan Ji, Weiwei Li, Meng Yu, Kai Xia, Jun Feng, Zhu Chen, Hongyang Chen, Wenbin Yang, Yu Wang, Yong Liu
IPC classification: H04R3/00
Abstract: This application discloses a sound acquisition component array, including: two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components. The two second sound acquisition components are located at a first side of a line connecting the two first sound acquisition components, and the two third sound acquisition components are located at a second side of the connecting line that is opposite to the first side of the connecting line; the two second sound acquisition components are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components are symmetrical about the perpendicular bisector; and a distance between the two first sound acquisition components, a distance between the two second sound acquisition components, and a distance between the two third sound acquisition components are respectively different from one another along a direction defined by the connecting line.
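Illustrative sketch (not taken from the patent): the geometric constraints can be checked numerically once the array is placed in a coordinate frame. The frame used here is an assumption: the two first components lie on the x-axis, centered on the origin, so the perpendicular bisector of the connecting line is the y-axis.

```python
def is_valid_layout(first, second, third, tol=1e-9):
    """Check the three pair constraints for (x, y) component positions.

    Each argument is a pair of (x, y) points. The first pair is assumed
    to lie on the x-axis, centered on the origin.
    """
    def mirrored(pair):
        # Symmetric about the y-axis: (x, y) and (-x, y).
        (x1, y1), (x2, y2) = pair
        return abs(x1 + x2) <= tol and abs(y1 - y2) <= tol

    def spacing(pair):
        # Distance between the two components along the connecting line.
        return abs(pair[0][0] - pair[1][0])

    on_axis = all(abs(y) <= tol for _, y in first)
    if not (on_axis and mirrored(first) and mirrored(second) and mirrored(third)):
        return False
    # Second and third pairs must sit on opposite sides of the line.
    opposite_sides = second[0][1] * third[0][1] < 0
    d1, d2, d3 = spacing(first), spacing(second), spacing(third)
    distinct = min(abs(d1 - d2), abs(d1 - d3), abs(d2 - d3)) > tol
    return opposite_sides and distinct
```

For example, pairs at x = ±3 on the line, x = ±2 above it, and x = ±1 below it satisfy all three conditions, giving three different spacings along the direction of the connecting line.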
-
Publication No.: US10795629B2
Publication date: 2020-10-06
Application No.: US15664263
Filing date: 2017-07-31
Inventors: Siyu Xiao, Xiaoyu Yu, Mengsha Zhou, Jiongchao Lin, Libin Ren, Yongjie Li, Zheng Dai, Yi Gao, Duokai Huang
IPC classification: G06F9/48, G06F3/14, H04L29/06, H04L12/00, G06F9/451, G06F21/31, H04L29/08, G06F40/103, G06F40/205, G06F3/0484
Abstract: After a user logs in to a client, a first request of the user is sent to a server, and after the first request is authenticated, a communication connection between the client and the server is established; a system message sent by the server is received in a user login interface to which the user has logged in; the system message is generated by the server to contain at least text-format information capable of being displayed at the client.
-
Publication No.: US10554805B2
Publication date: 2020-02-04
Application No.: US15656155
Filing date: 2017-07-21
Inventors: Siyu Xiao, Xiaoyu Yu, Mengsha Zhou, Jiongchao Lin, Libin Ren, Yongjie Li, Yi Gao
Abstract: An association logic that associates a non-text message type with specified information is established. The association logic includes at least an identification of the non-text message type, allowing a message type in line with the association logic to be identified by the identification. A first message is monitored. A first identification corresponding to the first message is obtained by analysing the first message. It is detected, according to the first identification, whether the first message is of the message type in line with the association logic. It is determined that the first message supports display of the specified information when it is detected that the first message is of the message type in line with the association logic.
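Illustrative sketch (not taken from the patent): the association logic can be pictured as a lookup table keyed by message-type identification. The type names, the `"type"` field of the message, and the dictionary representation are all hypothetical choices for this example.

```python
# Hypothetical association logic: non-text message type -> specified information.
ASSOCIATION_LOGIC = {
    "card": "promotional banner",
    "voucher": "reward animation",
}


def extract_identification(message):
    """Analyse a message and return its type identification (assumed field)."""
    return message.get("type")


def supports_specified_info(message, logic=ASSOCIATION_LOGIC):
    """True if the message's type is registered in the association logic."""
    return extract_identification(message) in logic
```

A monitored message such as `{"type": "card", "body": "..."}` would be reported as supporting display of the specified information, while a plain `{"type": "text"}` message would not.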