-
公开(公告)号:US11507975B2
公开(公告)日:2022-11-22
申请号:US15976542
申请日:2018-05-10
Inventor: Bing Gong , Min Luo , Jianchen Zhu , Li Huang
IPC: G06Q30/02 , G06F16/248 , G06F16/28 , G06F16/9535 , G06F16/2457 , G06F16/954 , G06F16/9536
Abstract: A method of generating content information that matches at least one stored keyword is described. At least one keyword associated with content information is stored. At least one previously searched keyword in a search record is matched with the at least one stored keyword associated with the content information. First-category mapping data is generated based on a first mapping between the matched at least one stored keyword and the at least one previously searched keyword. Second-category mapping data is generated based on the content information and the at least one stored keyword. A received target keyword is determined to be included in the first-category mapping data. In response to the received target keyword, which is included in the first-category mapping data, circuitry of a terminal searches for the content information associated with the target keyword in the second-category mapping data and displays the content information.
-
公开(公告)号:US10984059B2
公开(公告)日:2021-04-20
申请号:US15949796
申请日:2018-04-10
Inventor: Bin Huang , Xun Luo , Jianchen Zhu , Min Luo , Shanmin Tang , Yongsheng Liu
IPC: G06F16/9535 , G06F16/22 , G06F16/955 , G06Q30/02 , G06Q50/00
Abstract: A method for data retrieval is described. Interface circuitry of an information processing apparatus receives a request for data retrieval from a database. The database stores content sharing information in a social network. The request includes a first user identifier and a first link identifier. The processing circuitry determines whether the first user identifier and the first link identifier are associated in the database as a consequence of a previous sharing of a first article corresponding to the first link identifier using the first user identifier. Further, when the first user identifier and the first link identifier are determined to be associated, the processing circuitry searches for a first message identifier in the database. The first message identifier identifies a first message that includes information of the previous sharing of the first article. The processing circuitry then retrieves the first message according to the first message identifier.
-
公开(公告)号:US20200051549A1
公开(公告)日:2020-02-13
申请号:US16655548
申请日:2019-10-17
Inventor: Lianwu Chen , Meng Yu , Min Luo , Dan Su
Abstract: Embodiments of the present invention provide a speech signal processing model training method, an electronic device and a storage medium. The embodiments of the present invention determines a target training loss function based on a training loss function of each of one or more speech signal processing tasks; inputs a task input feature of each speech signal processing task into a starting multi-task neural network, and updates model parameters of a shared layer and each of one or more task layers of the starting multi-task neural network corresponding to the one or more speech signal processing tasks by minimizing the target training loss function as a training objective, until the starting multi-task neural network converges, to obtain a speech signal processing model.
-
公开(公告)号:US11217229B2
公开(公告)日:2022-01-04
申请号:US16921537
申请日:2020-07-06
Inventor: Yi Gao , Ji Meng Zheng , Meng Yu , Min Luo
Abstract: A speech recognition method, apparatus, a computer device and an electronic device for recognizing speech. The method includes receiving an audio signal obtained by a microphone array; performing a beamforming processing on the audio signal in a plurality of target directions to obtain a plurality of beam signals; performing a speech recognition on each of the plurality of beam signals to obtain a plurality of speech recognition results corresponding to the plurality of beam signals; and determining a speech recognition result of the audio signal based on the plurality of speech recognition results of the plurality of beam signals.
-
5.
公开(公告)号:US12087290B2
公开(公告)日:2024-09-10
申请号:US16941503
申请日:2020-07-28
Inventor: Jingliang Bai , Caisheng Ouyang , Haikang Liu , Lianwu Chen , Qi Chen , Yulu Zhang , Min Luo , Dan Su
IPC: G10L15/183 , G10L15/06 , G10L15/22 , G10L15/30 , G10L21/0232 , G10L25/21 , G10L25/84 , G10L25/78
CPC classification number: G10L15/183 , G10L15/063 , G10L15/22 , G10L15/30 , G10L21/0232 , G10L25/21 , G10L25/84 , G10L2015/0636 , G10L2025/783
Abstract: A data processing method based on simultaneous interpretation, applied to a server in a simultaneous interpretation system, including: obtaining audio transmitted by a simultaneous interpretation device; processing the audio by using a simultaneous interpretation model to obtain an initial text; transmitting the initial text to a user terminal; receiving a modified text fed back by the user terminal, the modified text being obtained after the user terminal modifies the initial text; and updating the simultaneous interpretation model according to the initial text and the modified text.
-
公开(公告)号:US11430428B2
公开(公告)日:2022-08-30
申请号:US17016573
申请日:2020-09-10
Inventor: Lianwu Chen , Jingliang Bai , Min Luo
Abstract: The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment.
-
公开(公告)号:US11341957B2
公开(公告)日:2022-05-24
申请号:US16933446
申请日:2020-07-20
IPC: G10L15/08
Abstract: A method for detecting a keyword, applied to a terminal, includes: extracting a speech eigenvector of a speech signal; obtaining, according to the speech eigenvector, a posterior probability of each target character being a key character in any keyword in an acquisition time period of the speech signal; obtaining confidences of at least two target character combinations according to the posterior probability of each target character; and determining that the speech signal includes the keyword upon determining that all the confidences of the at least two target character combinations meet a preset condition. The target character is a character in the speech signal whose pronunciation matches a pronunciation of the key character. Each target character combination includes at least one target character, and a confidence of a target character combination represents a probability of the target character combination being the keyword or a part of the keyword.
-
8.
公开(公告)号:US11797084B2
公开(公告)日:2023-10-24
申请号:US17323827
申请日:2021-05-18
Inventor: Zheng Zhou , Xing Ji , Yitong Wang , Xiaolong Zhu , Min Luo
IPC: G06F3/01 , G06K9/62 , G06T5/00 , G06V10/82 , G06V40/00 , G06V40/18 , G06F18/214 , G06V40/19 , G06F18/241
CPC classification number: G06F3/013 , G06F18/214 , G06T5/002 , G06T5/009 , G06V10/82 , G06V40/00 , G06V40/18 , G06F18/241 , G06V40/19 , G06V40/193 , G06V40/197
Abstract: This application discloses a method for training a gaze tracking model, including: obtaining a training sample set; processing the eye sample images in the training sample set by using an initial gaze tracking model to obtain a predicted gaze vector of each eye sample image; determining a model loss according to a cosine distance between the predicted gaze vector and the labeled gaze vector for each eye sample image; and iteratively adjusting one or more reference parameters of the initial gaze tracking model until the model loss meets a convergence condition, to obtain a target gaze tracking model. According to the solution provided in this application, a gaze tracking procedure is simplified, a difference between a predicted value and a labeled value can be better represented by using the cosine distance as a model loss to train a model, to improve prediction accuracy of the gaze tracking model.
-
公开(公告)号:US12033621B2
公开(公告)日:2024-07-09
申请号:US17231945
申请日:2021-04-15
Inventor: Dan Su , Tianxiao Fu , Min Luo , Qi Chen , Yulu Zhang , Lin Luo
IPC: G10L15/187 , G10L15/00 , G10L15/02 , G10L15/06 , G10L15/22
CPC classification number: G10L15/187 , G10L15/005 , G10L15/02 , G10L15/063 , G10L15/22 , G10L2015/025
Abstract: A method for speech recognition based on language adaptivity comprises obtaining voice data of a user. The method also comprises extracting, based on the obtained voice data, a phoneme feature representing pronunciation phoneme information. The phoneme feature is input to a pre-trained language discrimination model that is pre-trained based on a multilingual corpus. A language discrimination result corresponding to the phoneme feature and in accordance with the language discrimination model is obtained. The method also comprises obtaining a speech recognition result of the voice data based on a language acoustic model of a language corresponding to the language discrimination result. The method further comprises determining a speech recognition result of the voice data based on a language acoustic model of a language corresponding to the language discrimination result.
-
公开(公告)号:US11749262B2
公开(公告)日:2023-09-05
申请号:US17343746
申请日:2021-06-10
Inventor: Yi Gao , Ian Ernan Liu , Min Luo
IPC: G10L15/08 , G10L21/0208 , G10L21/043 , G10L15/22
CPC classification number: G10L15/08 , G10L15/22 , G10L21/0208 , G10L21/043 , G10L2015/088 , G10L2021/02082
Abstract: A keyword detection method includes: obtaining an enhanced speech signal of a to-be-detected speech signal, the enhanced speech signal corresponding to a target speech speed; performing speed adjustment on the enhanced speech signal to obtain a first speed-adjusted speech signal having a first speech speed, the first speech speed being different from the target speech speed; obtaining a first speech feature signal according to the first speed-adjusted speech signal; obtaining a detection result according to a first keyword detection result corresponding to the first speech feature signal, the detection result indicating whether a target keyword exists in the to-be-detected speech signal; and performing an operation corresponding to the target keyword in response to determining that the target keyword exists according to the detection result.
-
-
-
-
-
-
-
-
-