SPEECH RECOGNITION
    12.
    发明申请

    公开(公告)号:US20250078839A1

    公开(公告)日:2025-03-06

    申请号:US18819018

    申请日:2024-08-29

    Abstract: A speech recognition method and a method for training a deep learning model are provided. The speech recognition method includes: obtaining a first speech feature of a speech to-be-recognized, which includes a plurality of speech segment features corresponding to a plurality of speech segments; decoding the first speech feature using a first decoder to obtain a plurality of first decoding results corresponding to a plurality of the words, indicating a first recognition result of words; extracting a second speech feature from the first speech feature based on first a priori information, which includes the plurality of first decoding results, and the second speech feature includes first word-level audio features corresponding to the plurality of words; and decoding the second speech feature using a second decoder to obtain a plurality of second decoding results corresponding to the plurality of words, indicating a second recognition result of the word.

    DATA GENERATION
    13.
    发明申请

    公开(公告)号:US20250061311A1

    公开(公告)日:2025-02-20

    申请号:US18746532

    申请日:2024-06-18

    Abstract: A data generation method is provided. The data generation method includes: generating first answer data based on first question data from a user; determining, in response to receiving negative feedback from the user for the first answer data, a first reflection result for the first answer data based on the first answer data and the negative feedback, wherein the first reflection result indicates a diagnosis reason why feedback from the user for the first answer data is negative; and generating second answer data for the first question data based on the first question data and the first reflection result.

    SPEECH WAKE-UP METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20240420684A1

    公开(公告)日:2024-12-19

    申请号:US18706313

    申请日:2023-01-17

    Abstract: A speech wake-up method, an electronic device, and a storage medium are provided. The method includes: performing a word recognition on a speech to be recognized to obtain a wake-up word recognition result (S210); performing a syllable recognition on the speech to be recognized to obtain a wake-up syllable recognition result, in response to determining that the wake-up word recognition result represents that the speech to be recognized contains a predetermined wake-up word (S220); and determining that the speech to be recognized is a correct wake-up speech, in response to determining that the wake-up syllable recognition result represents that the speech to be recognized contains a predetermined syllable (S230).

    GENERATIVE DIALOG MODEL TRAINING METHOD AND APPARATUS AS WELL AS GENERATIVE DIALOG IMPLEMENTING METHOD AND APPARATUS

    公开(公告)号:US20240338530A1

    公开(公告)日:2024-10-10

    申请号:US18745550

    申请日:2024-06-17

    CPC classification number: G06F40/35 G06N20/00

    Abstract: A generative dialog model training method in the fields of artificial intelligence, such as deep learning, natural language processing, intelligent dialogs, is disclosed. The generative dialog model training method may include: in response to determination of an update of a safety specification, taking an updated safety specification as a target safety specification, and determining a dialog input corresponding to a current optimization according to the target safety specification, the update being performed on a previous safety specification when a generative dialog model after last optimization is determined not to meet a launch requirement; and optimizing the generative dialog model according to the dialog input and a principle that a reply generated by the generative dialog model conforms to the target safety specification, the generative dialog model being configured to generate the reply corresponding to the dialog input.

    DEEP LEARNING MODEL BASED DATA GENERATION
    16.
    发明公开

    公开(公告)号:US20240028909A1

    公开(公告)日:2024-01-25

    申请号:US18478833

    申请日:2023-09-29

    CPC classification number: G06N3/096

    Abstract: A data generation method based on a deep learning model and a training method is provided. The data generation method includes: determining an initial input of the deep learning model based on input data; obtaining a first output of the model, where in response to the model determining that generating a reply based on the initial input requires calling a first functional component different from the deep learning model, the first output includes a first token for calling the first functional component and a first intermediate inquiry determined based on the initial input and recognizable by the first functional component; obtaining a first intermediate result determined by the first functional component based on the first intermediate inquiry; determining a second input for the model based on the initial input and the first intermediate result; and obtaining a second output of the model for generating a reply to the initial input.

    METHOD OF PROCESSING SPEECH INFORMATION, METHOD OF TRAINING MODEL, AND WAKE-UP METHOD

    公开(公告)号:US20230360638A1

    公开(公告)日:2023-11-09

    申请号:US18221593

    申请日:2023-07-13

    CPC classification number: G10L15/02 G10L15/14 G10L2015/027

    Abstract: A method of processing a speech information, a method of training a speech model, a speech wake-up method, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence technology, in particular to fields of human-computer interaction, deep learning and intelligent speech technologies. A specific implementation solution includes: performing a syllable recognition on a speech information to obtain a posterior probability sequence for the speech information, where the speech information includes a speech frame sequence, the posterior probability sequence corresponds to the speech frame sequence, and each posterior probability in the posterior probability sequence represents a similarity between a syllable in a speech frame matched with the posterior probability and a predetermined syllable; and determining a target peak speech frame from the speech frame sequence based on the posterior probability sequence.

    METHOD AND APPARATUS FOR TRAINING VOICE WAKE-UP MODEL, METHOD AND APPARATUS FOR VOICE WAKE-UP, DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20230317060A1

    公开(公告)日:2023-10-05

    申请号:US18328135

    申请日:2023-06-02

    CPC classification number: G10L15/063 G10L15/02

    Abstract: The present disclosure provides a method and an apparatus for training a voice wake-up model, a method and an apparatus for voice wake-up, a device and a storage medium, which relates to the field of artificial intelligence and particularly to the field of deep learning and voice technology. A specific implementation lies in: acquiring voice recognition training data and voice wake-up training data that are created, and firstly performing training on a base model according to the voice recognition training data to obtain a model parameter of the base model when a model loss function converges; then updating, based on a model configuration instruction, a configuration parameter of a decoding module in the base model to obtain a first model; and finally performing training on the first model according to the voice wake-up training data to obtain a trained voice wake-up model when the model loss function converges.

    CONVERSATIONAL RECOMMENDATION METHOD, METHOD OF TRAINING MODEL, DEVICE AND MEDIUM

    公开(公告)号:US20230088445A1

    公开(公告)日:2023-03-23

    申请号:US18059386

    申请日:2022-11-28

    Abstract: A conversational recommendation method, a method of training a conversational recommendation model, an electronic device, and a storage medium are provided, which are related to a technical field of data processing, in particular to technical fields of voice interaction, deep learning, artificial intelligence and the like. The conversational recommendation method includes: acquiring a historical conversation information; determining a target conversation object to be generated, from a conversation target graph based on the historical conversation information, the conversation target graph includes an object node, the object node is configured to represent a conversation object, and the target conversation object is determined based on the object node; and generating a target conversation information for recommendation based on the target conversation object.

Patent Agency Ranking