Method and device for processing voice interaction, electronic device and storage medium

    公开(公告)号:US12112746B2

    公开(公告)日:2024-10-08

    申请号:US17476333

    申请日:2021-09-15

    CPC classification number: G10L15/22 G10L2015/223

    Abstract: The present disclosure provides a method and a device for processing voice interaction, an electronic device and a storage medium. The method includes: determining a first integrity of a voice instruction from a user by using a pre-trained integrity detection model in response to detecting that the voice instruction from the user is not a high-frequency instruction; determining a waiting duration for the voice instruction based on the first integrity and a preset integrity threshold, wherein the waiting duration for the voice instruction indicates a length of period between a time when a voice interaction device determines that receiving the voice instruction is completed and a time when the voice interaction device performs an operation in response to the voice instruction of the user; and controlling the voice interaction device to respond to the voice instruction of the user based on the waiting duration.

    Method and apparatus for training speech recognition model, electronic device and storage medium

    公开(公告)号:US12100388B2

    公开(公告)日:2024-09-24

    申请号:US17747732

    申请日:2022-05-18

    Inventor: Qingen Zhao

    CPC classification number: G10L15/063 G10L15/02 G10L15/16 G10L15/22

    Abstract: A method and apparatus for training a speech recognition model, an electronic device and a storage medium are provided. An implementation of the method may include: determining a plurality of feature vectors based on audio feature data corresponding to a first target frame in a sample speech, wherein the sample speech comprises a conversation among a plurality of objects and the sample speech has a corresponding sample text; generating a predicted text element corresponding to the first target frame based on an adjacent text element preceding to a text element corresponding to the first target frame in the sample text, wherein the text element and the adjacent text element are targeting at a target object in the plurality of objects; obtaining a first target text element based on the predicted text element and a first feature vector in the plurality of feature vectors; and adjusting the speech recognition model based on the first target text element and the sample text, to obtain a trained speech recognition model.

    METHOD OF TRAINING TEXT RECOGNITION MODEL, AND METHOD OF RECOGNIZING TEXT

    公开(公告)号:US20240281609A1

    公开(公告)日:2024-08-22

    申请号:US18041207

    申请日:2022-05-16

    CPC classification number: G06F40/30 G06V30/12

    Abstract: The present application provides a method of training a text recognition model. The method includes: inputting a first sample image into the visual feature extraction sub-model to obtain a first visual feature and a first predicted text, the first sample image contains a text and a tag indicating a first actual text; obtaining, by using the semantic feature extraction sub-model, a first semantic feature based on the first predicted text; obtaining, by using the sequence sub-model, a second predicted text based on the first visual feature and the first semantic feature; and training the text recognition model based on the first predicted text, the second predicted text and the first actual text. The present disclosure further provides a method of recognizing a text, an electronic device, and a storage medium.

    Group service implementation method and device, equipment and storage medium

    公开(公告)号:US12069172B2

    公开(公告)日:2024-08-20

    申请号:US17452496

    申请日:2021-10-27

    CPC classification number: H04L9/088 H04L9/3247

    Abstract: Provided are a group service implementation method and device, an equipment and a storage medium. The specific solution is described below. A service transaction request is acquired. In response to the service transaction request including to-be-authenticated data and a threshold signature, a signature group corresponding to the threshold signature is determined. Group information of the signature group is acquired by querying a blockchain, where the signature group includes at least two members, the at least two members of the signature group are used for authenticating the to-be-authenticated data by adopting secure multi-party computation and generating the threshold signature for the to-be-authenticated data by adopting a signature private key, and the group information includes at least a verification public key of the threshold signature. The threshold signature is verified by adopting the verification public key in the group information.

    CONTENT INITIALIZATION METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20240275848A1

    公开(公告)日:2024-08-15

    申请号:US18020618

    申请日:2022-08-01

    CPC classification number: H04L67/1097 G06F7/582 G06F7/588

    Abstract: The present disclosure provides a content initialization method and apparatus, an electronic device and a storage medium, which relates to a field of computer technology, in particular to fields of deep learning and distributed computing. The content initialization method is applied to any one of a plurality of devices included in a distributed system. A specific implementation scheme of the content initialization method is: determining, according to a size information of a resource space for the distributed system and an identification information of the any one of the plurality of devices, a space information of a first sub-space for the any one of the plurality of devices in the resource space, wherein the space information includes a position information of the first sub-space for the resource space; and determining an initialization content for the first sub-space according to a random seed and the position information.

Patent Agency Ranking