-
公开(公告)号:US20230076471A1
公开(公告)日:2023-03-09
申请号:US17982965
申请日:2022-11-08
Inventor: Xiyang WANG , Ruiqing ZHANG , Zhongjun HE , Zhi LI , Hua WU
Abstract: A training method, a text translation method, an electronic device, and a storage medium, which relate to a field of artificial intelligence, in particular to fields of natural language processing and deep learning technologies. A specific implementation solution includes: performing a feature extraction on source sample text data to obtain a sample feature vector sequence; obtaining a target sample feature vector according to the sample feature vector sequence; performing an autoregressive decoding and a non-autoregressive decoding on the sample feature vector sequence, respectively; performing a length prediction on the target sample feature vector; training a predetermined model by using translation sample data, the autoregressive text translation result, the non-autoregressive text translation result, a true length value of the source sample text, the first predicted length value, a true length value of the translation sample text, and the second predicted length value to obtain the text translation model.
-
152.
公开(公告)号:US20230075339A1
公开(公告)日:2023-03-09
申请号:US18056137
申请日:2022-11-16
Inventor: Zeyang LEI , Xinchao XU , Wenquan WU , Zhengyu Niu
IPC: G06F40/40 , G06F40/284
Abstract: The present disclosure provides a method of training an information generation model, a method of generating an information, an electronic device, and a storage medium. A specific implementation solution of the method of training the information generation model includes: splitting a description information for a target object in an information pair into at least one description word, so as to obtain a description word sequence, wherein the information pair further includes a first recommendation information; inputting the description word sequence into a dialog generation model to obtain a probability vector sequence for the target object, wherein each probability vector in the probability vector sequence includes probability values for a plurality of predetermined words; and training the dialog generation model according to the probability vector sequence and the first recommendation information, so as to obtain the information generation model.
-
公开(公告)号:US20230073550A1
公开(公告)日:2023-03-09
申请号:US17988065
申请日:2022-11-16
Inventor: Han LIU , Teng Hu , Shikun Feng , Yongfeng Chen
IPC: G06F40/30 , G06F40/40 , G06F40/284
Abstract: A method for extracting text information includes: acquiring a text to be extracted and a target field name; extracting candidate text information matching the target field name from the text to be extracted based on the text to be extracted and the target field name; and acquiring target text information matching fusion semantics of the text to be extracted, the target field name and the candidate text information by filtering the candidate text information based on the fusion semantics. Therefore, when the candidate text information matching the target field name is extracted from the text to be extracted, the candidate text information is filtered based on the fusion semantics of the text to be extracted, the target field name and the candidate text information, which improves the accuracy of extracting text information.
-
公开(公告)号:US11600057B2
公开(公告)日:2023-03-07
申请号:US17355368
申请日:2021-06-23
Inventor: Shengzhao Wen
Abstract: Provided are a method for processing multimodal images, an apparatus, a device and a storage medium. Multiple types of vision sensors are disposed in first preset identity recognition scenario. The method includes: if it is determined that a first vision sensor detects a biometric part of a target object, controlling each vision sensor to separately perform image acquisition for the biometric part in accordance with a preset acquisition strategy to obtain a target visual image of corresponding type and acquisition time information of the target visual image; performing identity recognition for the target object according to first target visual image to determine object identification information corresponding to first target visual image; determining object identification information corresponding to a target visual image of other type other than first target visual image according to acquisition time information of each target visual image and object identification information corresponding to first target visual image.
-
公开(公告)号:US20230067177A1
公开(公告)日:2023-03-02
申请号:US17749254
申请日:2022-05-20
Inventor: Shiqiang DING , Jizhou HUANG , Di WU
IPC: G06F40/295 , G06F16/33 , G06N5/02
Abstract: The present disclosure discloses a broadcast style determination method and apparatus, a device and a computer storage medium, and relates to voice and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: performing named entity recognition on broadcast text to obtain at least one named entity; acquiring domain knowledge corresponding to the at least one named entity; and performing sentiment analysis by using the broadcast text and the domain knowledge, to determine a broadcast style of the broadcast text.
-
公开(公告)号:US20230065675A1
公开(公告)日:2023-03-02
申请号:US17982616
申请日:2022-11-08
Inventor: Tianshu Hu , Shengyi He , Junyu Han , Zhibin Hong
IPC: G06T13/40
Abstract: A method of processing an image, a method of training a model, an electronic device and a medium, which relate to a field of artificial intelligence technology, in particular to deep learning, computer vision and other technical fields. A solution includes: generating a first face image, wherein a definition difference and an authenticity difference between the first face image and a reference face image are within a set range; adjusting, according to a target voice used to drive the first face image, a facial action information related to pronunciation in the first face image to generate a second face image with a facial tissue position conforming to a pronunciation rule of the target voice; and determining the second face image as a face image driven by the target voice.
-
公开(公告)号:US20230065341A1
公开(公告)日:2023-03-02
申请号:US17658518
申请日:2022-04-08
Inventor: Meng Li , Hui Zhao , Deguo Xia , Bing Jiang
IPC: G01C21/00
Abstract: The present disclosure provides a method and an apparatus for monitoring road data, an electronic device and a storage medium, and relates to the field of intelligent transportation. A specific implementation solution involves: acquiring a feature expression of a road; predicting, according to the feature expression of the road and a pre-trained change prediction model, a probability that road data of the road will change; and collecting the road data of the road based on the probability that the road data will change.
-
158.
公开(公告)号:US20230061398A1
公开(公告)日:2023-03-02
申请号:US17984034
申请日:2022-11-09
Inventor: Shangwen LYU , Hongyu LI , Jing LIU , Hua WU , Haifeng WANG
IPC: G06V30/19 , G06V30/412 , G06V30/194 , G06F40/205
Abstract: A method for training a document reading comprehension model includes: acquiring a question sample and a rich-text document sample, in which the rich-text document sample includes a real answer of the question sample; acquiring text information and layout information of the rich-text document sample by performing OCR processing on image information of the rich-text document sample; acquiring a predicted answer of the question sample by inputting the text information, the layout information and the image information of the rich-text document sample into a preset reading comprehension model; and training the reading comprehension model based on the real answer and the predicted answer. The method may enhance comprehension ability of the reading comprehension model to the long rich-text document, and save labor cost.
-
公开(公告)号:US20230059882A1
公开(公告)日:2023-02-23
申请号:US17738186
申请日:2022-05-06
Inventor: Liqiang ZHANG , Jiankang HOU , Tao SUN , Lei JIA
IPC: G10L13/10 , G06F40/20 , G10L13/047
Abstract: The present disclosure discloses a speech synthesis method and apparatus, a device and a computer storage medium, and relates to speech and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: acquiring to-be-synthesized text; acquiring a prosody feature extracted from the text; inputting the text and the prosody feature into a speech synthesis model to obtain a vocoder feature; and inputting the vocoder feature into a vocoder to obtain synthesized speech.
-
公开(公告)号:US11588654B2
公开(公告)日:2023-02-21
申请号:US17662072
申请日:2022-05-04
Inventor: Chunhui Wan , Tong Jin , Zhimin Wei , Jiaxiang Liu , Lei Zhang , Bingxin Fan
Abstract: Provided are a method and apparatus for operating a blockchain system, a device and a storage medium. The method is described below. To-be-processed blockchain data is acquired through a kernel engine of a blockchain system. The to-be-processed blockchain data is processed through the kernel engine, and a kernel component interface provided by a component adaptor is called during the processing process of the to-be-processed blockchain data to call a kernel component.
-
-
-
-
-
-
-
-
-