Patent search caee:"BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD."

161.

发明申请
METHOD OF TRAINING INFORMATION GENERATION MODEL, METHOD OF GENERATING INFORMATION, AND DEVICE 有权

公开(公告)号：US20230075339A1

公开(公告)日：2023-03-09

申请号：US18056137

申请日：2022-11-16

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Zeyang LEI , Xinchao XU , Wenquan WU , Zhengyu Niu

IPC: G06F40/40 , G06F40/284

Abstract: The present disclosure provides a method of training an information generation model, a method of generating an information, an electronic device, and a storage medium. A specific implementation solution of the method of training the information generation model includes: splitting a description information for a target object in an information pair into at least one description word, so as to obtain a description word sequence, wherein the information pair further includes a first recommendation information; inputting the description word sequence into a dialog generation model to obtain a probability vector sequence for the target object, wherein each probability vector in the probability vector sequence includes probability values for a plurality of predetermined words; and training the dialog generation model according to the probability vector sequence and the first recommendation information, so as to obtain the information generation model.

162.

发明申请
METHOD FOR EXTRACTING TEXT INFORMATION, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20230073550A1

公开(公告)日：2023-03-09

申请号：US17988065

申请日：2022-11-16

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Han LIU , Teng Hu , Shikun Feng , Yongfeng Chen

IPC: G06F40/30 , G06F40/40 , G06F40/284

Abstract: A method for extracting text information includes: acquiring a text to be extracted and a target field name; extracting candidate text information matching the target field name from the text to be extracted based on the text to be extracted and the target field name; and acquiring target text information matching fusion semantics of the text to be extracted, the target field name and the candidate text information by filtering the candidate text information based on the fusion semantics. Therefore, when the candidate text information matching the target field name is extracted from the text to be extracted, the candidate text information is filtered based on the fusion semantics of the text to be extracted, the target field name and the candidate text information, which improves the accuracy of extracting text information.

163.

发明授权
Method for processing multimodal images, apparatus, device and storage medium 有权

公开(公告)号：US11600057B2

公开(公告)日：2023-03-07

申请号：US17355368

申请日：2021-06-23

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Shengzhao Wen

IPC: G06K9/46 , G06V10/44 , G06T7/55 , G06T7/33 , G06K9/62 , G06V40/16 , G06V40/18 , G06V40/12

Abstract: Provided are a method for processing multimodal images, an apparatus, a device and a storage medium. Multiple types of vision sensors are disposed in first preset identity recognition scenario. The method includes: if it is determined that a first vision sensor detects a biometric part of a target object, controlling each vision sensor to separately perform image acquisition for the biometric part in accordance with a preset acquisition strategy to obtain a target visual image of corresponding type and acquisition time information of the target visual image; performing identity recognition for the target object according to first target visual image to determine object identification information corresponding to first target visual image; determining object identification information corresponding to a target visual image of other type other than first target visual image according to acquisition time information of each target visual image and object identification information corresponding to first target visual image.

164.

发明申请
BROADCAST STYLE DETERMINATION METHOD AND APPARATUS, DEVICE AND COMPUTER STORAGE MEDIUM 有权

公开(公告)号：US20230067177A1

公开(公告)日：2023-03-02

申请号：US17749254

申请日：2022-05-20

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Shiqiang DING , Jizhou HUANG , Di WU

IPC: G06F40/295 , G06F16/33 , G06N5/02

Abstract: The present disclosure discloses a broadcast style determination method and apparatus, a device and a computer storage medium, and relates to voice and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: performing named entity recognition on broadcast text to obtain at least one named entity; acquiring domain knowledge corresponding to the at least one named entity; and performing sentiment analysis by using the broadcast text and the domain knowledge, to determine a broadcast style of the broadcast text.

165.

发明申请
METHOD OF PROCESSING IMAGE, METHOD OF TRAINING MODEL, ELECTRONIC DEVICE AND MEDIUM 有权

公开(公告)号：US20230065675A1

公开(公告)日：2023-03-02

申请号：US17982616

申请日：2022-11-08

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Tianshu Hu , Shengyi He , Junyu Han , Zhibin Hong

IPC: G06T13/40

Abstract: A method of processing an image, a method of training a model, an electronic device and a medium, which relate to a field of artificial intelligence technology, in particular to deep learning, computer vision and other technical fields. A solution includes: generating a first face image, wherein a definition difference and an authenticity difference between the first face image and a reference face image are within a set range; adjusting, according to a target voice used to drive the first face image, a facial action information related to pronunciation in the first face image to generate a second face image with a facial tissue position conforming to a pronunciation rule of the target voice; and determining the second face image as a face image driven by the target voice.

166.

发明申请
ROAD DATA MONITORING METHOD AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20230065341A1

公开(公告)日：2023-03-02

申请号：US17658518

申请日：2022-04-08

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Meng Li , Hui Zhao , Deguo Xia , Bing Jiang

IPC: G01C21/00

Abstract: The present disclosure provides a method and an apparatus for monitoring road data, an electronic device and a storage medium, and relates to the field of intelligent transportation. A specific implementation solution involves: acquiring a feature expression of a road; predicting, according to the feature expression of the road and a pre-trained change prediction model, a probability that road data of the road will change; and collecting the road data of the road based on the probability that the road data will change.

167.

发明申请
METHOD AND DEVICE FOR TRAINING, BASED ON CROSSMODAL INFORMATION, DOCUMENT READING COMPREHENSION MODEL 有权

公开(公告)号：US20230061398A1

公开(公告)日：2023-03-02

申请号：US17984034

申请日：2022-11-09

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Shangwen LYU , Hongyu LI , Jing LIU , Hua WU , Haifeng WANG

IPC: G06V30/19 , G06V30/412 , G06V30/194 , G06F40/205

Abstract: A method for training a document reading comprehension model includes: acquiring a question sample and a rich-text document sample, in which the rich-text document sample includes a real answer of the question sample; acquiring text information and layout information of the rich-text document sample by performing OCR processing on image information of the rich-text document sample; acquiring a predicted answer of the question sample by inputting the text information, the layout information and the image information of the rich-text document sample into a preset reading comprehension model; and training the reading comprehension model based on the real answer and the predicted answer. The method may enhance comprehension ability of the reading comprehension model to the long rich-text document, and save labor cost.

168.

发明申请
SPEECH SYNTHESIS METHOD AND APPARATUS, DEVICE AND COMPUTER STORAGE MEDIUM 有权

公开(公告)号：US20230059882A1

公开(公告)日：2023-02-23

申请号：US17738186

申请日：2022-05-06

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Liqiang ZHANG , Jiankang HOU , Tao SUN , Lei JIA

IPC: G10L13/10 , G06F40/20 , G10L13/047

Abstract: The present disclosure discloses a speech synthesis method and apparatus, a device and a computer storage medium, and relates to speech and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: acquiring to-be-synthesized text; acquiring a prosody feature extracted from the text; inputting the text and the prosody feature into a speech synthesis model to obtain a vocoder feature; and inputting the vocoder feature into a vocoder to obtain synthesized speech.

169.

发明授权
Method and apparatus for operating blockchain system, device and storage medium 有权

公开(公告)号：US11588654B2

公开(公告)日：2023-02-21

申请号：US17662072

申请日：2022-05-04

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Chunhui Wan , Tong Jin , Zhimin Wei , Jiaxiang Liu , Lei Zhang , Bingxin Fan

IPC: H04L9/00 , G06F9/455 , G06F9/46 , G06F9/54

Abstract: Provided are a method and apparatus for operating a blockchain system, a device and a storage medium. The method is described below. To-be-processed blockchain data is acquired through a kernel engine of a blockchain system. The to-be-processed blockchain data is processed through the kernel engine, and a kernel component interface provided by a component adaptor is called during the processing process of the to-be-processed blockchain data to call a kernel component.

170.

发明申请
HUMAN-OBJECT INTERACTION DETECTION 有权

公开(公告)号：US20230051232A1

公开(公告)日：2023-02-16

申请号：US17976673

申请日：2022-10-28

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Desen ZHOU , Jian WANG , Hao SUN

IPC: G06V20/52 , G06V20/40 , G06V40/20

Abstract: A human-object interaction detection method, a neural network and a training method therefor is provided. The human-object interaction detection method includes: performing first target feature extraction on an image feature of an image; performing first interaction feature extraction on the image feature; processing a plurality of first target features to obtain target information of a plurality of detected targets; processing one or more first interaction features to obtain motion information of a motion, human information of a human target corresponding to each motion, and object information of an object target corresponding to each motion; matching the plurality of detected targets with one or more motions; and updating human information of a corresponding human target based on target information of a detected target matching the corresponding human target, and updating object information of a corresponding object target based on target information of a detected target matching the corresponding object target.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification