TEXT-DRIVEN VIDEO SYNTHESIS WITH PHONETIC DICTIONARY

    公开(公告)号:US20210390945A1

    公开(公告)日:2021-12-16

    申请号:US17221701

    申请日:2021-04-02

    申请人: Baidu USA, LLC

    摘要: Presented herein are novel approaches to synthesize video of the speech from text. In a training phase, embodiments build a phoneme-pose dictionary and train a generative neural network model using a generative adversarial network (GAN) to generate video from interpolated phoneme poses. In deployment, the trained generative neural network in conjunction with the phoneme-pose dictionary convert an input text into a video of a person speaking the words of the input text. Compared to audio-driven video generation approaches, the embodiments herein have a number of advantages: 1) they only need a fraction of the training data used by an audio-driven approach; 2) they are more flexible and not subject to vulnerability due to speaker variation; and 3) they significantly reduce the preprocessing, training, and inference times.

    METHOD FOR AI MODEL TRANSFERRING WITH LAYER AND MEMORY RANDOMIZATION

    公开(公告)号:US20210390463A1

    公开(公告)日:2021-12-16

    申请号:US16900584

    申请日:2020-06-12

    申请人: Baidu USA LLC

    IPC分类号: G06N20/10 G06F12/08 G06F9/50

    摘要: A method to transfer an artificial intelligence (AI) model includes identifying a plurality of layers of the AI model, the plurality of layers organized in a first ordered list. The method further includes randomizing the plurality of layers by reorganizing the first ordered list into a second ordered list, and transferring the plurality of layers of the AI model to a data processing accelerator in an order defined by the second ordered list.

    METHOD FOR AI MODEL TRANSFERRING WITH ADDRESS RANDOMIZATION

    公开(公告)号:US20210390047A1

    公开(公告)日:2021-12-16

    申请号:US16900597

    申请日:2020-06-12

    申请人: Baidu USA LLC

    IPC分类号: G06F12/06 G06F9/50 G06N20/10

    摘要: A method to transfer an artificial intelligence (AI) model includes identifying a plurality of layers of an AI model, wherein each layer of the plurality of layers is associated with a memory address. The method further includes randomizing the memory address associated with each layer of the plurality of layers, and transferring the plurality of layers with the randomized memory addresses to a data processing accelerator to execute the AI model.

    Automated factory testflow of processing unit with sensor integration for driving platform

    公开(公告)号:US11198444B2

    公开(公告)日:2021-12-14

    申请号:US16510124

    申请日:2019-07-12

    申请人: Baidu USA LLC

    IPC分类号: B60W50/04 H04N17/00 G07C5/08

    摘要: Diagnosing a sensor processing unit of an autonomous driving vehicle is described. An example computer-implemented method can include transmitting an executable image of a sensor processing application from a host system to the sensor processing unit via at least one of a universal asynchronous receiver-transmitter (UART) or an Ethernet connection. The method also includes causing the sensor processing unit to execute and launch the executable image of the sensor processing application in the DRAM from the eMMC storage device. The method also includes transmitting a sequence of predetermined commands to the executed sensor processing application to perform a plurality of sensor data processing operations on sensor data obtained from a plurality of sensors or sensor simulators associated with an autonomous driving vehicle. The method also includes comparing processing results of the sensor processing operations against expected processing results to determine whether the sensor processing application operates properly.

    Question generation systems and methods for automating diagnosis

    公开(公告)号:US11194860B2

    公开(公告)日:2021-12-07

    申请号:US15207445

    申请日:2016-07-11

    申请人: Baidu USA, LLC

    摘要: Systems and methods are disclosed for question generation to obtain more related medical information based on observed symptoms from a patient. In embodiments, possible diseases associated with the observed symptoms are generated by querying a knowledge graph. In embodiments, candidate symptoms associated with the possible diseases are also identified and are combined with the observed symptoms to obtain combined symptom sets. In embodiments, discriminative scores for the candidate symptom sets are determined and candidate symptoms with top discriminative scores are selected. In embodiments, these selected candidate symptoms may be checked for conflicts with observed symptoms and removed from further consideration if a conflict exists. In embodiments, one or more questions may be generated based on the remaining selected candidate systems to aid in collecting information about the patient. In embodiments, the process may be repeated with the updated observed symptoms.

    COMPLETE BLIND-MATE CONNECTION SYSTEM FOR LIQUID COOLING

    公开(公告)号:US20210378141A1

    公开(公告)日:2021-12-02

    申请号:US16884969

    申请日:2020-05-27

    申请人: Baidu USA LLC

    发明人: Shuai Shao Tianyi Gao

    IPC分类号: H05K7/20 H05K7/12 H05K7/14

    摘要: A holder for an electronic rack includes a pivot point and a first end of the holder having a first blind-mate connector to be coupled to a second blind-mate connector at a first engagement interface. The first blind-mate connector and the second blind-mate connector are coupled in response to the holder moving to the second position in response to contact with the electronic rack. The holder additionally includes a second end of the holder having a third blind-mate connector to be coupled to a fourth blind-mate connector at a second engagement interface. The third blind-mate connector and the fourth blind-mate connector are coupled in response to the holder moving to the second position in response to contact with the electronic rack.

    Stereo camera depth determination using hardware accelerator

    公开(公告)号:US11182917B2

    公开(公告)日:2021-11-23

    申请号:US16483399

    申请日:2017-12-08

    摘要: Described herein are systems and methods that allow for dense depth map estimation given input images. In one or more embodiments, a neural network model was developed that significantly differs from prior approaches. Embodiments of the deep neural network model comprises more computationally efficient structures and fewer layers but still produces good quality results. Also, in one or more embodiments, the deep neural network model may be specially configured and trained to operate using a hardware accelerator component or components that can speed computation and produce good results, even if lower precision bit representations are used during computation at the hardware accelerator component.