Patent search ap:("Beijing Baidu Netcom Science Technology Co. Page Ltd.") AND inv:"Hao Sun"

21.

发明申请
VIDEO-BASED HUMAN BEHAVIOR RECOGNITION METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20210192194A1

公开(公告)日：2021-06-24

申请号：US17022219

申请日：2020-09-16

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Zhizhen Chi , Fu Li , Hao Sun , Dongliang He , Xiang Long , Zhichao Zhou , Ping Wang , Shilei Wen , Errui Ding

IPC: G06K9/00 , G06F3/01 , G06N3/02

Abstract: The present application discloses a video-based human behavior recognition method, apparatus, device and storage medium, and relates to the technical field of human recognitions. The specific implementation scheme lies in: acquiring a human rectangle of each video frame of the video to be recognized, where each human rectangle includes a plurality of human key points, and each of the human key points has a key point feature; constructing a feature matrix according to the human rectangle of the each video frame; convolving the feature matrix with respect to a video frame quantity dimension to obtain a first convolution result and convolving the feature matrix with respect to a key point quantity dimension to obtain a second convolution result; inputting the first convolution result and the second convolution result into a preset classification model to obtain a human behavior category of the video to be recognized.

22.

发明公开
METHOD FOR GENERATING RELIGHTED IMAGE AND ELECTRONIC DEVICE 审中-公开

公开(公告)号：US20230215132A1

公开(公告)日：2023-07-06

申请号：US18183439

申请日：2023-03-14

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Fu Li , Hao Sun

IPC: G06V10/60 , G06T3/40 , G06T5/50 , G06V10/44 , G06V10/62 , G06V10/74

CPC classification number: G06V10/60 , G06T3/40 , G06T5/50 , G06V10/44 , G06V10/62 , G06V10/761 , G06T2207/20221

Abstract: A method for generating a relighted image includes: obtaining a to-be-processed image and a guidance image corresponding to the to-be-processed image; obtaining a first intermediate image consistent with an illumination condition in the guidance image by performing relighting rendering on the to-be-processed image in a time domain based on the guidance image; obtaining a second intermediate image consistent with the illumination condition in the guidance image by performing relighting rendering on the to-be-processed image in a frequency domain based on the guidance image; and obtaining a target relighted image corresponding to the to-be-processed image based on the first intermediate image and the second intermediate image.

23.

发明申请
HUMAN-OBJECT INTERACTION DETECTION 有权

公开(公告)号：US20230047628A1

公开(公告)日：2023-02-16

申请号：US17976668

申请日：2022-10-28

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Desen ZHOU , Jian Wang , Hao Sun

IPC: G06V20/52 , G06V40/20 , G06V20/40

Abstract: A human-object interaction detection method, a neural network and a training method therefor is provided. The human-object interaction detection method includes: performing first target feature extraction on image features of an image to obtain first target features; performing first interaction feature extraction on image features to obtain first interaction features and scores thereof; determining at least some first interaction features in the first interaction features based on the score of each of the first interaction features; determining first motion features based on the at least some first interaction features and the image features; processing the first target features to obtain target information of targets in the image; processing the first motion features to obtain motion information of one or more motions in the image; and matching the targets with the motions to obtain a human-object interaction detection result.

24.

发明授权
Video-based human behavior recognition method, apparatus, device and storage medium 有权

公开(公告)号：US11430265B2

公开(公告)日：2022-08-30

申请号：US17022219

申请日：2020-09-16

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Zhizhen Chi , Fu Li , Hao Sun , Dongliang He , Xiang Long , Zhichao Zhou , Ping Wang , Shilei Wen , Errui Ding

IPC: G06V40/20 , G06F3/01 , G06N3/02 , G06V20/40

Abstract: The present application discloses a video-based human behavior recognition method, apparatus, device and storage medium, and relates to the technical field of human recognitions. The specific implementation scheme lies in: acquiring a human rectangle of each video frame of the video to be recognized, where each human rectangle includes a plurality of human key points, and each of the human key points has a key point feature; constructing a feature matrix according to the human rectangle of the each video frame; convolving the feature matrix with respect to a video frame quantity dimension to obtain a first convolution result and convolving the feature matrix with respect to a key point quantity dimension to obtain a second convolution result; inputting the first convolution result and the second convolution result into a preset classification model to obtain a human behavior category of the video to be recognized.

25.

发明申请
STEREO MATCHING METHOD, MODEL TRAINING METHOD, RELEVANT ELECTRONIC DEVICES 有权

公开(公告)号：US20220230343A1

公开(公告)日：2022-07-21

申请号：US17709291

申请日：2022-03-30

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Xiaoqing Ye , Xiao Tan , Hao Sun

IPC: G06T7/593

Abstract: A computer-implemented stereo matching method includes: obtaining a first binocular image; inputting the first binocular image into an object model for a first operation to obtain a first initial disparity map and a first offset disparity map with respect to the first initial disparity map; and performing aggregation on the first initial disparity map and the first offset disparity map to obtain a first target disparity map of the first binocular image. The first initial disparity map is obtained through stereo matching on a second binocular image corresponding to the first binocular image, a size of the second binocular image is smaller than a size of the first binocular image, and the first offset disparity map is obtained through stereo matching on the first binocular image within a predetermined disparity offset range.

26.

发明授权
Object tracking method and apparatus 有权

公开(公告)号：US11288887B2

公开(公告)日：2022-03-29

申请号：US16869411

申请日：2020-05-07

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Xipeng Yang , Xiao Tan , Hao Sun , Shilei Wen , Errui Ding

IPC: G06V20/54 , G06V10/20 , G06V10/42 , G06V10/56 , G06V20/40 , G06V20/62

Abstract: Embodiments of the present disclosure provide an object tracking method and an apparatus. The method includes: obtaining multiple frames of first images shot by a first camera apparatus and a first shooting moment of each frame of the first images, where the first images include a first object; obtaining multiple frames of second images shot by a second camera apparatus and a second shooting moment of each frame of the second images, where the second images include a second object; obtaining a distance between the first camera apparatus and the second camera apparatus; and judging whether the first object and the second object are the same object according to the multiple frames of the first images, the first shooting moment of each frame of the first images, the multiple frames of the second images, the second shooting moment of each frame of the second images and the distance.

27.

发明申请
METHOD AND DEVICE FOR TRAINING IMAGE RECOGNITION MODEL, EQUIPMENT AND MEDIUM 有权

公开(公告)号：US20220092353A1

公开(公告)日：2022-03-24

申请号：US17540207

申请日：2021-12-01

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Ruixue Liu , Xiameng Qin , Mengyi En , Kun Yao , Chengquan Zhang , Shengxian Zhu , Yunhao Li , Junyu Han , Hao Sun

IPC: G06K9/62 , G06V30/30 , G06V30/14

Abstract: A computer-implemented method includes: acquiring training data, the training data includes training images for a preset vertical type, and the training images include a first training image containing real data of the preset vertical type and a second training image containing virtual data of the preset vertical type ; building a basic model, the basic model includes a deep learning network, and the deep learning network is configured to recognize the training images to extract text data in the training image; and training the basic model by using the training data to obtain the image recognition model.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification