Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Jingtuo LIU"

1.

发明申请
IMAGE PROCESSING METHOD, TEXT RECOGNITION METHOD AND APPARATUS 有权

公开(公告)号：US20220415072A1

公开(公告)日：2022-12-29

申请号：US17901897

申请日：2022-09-02

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Jingtuo LIU

IPC: G06V30/412 , G06V30/413

Abstract: The present disclosure provides an image processing method, a text recognition method and an apparatus. The image processing method includes: preprocessing acquired sample images to obtain position information, image blocks and text content corresponding to fields in the sample images respectively; making a mask prediction on the position information of the fields according to the position information, the image blocks and the text content corresponding to the fields respectively to obtain a prediction result; and training according to the prediction result to obtain a text recognition model, where the text recognition model is used to perform text recognition on a to-be-recognized image.

2.

发明申请
METHOD FOR TRAINING IMAGE RECOGNITION MODEL BASED ON SEMANTIC ENHANCEMENT 有权

公开(公告)号：US20220392205A1

公开(公告)日：2022-12-08

申请号：US17892669

申请日：2022-08-22

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Yipeng SUN , Rongqiao AN , Xiang WEI , Longchao WANG , Kun YAO , Junyu HAN , Jingtuo LIU , Errui DING

IPC: G06V10/80 , G06V10/77

Abstract: Embodiments of the present disclosure provide a method and apparatus for training an image recognition model based on a semantic enhancement, a method and apparatus for recognizing an image, an electronic device, and a computer readable storage medium. The method for training an image recognition model based on a semantic enhancement comprises: extracting, from an inputted first image being unannotated and having no textual description, a first feature representation of the first image; calculating a first loss function based on the first feature representation; extracting, from an inputted second image being unannotated and having an original textual description, a second feature representation of the second image; calculating a second loss function based on the second feature representation, and training an image recognition model based on a fusion of the first loss function and the second loss function.

3.

发明申请
TEXT DETECTION METHOD, TEXT RECOGNITION METHOD AND APPARATUS 有权

公开(公告)号：US20230045715A1

公开(公告)日：2023-02-09

申请号：US17966112

申请日：2022-10-14

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Chengquan ZHANG , Pengyuan LV , Sen FAN , Kun YAO , Junyu HAN , Jingtuo LIU

IPC: G06V30/19 , G06V30/16 , G06V30/14

Abstract: The present disclosure provides a text detection method, a text recognition method and an apparatus, which relate to the field of artificial intelligence technology, in particular to the field of deep learning and computer vision technologies, and can be applied to scenarios such as optical character recognition. The text detection method is: acquiring an image feature of a text strip in a to-be-recognized image; performing visual enhancement processing on the to-be-recognized image to obtain an enhanced feature map of the to-be-recognized image; comparing the image feature of the text strip with the enhanced feature map for similarity to obtain a target bounding box of the text strip on the enhanced feature map.

4.

发明申请
IMAGE CLASSIFICATION METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20220027611A1

公开(公告)日：2022-01-27

申请号：US17498226

申请日：2021-10-11

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Yuechen YU , Chengquan ZHANG , Yulin LI , Xiaoqiang ZHANG , Ju HUANG , Xiameng QIN , Kun YAO , Jingtuo LIU , Junyu HAN , Errui DING

IPC: G06K9/00 , G06K9/62 , G06N3/08

Abstract: Provided are an image classification method and apparatus, an electronic device and a storage medium, relating to the field of artificial intelligence and, in particular, to computer vision and deep learning. The method includes inputting a to-be-classified document image into a pretrained neural network and obtaining a feature submap of each text box of the to-be-classified document image by use of the neural network; inputting the feature submap of each text box, a semantic feature corresponding to preobtained text information of each text box and a position feature corresponding to preobtained position information of each text box into a pretrained multimodal feature fusion model and fusing, by use of the multimodal feature fusion model, the three into a multimodal feature corresponding to each text box; and classifying the to-be-classified document image based on the multimodal feature corresponding to each text box.

5.

发明公开
METHOD OF TRAINING TEXT RECOGNITION MODEL, AND METHOD OF RECOGNIZING TEXT 审中-公开

公开(公告)号：US20240281609A1

公开(公告)日：2024-08-22

申请号：US18041207

申请日：2022-05-16

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Pengyuan LV , Jingquan LI , Chengquan ZHANG , Kun YAO , Jingtuo LIU , Junyu HAN

IPC: G06F40/30 , G06V30/12

CPC classification number: G06F40/30 , G06V30/12

Abstract: The present application provides a method of training a text recognition model. The method includes: inputting a first sample image into the visual feature extraction sub-model to obtain a first visual feature and a first predicted text, the first sample image contains a text and a tag indicating a first actual text; obtaining, by using the semantic feature extraction sub-model, a first semantic feature based on the first predicted text; obtaining, by using the sequence sub-model, a second predicted text based on the first visual feature and the first semantic feature; and training the text recognition model based on the first predicted text, the second predicted text and the first actual text. The present disclosure further provides a method of recognizing a text, an electronic device, and a storage medium.

6.

发明公开
IMAGE RENDERING METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20230419610A1

公开(公告)日：2023-12-28

申请号：US18185359

申请日：2023-03-16

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Xing LIU , Ruizhi CHEN , Yan ZHANG , Chen ZHAO , Hao SUN , Jingtuo LIU , Errui DING , Tian WU , Haifeng WANG

IPC: G06T17/20 , G06T5/50 , G06V10/26 , G06V10/60

CPC classification number: G06T17/20 , G06T5/50 , G06V10/26 , G06V10/60 , G06T2207/10028 , G06T2207/20221

Abstract: An image rendering method includes the steps below. A model of an environmental object is rendered to obtain an image of the environmental object in a target perspective. An image of a target object in the target perspective and a model of the target object are determined according to a neural radiance field of the target object. The image of the target object is fused and rendered into the image of the environmental object according to the model of the target object.

7.

发明申请
Model Determination Method and Electronic Device 有权

公开(公告)号：US20230124389A1

公开(公告)日：2023-04-20

申请号：US17887690

申请日：2022-08-15

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Longchao WANG , Yipeng SUN , Kun YAO , Junyu HAN , Jingtuo LIU , Errui DING

IPC: G06V10/70 , G06V10/774

Abstract: A model determination method and electronic device is provided, and relates to the technical field of artificial intelligence and, in particular, to the field of computer visions and deep learning, and can be applied to image processing, image identification and other scenarios. A specific implementation solution includes an image sample and a text sample are acquired, wherein text data in the text sample is used for performing text description to target image data in the image sample; at least one image feature in the image sample is stored to a first queue, and at least text feature in the text sample is stored to a second queue; the first queue and the second queue are trained to obtain a first target model; and the first target model is determined as an initialization model for a second target model.

8.

发明申请
METHOD, APPARATUS AND SYSTEM FOR RETRIEVING IMAGE 有权

公开(公告)号：US20220292131A1

公开(公告)日：2022-09-15

申请号：US17826760

申请日：2022-05-27

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Ruibin BAI , Xiang WEI , Yipeng SUN , Kun YAO , Jingtuo LIU , Junyu HAN

IPC: G06F16/583 , G06V10/74 , G06V10/44 , G06F16/535

Abstract: A method, apparatus and system for retrieving an image is provided, the method comprises: detecting, in response to receiving a query request comprising a target image, a target subject from the target image; extracting a subject feature from the target subject if a confidence level of a detection box of the detected target subject is greater than a first threshold, the subject feature comprising an identical feature, a similar feature and a category; performing matching on the subject feature of the target image and a subject feature of a candidate image pre-stored in a database, to obtain a similarity score and an identicalness score of the candidate image; and selecting, according to the similarity score and the identicalness score, a predetermined number of candidate images as a search result for output.

9.

发明申请
METHOD, APPARATUS AND ELECTRONIC DEVICE FOR DETERMINING SKIN SMOOTHNESS 有权

公开(公告)号：US20210192725A1

公开(公告)日：2021-06-24

申请号：US17021114

申请日：2020-09-15

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Zhizhi GUO , Yipeng SUN , Jingtuo LIU , Junyu HAN , Duo YANG , Yue DANG , Huichao WANG

IPC: G06T7/00

Abstract: The present disclosure discloses a method, apparatus and electronic device for determining skin smoothness, which relates to the field of computer vision technologies. The specific implementation solution is as follows: when the skin smoothness is calculated, an image to be detected including a face area is obtained first, and then the image to be detected and a smoothness analysis mask image corresponding to the image to be detected are inputted into a deep learning model to obtain a plurality of feature vectors for indicating the skin smoothness of the face. Because the smoothness analysis mask image does not include preset factors including at least one of five sense organs, reflection and hair, the influence of the preset factors on the skin smoothness is avoided, so that the accuracy for the skin smoothness of the face is ensured to a certain extent.

10.

发明公开
PRE-TRAINING METHOD, IMAGE AND TEXT RETRIEVAL METHOD FOR A VISION AND SCENE TEXT AGGREGATION MODEL, ELECTRONIC DEVICE, AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20230386168A1

公开(公告)日：2023-11-30

申请号：US18192393

申请日：2023-03-29

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Yipeng SUN , Mengjun CHENG , Longchao WANG , Xiongwei ZHU , Kun YAO , Junyu HAN , Jingtuo LIU , Errui DING , Jingdong WANG , Haifeng Wang

IPC: G06V10/42 , G06F16/583 , H04N19/176

CPC classification number: G06V10/42 , G06F16/5846 , H04N19/176

Abstract: A pre-training method for a Vision and Scene Text Aggregation model includes: acquiring a sample image-text pair; extracting a sample scene text from a sample image; inputting a sample text into a text encoding network to obtain a sample text feature; inputting the sample image and an initial sample aggregation feature into a visual encoding subnetwork and inputting the initial sample aggregation feature and the sample scene text into a scene encoding subnetwork to obtain a global image feature of the sample image and a learned sample aggregation feature; and pre-training the Vision and Scene Text Aggregation model according to the sample text feature, the global image feature of the sample image, and the learned sample aggregation feature.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification