-
171.
公开(公告)号:US20230004717A1
公开(公告)日:2023-01-05
申请号:US17572068
申请日:2022-01-10
Inventor: Lijie WANG , Shuai ZHANG , Xinyan XIAO , Yue CHANG , Tingting LI
IPC: G06F40/289 , G06N20/00
Abstract: The present disclosure provides a method and apparatus for acquiring a pre-trained model, an electronic device and a storage medium, and relates to the field of artificial intelligence, such as the natural language processing field, the deep learning field, or the like. The method may include: adding, in a process of training a pre-trained model using training sentences, a learning objective corresponding to syntactic information for a self-attention module in the pre-trained model; and training the pre-trained model according to the defined learning objective. The solution of the present disclosure may improve a performance of the pre-trained model, and reduce consumption of computing resources, or the like.
-
公开(公告)号:US20220415072A1
公开(公告)日:2022-12-29
申请号:US17901897
申请日:2022-09-02
Inventor: Jingtuo LIU
IPC: G06V30/412 , G06V30/413
Abstract: The present disclosure provides an image processing method, a text recognition method and an apparatus. The image processing method includes: preprocessing acquired sample images to obtain position information, image blocks and text content corresponding to fields in the sample images respectively; making a mask prediction on the position information of the fields according to the position information, the image blocks and the text content corresponding to the fields respectively to obtain a prediction result; and training according to the prediction result to obtain a text recognition model, where the text recognition model is used to perform text recognition on a to-be-recognized image.
-
公开(公告)号:US20220414474A1
公开(公告)日:2022-12-29
申请号:US17901803
申请日:2022-09-01
Inventor: Hongjian Shi , Xinwei Feng , Feifei Li , Chenyang Guo , Xueqian Wu , Meng Tian , Yu Sun
IPC: G06N3/08 , G06F16/953
Abstract: A search method based on a neural network model is provided. The neural network model includes a semantic representation model, a recall model, and a ranking model. The present disclosure relates to the field of artificial intelligence, and in particular to the technical field of search. An implementation of the method comprises: inputting a target search and a plurality of objects to be matched to the semantic representation model to obtain a first output of the semantic representation model; inputting the first output of the semantic representation model to the recall model, and obtaining at least one recall object matching the target search from the plurality of objects to be matched by using the recall model; and inputting a second output of the semantic representation model to the ranking model, and obtaining a matching value of each of the at least one recall object by using the ranking model.
-
公开(公告)号:US20220410939A1
公开(公告)日:2022-12-29
申请号:US17901428
申请日:2022-09-01
Inventor: Lei ZHANG , Kai YANG , Qijuan YIN , Wuzhao ZHANG , Xiaoyan WANG
Abstract: A collision detection method, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence technology, and in particular to fields of intelligent transportation and autonomous driving technologies. The method includes: determining a predicted travel range of a target object based on a planned travel trajectory of the target object and a historical travel trajectory of the target object; determining, in response to a target obstacle being detected, a predicted travel range of the target obstacle based on a current travel state of the target obstacle; and determining whether the target object has a risk of colliding with the target obstacle, based on the predicted travel range of the target object and the predicted travel range of the target obstacle.
-
公开(公告)号:US20220406034A1
公开(公告)日:2022-12-22
申请号:US17822898
申请日:2022-08-29
Inventor: Jingru GAN , Haiwei WANG , Jinchang LUO , Kunbin CHEN , Wei HE , Shuhui WANG
IPC: G06V10/74 , G06F40/295 , G06V10/80
Abstract: A method for extracting information, includes: obtaining an information stream comprising text and an image; generating, according to the text, embedded representations of textual entity mentions and a textual similarity matrix of the textual entity mentions and candidate textual entities; generating, according to the image, embedded representations of image entity mentions and an image similarity matrix of the image entity mentions and candidate image entities; and determining, based on an optimal transport, target textual entities of the textual entity mentions and target image entities of the image entity mentions according to the embedded representations of the textual entity mentions, the embedded representations of the image entity mentions, the textual similarity matrix and the image similarity matrix.
-
公开(公告)号:US20220405020A1
公开(公告)日:2022-12-22
申请号:US17896811
申请日:2022-08-26
Inventor: Zhengli YI
IPC: G06F3/06
Abstract: The present disclosure provides a method and apparatus for writing data in an append mode, a device and a storage medium. The present disclosure relates to the field of cloud storage technology, and can be applied to a cloud platform. The method includes: acquiring to-be-written data, and writing the to-be-written data into a magnetic disk; writing first index information of the to-be-written data in a memory; storing, in response to determining that the number of pieces of second index information is greater than a first preset threshold, the second index information into storage hardware, the second index information including the first index information; and writing first identifier information corresponding to the second index information in the memory.
-
177.
公开(公告)号:US20220392493A1
公开(公告)日:2022-12-08
申请号:US17887179
申请日:2022-08-12
Inventor: Jimin Pi , Xin Wang , Guodong Guo
IPC: G11B27/036 , G06V30/416 , G10L13/08 , G06F16/735
Abstract: This disclosure provides a video generation method, a video generation apparatus, an electronic device, a storage medium and a program product, and relates to the field of artificial intelligence technology, and in particular to the field of computer vision technology and deep learning technology. A specific implementation includes: obtaining document content information of a document; extracting, from the document content information, populating information for multiple scenes in a preset video template; populating the populating information for the multiple scenes into corresponding scenes in the preset video template, respectively, to obtain image information of the multiple scenes; generating audio information of the multiple scenes according to the populating information for the multiple scenes; generating a video of the document based on the image information and audio information of the multiple scenes.
-
公开(公告)号:US20220392267A1
公开(公告)日:2022-12-08
申请号:US17820588
申请日:2022-08-18
Inventor: Xin LI , Shengzhao WEN , Haocheng FENG
Abstract: An update method for a face database, and a face recognition method, an apparatus and a system, including: obtaining a face image set belonging to a same user as an obtained current face image in an original face database, where the face database includes a face image set of at least one user, and the face image set includes a stored face image; determining similarity between the current face image and the stored face image of the same user, where there is a count value for the stored face image of the same user, and the count value represents a number of consecutive unsuccessful matches or consecutive successful matches between the stored face image of the same user and a further face image of the same user; and updating the original face database according to the similarity and the count value.
-
公开(公告)号:US20220392251A1
公开(公告)日:2022-12-08
申请号:US17820108
申请日:2022-08-16
Inventor: Shichang Zhang , Ziyuan Guo , Yafei Zhao , Chao Chen , Xirui Fan
Abstract: A method for generating an object model includes: obtaining an initial morphable model; obtaining a plurality of initial images of an object, and depth images corresponding to the plurality of initial images; obtaining a plurality of target topological images by processing the plurality of initial images based on the depth images; obtaining a plurality of models to be synthesized by processing the initial morphable model based on the plurality of target topological images; and generating a target object model based on the plurality of models to be synthesized.
-
公开(公告)号:US20220392243A1
公开(公告)日:2022-12-08
申请号:US17890629
申请日:2022-08-18
Inventor: Shanshan LIU , Meina QIAO , Liang WU , Pengyuan LYU , Sen FAN , Chengquan ZHANG , Kun YAO
Abstract: A method for training a text classification model and an electronic device are provided. The method may include: acquiring a set of to-be-trained images, the set of to-be-trained images including at least one sample image; determining predicted position information and predicted attribute information of each text line in each sample image based on each sample image; and training to obtain the text classification model, based on the annotation position information and the annotation attribute information of each text line in each sample image, and the predicted position information and the predicted attribute information of each text line in each sample image, and the text classification model is used to detect attribute information of each text line in an to-be-recognized image.
-
-
-
-
-
-
-
-
-