-
Publication No.: US20220301545A1
Publication Date: 2022-09-22
Application No.: US17830130
Filing Date: 2022-06-01
Inventor: Yongguo KANG , Junchao WANG
IPC: G10L13/08 , G10L13/04 , G10L17/14 , G10L17/02 , G10L13/033
Abstract: A method for speech generation includes: acquiring speech information of an original speaker; performing text feature extraction on the speech information to obtain a text feature corresponding to the speech information; converting the text feature to an acoustic feature corresponding to a target speaker; and generating a target speech signal based on the acoustic feature.
-
Publication No.: US20220301334A1
Publication Date: 2022-09-22
Application No.: US17832735
Filing Date: 2022-06-06
Inventor: Yuechen YU , Yulin LI , Chengquan ZHANG , Kun YAO
IPC: G06V30/416 , G06F40/18 , G06V30/413
Abstract: The present disclosure provides a table generating method and apparatus, an electronic device, a storage medium and a product. A specific implementation is: recognizing at least one table object in a to-be-recognized image and obtaining a table property respectively corresponding to the at least one table object, where the table property of any table object includes a cell property or a non-cell property; determining at least one target object with the cell property in the at least one table object; determining a cell region respectively corresponding to the at least one target object to obtain cell position information respectively corresponding to the at least one target object; and generating a spreadsheet corresponding to the to-be-recognized image according to the cell position information respectively corresponding to the at least one target object.
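
The last step of the abstract above (turning cell position information into a spreadsheet) can be illustrated with a minimal sketch. It assumes each recognized cell is already available as a text string plus an axis-aligned pixel box, and it infers rows and columns by grouping top-left corners that lie within a tolerance. The names `cells_to_grid`, `_cluster`, `_index` and the `tol` parameter are hypothetical and are not taken from the patent.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels
Cell = Tuple[str, Box]                   # (recognized text, cell region)


def _cluster(values: List[float], tol: float) -> List[float]:
    """Collapse coordinates within `tol` of a group's first value into one representative."""
    representatives: List[float] = []
    for v in sorted(values):
        if not representatives or v - representatives[-1] > tol:
            representatives.append(v)
    return representatives


def _index(value: float, representatives: List[float]) -> int:
    """Return the index of the representative closest to `value`."""
    return min(range(len(representatives)), key=lambda i: abs(representatives[i] - value))


def cells_to_grid(cells: List[Cell], tol: float = 5.0) -> List[List[str]]:
    """Place each cell's text in a row/column grid inferred from its top-left corner."""
    rows = _cluster([box[1] for _, box in cells], tol)  # distinct top edges
    cols = _cluster([box[0] for _, box in cells], tol)  # distinct left edges
    grid = [["" for _ in cols] for _ in rows]
    for text, box in cells:
        grid[_index(box[1], rows)][_index(box[0], cols)] = text
    return grid
```

The resulting nested list can be written out with any spreadsheet or CSV writer. Merged cells and skewed images are outside the scope of this sketch.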
-
Publication No.: US20220301108A1
Publication Date: 2022-09-22
Application No.: US17836913
Filing Date: 2022-06-09
Inventor: Jiayin CAI , Huaifei XING
Abstract: The present disclosure provides a method and an apparatus for enhancing image quality, a device, and a medium, relates to the field of artificial intelligence and specifically to computer vision and deep learning technologies, and can be applied to an image processing scenario. The method includes: determining an ROI and an RONI in an image to be processed; inputting the ROI to an ROI image quality enhancement model, to obtain first image data output from the ROI image quality enhancement model; inputting the RONI to an RONI image quality enhancement model, to obtain second image data output from the RONI image quality enhancement model; and blending the first image data and the second image data.
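
As a rough illustration of the flow in the abstract above, the sketch below routes region-of-interest (ROI) pixels and region-of-non-interest (RONI) pixels through two separate enhancement functions and merges the results with a hard mask. The per-pixel callables stand in for the two enhancement models, and the function name, the grayscale list-of-lists image type, and the hard-mask blend are all assumptions. The patent does not specify them here.

```python
from typing import Callable, List

Image = List[List[float]]  # grayscale pixel values
Mask = List[List[bool]]    # True where the pixel belongs to the ROI


def enhance_and_blend(
    image: Image,
    roi_mask: Mask,
    enhance_roi: Callable[[float], float],
    enhance_roni: Callable[[float], float],
) -> Image:
    """Enhance ROI and RONI pixels with different functions and combine them.

    `enhance_roi` and `enhance_roni` are placeholders for the two image
    quality enhancement models; here each maps one pixel value to another.
    """
    return [
        [enhance_roi(p) if in_roi else enhance_roni(p) for p, in_roi in zip(row, mask_row)]
        for row, mask_row in zip(image, roi_mask)
    ]
```

A production system would more likely blend with a soft mask near the region boundary to avoid visible seams. The hard mask is used only to keep the sketch short.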
-
Publication No.: US20220284807A1
Publication Date: 2022-09-08
Application No.: US17824966
Filing Date: 2022-05-26
Inventor: Xinjiang LU , Dejing Dou
IPC: G08G1/01
Abstract: A method of predicting traffic volume, an electronic device, and a storage medium are provided, which relate to the field of artificial intelligence technology, in particular to big data and deep learning technologies. The method includes: generating, for a plurality of traffic regions, a function relation graph and a volume relation graph; generating a volume feature of a target traffic region among the plurality of traffic regions, according to historical volume information of the target traffic region; generating a volume and function relation feature for the target traffic region, based on the function relation graph and the volume relation graph; and predicting a volume of the target traffic region according to the volume feature and the volume and function relation feature.
-
Publication No.: US20220277732A1
Publication Date: 2022-09-01
Application No.: US17747732
Filing Date: 2022-05-18
Inventor: Qingen ZHAO
Abstract: A method and apparatus for training a speech recognition model, an electronic device and a storage medium are provided. An implementation of the method may include: determining a plurality of feature vectors based on audio feature data corresponding to a first target frame in a sample speech, wherein the sample speech comprises a conversation among a plurality of objects and the sample speech has a corresponding sample text; generating a predicted text element corresponding to the first target frame based on an adjacent text element preceding a text element corresponding to the first target frame in the sample text, wherein the text element and the adjacent text element are directed at a target object in the plurality of objects; obtaining a first target text element based on the predicted text element and a first feature vector in the plurality of feature vectors; and adjusting the speech recognition model based on the first target text element and the sample text, to obtain a trained speech recognition model.
-
Publication No.: US20220269952A1
Publication Date: 2022-08-25
Application No.: US17739555
Filing Date: 2022-05-09
Inventor: Yihang CHENG , Hongke ZHAO , Hengshu ZHU , Zheng DONG , Xi ZHANG
Abstract: Provided are a method of training a prediction model, a prediction method, an electronic device and a medium, which relate to the field of artificial intelligence technology, and in particular, to the field of Big Data. A prediction model includes a main prediction model and an auxiliary prediction model, a training sample set includes a project information sample of a project and an item information sample of an item associated with the project, a project information sample includes project property information and project comment information, and an item information sample includes item comment information. The method includes: inputting the project comment information to the auxiliary prediction model to obtain initial prediction semantic information; training the main prediction model by using the project property information and the initial prediction semantic information; and training the auxiliary prediction model by using the item comment information.
-
Publication No.: US20220254251A1
Publication Date: 2022-08-11
Application No.: US17730961
Filing Date: 2022-04-27
Inventor: Yuting Du , Xu Dai , Mengyao Sun , Shilei Wen
Abstract: Provided are a method of recognizing illegal parking of a vehicle, a device, and a storage medium, which relate to the field of artificial intelligence, and in particular to the fields of deep learning, cloud computing, computer vision, etc. The method includes: obtaining a video image collected by an electronic device; recognizing a parking area of the vehicle in the video image; determining a shooting angle used by the electronic device for collecting the video image; determining an illegal parking area in the video image based on the shooting angle; and recognizing whether the vehicle is illegally parked or not based on the parking area of the vehicle and the illegal parking area.
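
The final comparison step in the abstract above can be sketched as an overlap test between two regions. The sketch assumes both the vehicle's parking area and the illegal parking area are already available as axis-aligned pixel boxes, and it does not model the shooting-angle step. The 50% overlap threshold and the function names are illustrative assumptions, not values from the patent.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels


def overlap_area(a: Box, b: Box) -> float:
    """Area of the intersection of two axis-aligned boxes (0.0 if disjoint)."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return width * height if width > 0 and height > 0 else 0.0


def is_illegally_parked(parking_area: Box, illegal_area: Box, min_ratio: float = 0.5) -> bool:
    """Flag the vehicle when enough of its parking area lies inside the illegal area.

    `min_ratio` is a hypothetical threshold: the fraction of the vehicle's
    parking area that must fall inside the illegal parking area.
    """
    area = (parking_area[2] - parking_area[0]) * (parking_area[3] - parking_area[1])
    if area <= 0:
        return False  # degenerate box, nothing to compare
    return overlap_area(parking_area, illegal_area) / area >= min_ratio
```

Real camera views usually call for polygonal regions rather than boxes. The same ratio test applies once a polygon intersection routine is substituted for `overlap_area`.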
-
Publication No.: US20220246032A1
Publication Date: 2022-08-04
Application No.: US17727390
Filing Date: 2022-04-22
Inventor: Yuzhe WANG , Jiantao ZHAO
IPC: G08G1/123
Abstract: A method and apparatus for prompting navigation information. The method includes: acquiring, when a user is in an underground vehicle area, traffic behavior statuses of an underground vehicle at stops on a preset navigation route; determining, in response to the traffic behavior status of each stop of adjacent stops on the navigation route being consistent with a first preset traffic behavior status, a base station corresponding to a latter stop in the adjacent stops as a reference station; determining a location of a preset stop on the navigation route, based on a location of the reference station and locations of base stations provided at the stops; and outputting, in response to the traffic behavior status of the underground vehicle at the preset stop being consistent with a second preset traffic behavior status, prompt information for getting off.
-
Publication No.: US20220237935A1
Publication Date: 2022-07-28
Application No.: US17682099
Filing Date: 2022-02-28
Inventor: Jiaming LIU , Licheng TANG
IPC: G06V30/19 , G06T11/20 , G06F40/109 , G06T11/60
Abstract: Provided are a method for training a font generation model, a method for establishing a font library, and a device. The method for training a font generation model includes the following steps: a source-domain sample character is input into the font generation model to obtain a first target-domain generated character; the first target-domain generated character and a preset target-domain sample character are input into a character classification model to obtain a first feature loss of the font generation model; the first target-domain generated character and the target-domain sample character are input into a font classification model to obtain a second feature loss of the font generation model; a target feature loss is determined according to the first feature loss and/or the second feature loss; and the model parameter of the font generation model is updated according to the target feature loss.
-
Publication No.: US20220237388A1
Publication Date: 2022-07-28
Application No.: US17714891
Filing Date: 2022-04-06
Inventor: Xinjiang LU , Yanyan LI , Jingbo ZHOU , Dejing DOU
IPC: G06F40/40 , G06F40/30 , G06F40/177 , G06F40/20
Abstract: A method and apparatus for generating a table description text, a device, and a storage medium are provided. An implementation of the method includes: acquiring a to-be-described table; analyzing the to-be-described table to obtain a set of metalanguage of the to-be-described table; and generating a description text of the to-be-described table based on the metalanguage in the set of metalanguage.
-