-
公开(公告)号:US20220172707A1
公开(公告)日:2022-06-02
申请号:US17671548
申请日:2022-02-14
Inventor: Jun WANG , Wing Yip LAM
IPC: G10L15/06 , G10L15/02 , G10L15/183 , G10L15/22
Abstract: A speech recognition method includes: obtaining first sample speech data corresponding to a target user and a first reference speech recognition result corresponding to the first sample speech data; obtaining a pre-update target model; inputting the first sample speech data into the pre-update target model, and performing speech recognition by using a target speech extraction model, a target feature extraction model, and a target speech recognition model, to obtain a first model output result; obtaining a target model loss value corresponding to the target feature extraction model according to the first model output result and the first reference speech recognition result; and updating a model parameter of the target feature extraction model in the pre-update target model according to the target model loss value, to obtain a post-update target model.
-
公开(公告)号:US20170288887A1
公开(公告)日:2017-10-05
申请号:US15630791
申请日:2017-06-22
Inventor: Jun WANG , Tingji LIU , Han LI , Song CHAI , Xucheng TANG , Yi SHAN
IPC: H04L12/58 , G06F3/0481 , H04W4/14
CPC classification number: H04L51/14 , G06F3/04812 , G06F3/0482 , G06F3/04842 , G06F3/0488 , H04L51/04 , H04L51/06 , H04L51/38 , H04W4/14
Abstract: A message forwarding method performed at an electronic device having one or more processors and memory storing a plurality of programs for forwarding messages using an instant messaging application, includes: displaying a dialog box including one or more chat messages associated with a first user account of the instant messaging application; selecting one or more chat messages in the dialog box; obtaining message content and associated information of each selected chat message, the associated information including one or more of: a message sender and a sending time of the chat message, a group name of a group corresponding to the dialog box, identifiers of participants of the group; and forwarding the message content and the associated information of each chat message to a second user account of the instant messaging application.
-
3.
公开(公告)号:US20150134235A1
公开(公告)日:2015-05-14
申请号:US14600987
申请日:2015-01-20
Inventor: Yi SHAN , Pinlin CHEN , Dacheng ZHUO , Liang WU , Ling LI , Jun WANG
IPC: G01C21/26
CPC classification number: G01C21/26 , G01C21/3679 , G06F17/30241
Abstract: The present disclosure relates to a method and an apparatus for displaying a geographic location. The method comprises providing a terminal device to a user, wherein the terminal device includes a processor and a screen. Through a processor of the terminal device, the method comprises receiving a positioning instruction from the user; acquiring a first location based on the positioning instruction; acquiring information of at least one point of interest (POI) associated with the first location; displaying the first location on a map displayed in a first display area on the screen; and displaying a first POI list in a second display area on the screen, wherein the first POI list includes at least one entry being displayed in a first order, each entry includes the information of a POI in the at least one POI.
Abstract translation: 本公开涉及一种用于显示地理位置的方法和装置。 该方法包括向用户提供终端设备,其中终端设备包括处理器和屏幕。 通过终端设备的处理器,该方法包括从用户接收定位指令; 基于定位指令获取第一位置; 获取与所述第一位置相关联的至少一个兴趣点(POI)的信息; 在显示在屏幕上的第一显示区域中的地图上显示第一位置; 以及在所述屏幕上的第二显示区域中显示第一POI列表,其中所述第一POI列表包括以第一顺序显示的至少一个条目,每个条目包括所述至少一个POI中的POI的信息。
-
公开(公告)号:US20250053980A1
公开(公告)日:2025-02-13
申请号:US18928142
申请日:2024-10-27
Inventor: Jun WANG , Jinkun HOU , Runzeng GUO , Shaoming WANG , Hang ZHOU , Xiaojie CHEN
IPC: G06Q20/40 , G06V10/774 , G06V10/82 , G06V40/10
Abstract: A biometric payment processing method includes: obtaining image data, the image data comprising a plurality of images of an organism that are successively acquired; detecting a target part in an image in the image data, the target part being a part to which a biometric payment function is bound in the organism; determining, in response to that the target part is detected from the plurality of images, a movement speed corresponding to the target part in the plurality of images; and performing a payment operation based on the target part in response to that the movement speed is less than a speed threshold.
-
5.
公开(公告)号:US20240257562A1
公开(公告)日:2024-08-01
申请号:US18626162
申请日:2024-04-03
Inventor: Yafei YUAN , Wen GE , Lulu JIAO , Jiayu HUANG , Runzeng GUO , Ruixin ZHANG , Yingyi ZHANG , Hang ZHOU , Jun WANG
CPC classification number: G06V40/67 , G06T7/11 , G06T7/62 , G06T7/70 , G06V10/25 , G06V10/761 , G06V40/1347 , G06V2201/07
Abstract: This application discloses a palm image recognition method performed by a computer device. The method includes: performing palm detection on a palm image captured by a camera to generate a palm box for a palm in the palm image (304); determining location information of the palm relative to the camera based on the palm box and the palm image; and displaying a palm identifier corresponding to the palm based on the location information, the palm identifier being used for indicating the palm to move to a preset spatial location corresponding to the camera to obtain an object identifier corresponding to the palm image. A user, based on the palm identifier, moves its palm to the preset spatial location corresponding to the camera to perform palm image recognition.
-
公开(公告)号:US20220238117A1
公开(公告)日:2022-07-28
申请号:US17720876
申请日:2022-04-14
Abstract: A voice identity feature extractor training method includes extracting a voice feature vector of training voice. The method may include determining a corresponding I-vector according to the voice feature vector of the training voice. The method may include adjusting a weight of a neural network model by using the I-vector as a first target output of the neural network model, to obtain a first neural network model. The method may include obtaining a voice feature vector of target detecting voice and determining an output result of the first neural network model for the voice feature vector of the target detecting voice. The method may include determining an I-vector latent variable. The method may include estimating a posterior mean of the I-vector latent variable, and adjusting a weight of the first neural network model using the posterior mean as a second target output, to obtain a voice identity feature extractor.
-
7.
公开(公告)号:US20220172708A1
公开(公告)日:2022-06-02
申请号:US17672565
申请日:2022-02-15
Inventor: Jun WANG , Wingyip LAM , Dan SU , Dong YU
Abstract: A speech separation model training method and apparatus, a computer-readable storage medium, and a computer device are provided, the method including: obtaining first audio and second audio, the first audio including target audio and having corresponding labeled audio, and the second audio including noise audio. obtaining an encoding model, an extraction model, and an initial estimation model; performing unsupervised training on the encoding model, the extraction model, and the estimation model according to the second audio, and adjusting model parameters of the extraction model and the estimation model; performing supervised training on the encoding model and the extraction model according to the first audio and the labeled audio corresponding to the first audio, and adjusting a model parameter of the encoding model; continuously performing the unsupervised training and the supervised training, so that the unsupervised training and the supervised training overlap, and the training is not finished until a training stop condition is met.
-
8.
公开(公告)号:US20220004870A1
公开(公告)日:2022-01-06
申请号:US17476345
申请日:2021-09-15
Inventor: Jun WANG , Wing Yip LAM , Dan SU , Dong YU
Abstract: This application provides a speech recognition and apparatus and a neural network training method and apparatus, and relates to the field of Artificial Intelligence (AI) technologies. The neural network training method is performed by an electronic device and includes: obtaining sample data, the sample data including a mixed speech spectrum and a labeled phoneme thereof; extracting a target speech spectrum from the mixed speech spectrum by using a first subnetwork; adaptively transforming the target speech spectrum by using a second subnetwork, to obtain an intermediate transition representation; performing phoneme recognition based on the intermediate transition representation by using a third subnetwork; and updating parameters of the first subnetwork, the second subnetwork, and the third subnetwork according to a result of the phoneme recognition and the labeled phoneme.
-
公开(公告)号:US20210233513A1
公开(公告)日:2021-07-29
申请号:US17230515
申请日:2021-04-14
Abstract: A neural network training method is provided. The method includes obtaining an audio data stream, performing, for different audio data of each time frame in the audio data stream, feature extraction in each layer of a neural network, to obtain a depth feature outputted by a corresponding time frame, fusing, for a given label in labeling data, an inter-class confusion measurement index and an intra-class distance penalty value relative to the given label in a set loss function for the audio data stream through the depth feature, and updating a parameter in the neural network by using a loss function value obtained through fusion.
-
公开(公告)号:US20200372905A1
公开(公告)日:2020-11-26
申请号:US16989844
申请日:2020-08-10
Abstract: A mixed speech recognition method, a mixed speech recognition apparatus, and a computer-readable storage medium are provided. The mixed speech recognition method includes: monitoring an input of speech input and detecting an enrollment speech and a mixed speech; acquiring speech features of a target speaker based on the enrollment speech; and determining speech belonging to the target speaker in the mixed speech based on the speech features of the target speaker. The enrollment speech includes preset speech information, and the mixed speech is non-enrollment speech inputted after the enrollment speech.
-
-
-
-
-
-
-
-
-