Patent search ap:("Google LLC") AND inv:"Ruoming Pang" Page 7

61.

发明申请
ASYNCHRONOUS DISTRIBUTED DATA FLOW FOR MACHINE LEARNING WORKLOADS 有权

公开(公告)号：US20230118303A1

公开(公告)日：2023-04-20

申请号：US18082415

申请日：2022-12-15

Applicant: Google LLC

Inventor： Jeffrey Adgate Dean , Sudip Roy , Michael Acheson Isard , Aakanksha Chowdhery , Brennan Saeta , Chandramohan Amyangot Thekkath , Daniel William Hurt , Hyeontaek Lim , Laurent El Shafey , Parker Edward Schuh , Paul Ronald Barham , Ruoming Pang , Ryan Sepassi , Sanjay Ghemawat , Yonghui Wu

IPC: G06F9/48 , G06N3/063 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributing machine learning workloads, e.g., computations for training a neural network or computing an inference using a neural network, across multiple hardware accelerators. One of the systems comprises a plurality of accelerator islands, each hardware accelerator island comprising a respective plurality of hardware devices that include a plurality of hardware accelerators and a corresponding host for each of the plurality of hardware accelerators; and a respective scheduler for each of the accelerator islands that is configured to schedule workloads across the plurality of accelerators and corresponding hosts in the accelerator island, wherein the system is configured to: receive data representing a machine learning workload; and assign a respective portion of the machine learning workload to each of the plurality of accelerator islands for scheduling by the respective scheduler for the accelerator island.

62.

发明授权
Attention-based joint acoustic and text on-device end-to-end model 有权

公开(公告)号：US11594212B2

公开(公告)日：2023-02-28

申请号：US17155010

申请日：2021-01-21

Applicant: Google LLC

Inventor： Tara N. Sainath , Ruoming Pang , Ron Weiss , Yanzhang He , Chung-Cheng Chiu , Trevor Strohman

IPC: G10L15/06 , G06N3/08 , G10L15/16 , G10L15/197

Abstract: A method includes receiving a training example for a listen-attend-spell (LAS) decoder of a two-pass streaming neural network model and determining whether the training example corresponds to a supervised audio-text pair or an unpaired text sequence. When the training example corresponds to an unpaired text sequence, the method also includes determining a cross entropy loss based on a log probability associated with a context vector of the training example. The method also includes updating the LAS decoder and the context vector based on the determined cross entropy loss.

63.

发明授权
Using context information with end-to-end models for speech recognition 有权

公开(公告)号：US11545142B2

公开(公告)日：2023-01-03

申请号：US16827937

申请日：2020-03-24

Applicant: Google LLC

Inventor： Ding Zhao , Bo Li , Ruoming Pang , Tara N. Sainath , David Rybach , Deepti Bhatia , Zelin Wu

IPC: G10L15/183 , G10L15/16 , G06N20/00 , G06K9/62 , G06N3/08

Abstract: A method includes receiving audio data encoding an utterance, processing, using a speech recognition model, the audio data to generate speech recognition scores for speech elements, and determining context scores for the speech elements based on context data indicating a context for the utterance. The method also includes executing, using the speech recognition scores and the context scores, a beam search decoding process to determine one or more candidate transcriptions for the utterance. The method also includes selecting a transcription for the utterance from the one or more candidate transcriptions.

64.

发明申请
SINGLE-STAGE MODEL TRAINING FOR NEURAL ARCHITECTURE SEARCH 有权

公开(公告)号：US20220405579A1

公开(公告)日：2022-12-22

申请号：US17613773

申请日：2021-03-03

Applicant: Google LLC

Inventor： Jiahui Yu , Pengchong Jin , Hanxiao Liu , Gabriel Mintzer Bender , Pieter-Jan Kindermans , Mingxing Tan , Xiaodan Song , Ruoming Pang , Quoc V. Le

IPC: G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting a neural network to perform a particular machine learning task while satisfying a set of constraints.

65.

发明授权
Neural architecture search with factorized hierarchical search space 有权

公开(公告)号：US11531861B2

公开(公告)日：2022-12-20

申请号：US16258927

申请日：2019-01-28

Applicant: Google LLC

Inventor： Mingxing Tan , Quoc Le , Bo Chen , Vijay Vasudevan , Ruoming Pang

IPC: G06N3/04 , G06N20/10 , G06F17/15 , G06N3/08

Abstract: The present disclosure is directed to an automated neural architecture search approach for designing new neural network architectures such as, for example, resource-constrained mobile CNN models. In particular, the present disclosure provides systems and methods to perform neural architecture search using a novel factorized hierarchical search space that permits layer diversity throughout the network, thereby striking the right balance between flexibility and search space size. The resulting neural architectures are able to be run relatively faster and using relatively fewer computing resources (e.g., less processing power, less memory usage, less power consumption, etc.), all while remaining competitive with or even exceeding the performance (e.g., accuracy) of current state-of-the-art mobile-optimized models.

66.

发明申请
HARDWARE-OPTIMIZED NEURAL ARCHITECTURE SEARCH 有权

公开(公告)号：US20220019869A1

公开(公告)日：2022-01-20

申请号：US17039178

申请日：2020-09-30

Applicant: Google LLC

Inventor： Sheng Li , Norman Paul Jouppi , Quoc V. Le , Mingxing Tan , Ruoming Pang , Liqun Cheng , Andrew Li

IPC: G06N3/04 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining an architecture for a task neural network that is configured to perform a particular machine learning task on a target set of hardware resources. When deployed on a target set of hardware, such as a collection of datacenter accelerators, the task neural network may be capable of performing the particular machine learning task with enhanced accuracy and speed.

67.

发明申请
Emitting Word Timings with End-to-End Models 有权

公开(公告)号：US20210350794A1

公开(公告)日：2021-11-11

申请号：US17204852

申请日：2021-03-17

Applicant: Google LLC

Inventor： Tara N. Sainath , Basi Garcia , David Rybach , Trevor Strohman , Ruoming Pang

IPC: G10L15/06 , G10L25/30 , G10L25/78

Abstract: A method includes receiving a training example that includes audio data representing a spoken utterance and a ground truth transcription. For each word in the spoken utterance, the method also includes inserting a placeholder symbol before the respective word identifying a respective ground truth alignment for a beginning and an end of the respective word, determining a beginning word piece and an ending word piece, and generating a first constrained alignment for the beginning word piece and a second constrained alignment for the ending word piece. The first constrained alignment is aligned with the ground truth alignment for the beginning of the respective word and the second constrained alignment is aligned with the ground truth alignment for the ending of the respective word. The method also includes constraining an attention head of a second pass decoder by applying the first and second constrained alignments.

68.

发明申请
Deliberation Model-Based Two-Pass End-To-End Speech Recognition 有权

公开(公告)号：US20210225369A1

公开(公告)日：2021-07-22

申请号：US17149018

申请日：2021-01-14

Applicant: Google LLC

Inventor： Ke Hu , Tara N. Sainath , Ruoming Pang , Rohit Prakash Prabhavalkar

IPC: G10L15/18 , G10L15/06 , G06N3/04 , G10L15/187 , G10L15/16 , G10L19/00

Abstract: A method of performing speech recognition using a two-pass deliberation architecture includes receiving a first-pass hypothesis and an encoded acoustic frame and encoding the first-pass hypothesis at a hypothesis encoder. The first-pass hypothesis is generated by a recurrent neural network (RNN) decoder model for the encoded acoustic frame. The method also includes generating, using a first attention mechanism attending to the encoded acoustic frame, a first context vector, and generating, using a second attention mechanism attending to the encoded first-pass hypothesis, a second context vector. The method also includes decoding the first context vector and the second context vector at a context vector decoder to form a second-pass hypothesis

69.

发明授权
Synthesizing speech from text using neural networks 有权

公开(公告)号：US10971170B2

公开(公告)日：2021-04-06

申请号：US16058640

申请日：2018-08-08

Applicant: Google LLC

Inventor： Yonghui Wu , Jonathan Shen , Ruoming Pang , Ron J. Weiss , Michael Schuster , Navdeep Jaitly , Zongheng Yang , Zhifeng Chen , Yu Zhang , Yuxuan Wang , Russell John Wyatt Skerry-Ryan , Ryan M. Rifkin , Ioannis Agiomyrgiannakis

IPC: G10L25/30 , G10L13/047 , G10L13/08 , G06N7/00 , G06N3/08 , G06N3/04 , G06N5/04 , G10L25/18

Abstract: Methods, systems, and computer program products for generating, from an input character sequence, an output sequence of audio data representing the input character sequence. The output sequence of audio data includes a respective audio output sample for each of a number of time steps. One example method includes, for each of the time steps: generating a mel-frequency spectrogram for the time step by processing a representation of a respective portion of the input character sequence using a decoder neural network; generating a probability distribution over a plurality of possible audio output samples for the time step by processing the mel-frequency spectrogram for the time step using a vocoder neural network; and selecting the audio output sample for the time step from the possible audio output samples in accordance with the probability distribution.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification