Patent search ap:("Zoom Video Communications Page Inc.") AND inv:"Sebastian Stüker"

1.

发明公开
AUTOMATIC SWITCHING BETWEEN LANGUAGES DURING VIRTUAL CONFERENCES 审中-公开

公开(公告)号：US20230352011A1

公开(公告)日：2023-11-02

申请号：US17732798

申请日：2022-04-29

Applicant: Zoom Video Communications, Inc.

Inventor： Awni Yusuf HANNUN , Sebastian Stüker

IPC: G10L15/22 , G10L15/00 , G10L25/57

CPC classification number: G10L15/22 , G10L15/005 , G10L25/57 , G06F40/58

Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device, and a source-language. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may update the source-language to the identified language. Numerous other aspects are described.

2.

发明申请
AUTOMATIC SPEECH RECOGNITION FOR INTERACTIVE VOICE RESPONSE SYSTEMS 有权

公开(公告)号：US20250046300A1

公开(公告)日：2025-02-06

申请号：US18228349

申请日：2023-07-31

Applicant: Zoom Video Communications, Inc.

Inventor： Sebastian Stüker , Jianfang Zhai , Shenquan Zhang

IPC: G10L15/16 , G10L15/06

Abstract: One example method includes receiving an audio input from a user; determining, using a first trained model, a plurality of candidate commands; determining, using a second trained model, a recognized command from the plurality of candidate commands; and identifying a corresponding valid command in a set of valid commands based on the recognized command.

3.

发明授权
Automated language identification during virtual conferences 有权

公开(公告)号：US12074720B2

公开(公告)日：2024-08-27

申请号：US17732826

申请日：2022-04-29

Applicant: Zoom Video Communications, Inc.

Inventor： Awni Yusuf Hannun , Sebastian Stüker

IPC: G06F15/16 , G06F40/58 , G10L15/00 , H04L12/18

CPC classification number: H04L12/1818 , G06F40/58 , G10L15/005

Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may provide the identified-language to the client device. Numerous other aspects are described.

4.

发明申请
AUTOMATED LANGUAGE IDENTIFICATION DURING VIRTUAL CONFERENCES 有权

公开(公告)号：US20240372740A1

公开(公告)日：2024-11-07

申请号：US18773974

申请日：2024-07-16

Applicant: Zoom Video Communications, Inc.

Inventor： Awni Yusuf Hannun , Sebastian Stüker

IPC: H04L12/18 , G06F40/58 , G10L15/00

Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may provide the identified-language to the client device. Numerous other aspects are described.

5.

发明公开
CONTEXT-BIASING FOR SPEECH RECOGNITION IN VIRTUAL CONFERENCES 审中-公开

公开(公告)号：US20230353406A1

公开(公告)日：2023-11-02

申请号：US17733401

申请日：2022-04-29

Applicant: Zoom Video Communications, Inc.

Inventor： Awni Y. Hannun , Guitang Lan , Sebastian Stüker

IPC: G06F40/30 , G06F40/40 , H04L12/18 , G10L15/26

CPC classification number: H04L12/1831 , G06F40/30 , G06F40/40 , G10L15/26 , G06N20/00

Abstract: One example method includes receiving, by a virtual conference provider, a list of words associated with an entity or a context; establishing, by the virtual conference provider, a virtual conference; joining, by the virtual conference provider, a plurality of participants to the virtual conference; determining, by the virtual conference provider, that the entity or the context is associated with the virtual conference; and generating, using a machine learning (“ML”) model, a transcript of the virtual conference based on audio streams exchanged between the plurality of participants and the list of words.

6.

发明公开
PROVIDING MULTISTREAM AUTOMATIC SPEECH RECOGNITION DURING VIRTUAL CONFERENCES 审中-公开

公开(公告)号：US20230353400A1

公开(公告)日：2023-11-02

申请号：US17733106

申请日：2022-04-29

Applicant: Zoom Video Communications, Inc.

Inventor： Thai Son NGUYEN , Sebastian Stüker

IPC: H04L12/18 , G10L15/26 , G06F40/242 , G06F40/58

CPC classification number: H04L12/1818 , G10L15/26 , G06F40/242 , G06F40/58

Abstract: An example method includes hosting, by a conference provider, a virtual conference between a plurality of client devices exchanging audio streams; receiving, during the virtual conference, a first plurality of audio segments of a first audio stream from a first client device of the plurality of client devices; receiving, during the virtual conference, a second plurality of audio segments of a second audio stream from a second client device of the plurality of client devices; transcribing, by a transcription process, the first plurality of audio segments to create a first transcription; transcribing, by the transcription process, the second plurality of audio segments to create a second transcription; providing, during the virtual conference, the first transcription and the second transcription to the first and second client devices.

7.

发明公开
AUTOMATED LANGUAGE IDENTIFICATION DURING VIRTUAL CONFERENCES 审中-公开

公开(公告)号：US20230353399A1

公开(公告)日：2023-11-02

申请号：US17732826

申请日：2022-04-29

Applicant: Zoom Video Communications, Inc.

Inventor： Awni Yusuf Hannun , Sebastian Stüker

IPC: H04L12/18 , G06F40/58 , G10L15/00

CPC classification number: H04L12/1818 , G06F40/58 , G10L15/005

Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may provide the identified-language to the client device. Numerous other aspects are described.

Patent Agency Ranking