-
Publication number: US20230352011A1
Publication date: 2023-11-02
Application number: US17732798
Application date: 2022-04-29
Applicant: Zoom Video Communications, Inc.
Inventor: Awni Yusuf Hannun , Sebastian Stüker
CPC classification number: G10L15/22 , G10L15/005 , G10L25/57 , G06F40/58
Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device, and a source-language. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may update the source-language to the identified language. Numerous other aspects are described.
-
Publication number: US20250046300A1
Publication date: 2025-02-06
Application number: US18228349
Application date: 2023-07-31
Applicant: Zoom Video Communications, Inc.
Inventor: Sebastian Stüker , Jianfang Zhai , Shenquan Zhang
Abstract: One example method includes receiving an audio input from a user; determining, using a first trained model, a plurality of candidate commands; determining, using a second trained model, a recognized command from the plurality of candidate commands; and identifying a corresponding valid command in a set of valid commands based on the recognized command.
-
Publication number: US12074720B2
Publication date: 2024-08-27
Application number: US17732826
Application date: 2022-04-29
Applicant: Zoom Video Communications, Inc.
Inventor: Awni Yusuf Hannun , Sebastian Stüker
CPC classification number: H04L12/1818 , G06F40/58 , G10L15/005
Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may provide the identified-language to the client device. Numerous other aspects are described.
-
Publication number: US20240372740A1
Publication date: 2024-11-07
Application number: US18773974
Application date: 2024-07-16
Applicant: Zoom Video Communications, Inc.
Inventor: Awni Yusuf Hannun , Sebastian Stüker
Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may provide the identified-language to the client device. Numerous other aspects are described.
-
Publication number: US20230353406A1
Publication date: 2023-11-02
Application number: US17733401
Application date: 2022-04-29
Applicant: Zoom Video Communications, Inc.
Inventor: Awni Y. Hannun , Guitang Lan , Sebastian Stüker
CPC classification number: H04L12/1831 , G06F40/30 , G06F40/40 , G10L15/26 , G06N20/00
Abstract: One example method includes receiving, by a virtual conference provider, a list of words associated with an entity or a context; establishing, by the virtual conference provider, a virtual conference; joining, by the virtual conference provider, a plurality of participants to the virtual conference; determining, by the virtual conference provider, that the entity or the context is associated with the virtual conference; and generating, using a machine learning (“ML”) model, a transcript of the virtual conference based on audio streams exchanged between the plurality of participants and the list of words.
-
Publication number: US20230353400A1
Publication date: 2023-11-02
Application number: US17733106
Application date: 2022-04-29
Applicant: Zoom Video Communications, Inc.
Inventor: Thai Son Nguyen , Sebastian Stüker
IPC: H04L12/18 , G10L15/26 , G06F40/242 , G06F40/58
CPC classification number: H04L12/1818 , G10L15/26 , G06F40/242 , G06F40/58
Abstract: An example method includes hosting, by a conference provider, a virtual conference between a plurality of client devices exchanging audio streams; receiving, during the virtual conference, a first plurality of audio segments of a first audio stream from a first client device of the plurality of client devices; receiving, during the virtual conference, a second plurality of audio segments of a second audio stream from a second client device of the plurality of client devices; transcribing, by a transcription process, the first plurality of audio segments to create a first transcription; transcribing, by the transcription process, the second plurality of audio segments to create a second transcription; providing, during the virtual conference, the first transcription and the second transcription to the first and second client devices.
-
Publication number: US20230353399A1
Publication date: 2023-11-02
Application number: US17732826
Application date: 2022-04-29
Applicant: Zoom Video Communications, Inc.
Inventor: Awni Yusuf Hannun , Sebastian Stüker
CPC classification number: H04L12/1818 , G06F40/58 , G10L15/005
Abstract: In some aspects, a computing device may access audio information comprising an audio stream from a client device. The computing device may provide an audio segment from the audio stream to a language identification process of the computing device comprising a machine learning model that is trained to identify a language of a plurality of languages within recorded speech. The computing device may identify an identified-language of the plurality of languages for the speech based at least in part on the audio segment. The computing device may provide the identified-language to the client device. Numerous other aspects are described.