-
公开(公告)号:US20250087215A1
公开(公告)日:2025-03-13
申请号:US18960064
申请日:2024-11-26
Applicant: Cisco Technology, Inc.
Inventor: Ali Mouline , Christopher Rowen , David Guoqing Zhang , Francis Anthony Kurupacheril
IPC: G10L15/22 , G10L15/08 , G10L15/30 , H04L65/401
Abstract: Presented herein are techniques in which a device detecting a phrase spoken in an online collaboration session between a plurality of users, the phrase being spoken by a first user to one or more second users. The device determines that the phrase indicates an issue with a quality of user experience of the online collaboration session, labels a log of metrics associated with the online collaboration session with a time stamp corresponding to a time when the phrase was spoken; and performs one or more actions to improve the user experience based on detecting the phrase.
-
公开(公告)号:US20230129867A1
公开(公告)日:2023-04-27
申请号:US17978730
申请日:2022-11-01
Applicant: Cisco Technology, Inc.
Inventor: Raul Alejandro Casas , Marcin Ciolek , Samer Hijazi , Dror Maydan , Hua Mu , Erik Panu , Christopher Rowen
Abstract: Systems and methods are disclosed for audio group identification for conferencing. For example, methods may include joining a conference call using a network interface; accessing an audio signal that has been captured using a microphone; detecting a control signal in the audio signal; and, responsive to detection of the control signal, invoking modification of an audio path of the conference call.
-
公开(公告)号:US20230117129A1
公开(公告)日:2023-04-20
申请号:US17504726
申请日:2021-10-19
Applicant: Cisco Technology, Inc.
Inventor: Ali Mouline , Christopher Rowen , David Guoqing Zhang , Francis Anthony Kurupacheril
Abstract: Presented herein are techniques in which a device detecting a phrase spoken in an online collaboration session between a plurality of users, the phrase being spoken by a first user to one or more second users. The device determines that the phrase indicates an issue with a quality of user experience of the online collaboration session, labels a log of metrics associated with the online collaboration session with a time stamp corresponding to a time when the phrase was spoken, to provide a labeled log of metrics; and performs one or more actions to improve the user experience based on detecting the phrase.
-
公开(公告)号:US20250131930A1
公开(公告)日:2025-04-24
申请号:US18540084
申请日:2023-12-14
Applicant: Cisco Technology, Inc.
Inventor: Michal Sulewski , Marcin Ciolek , Mihailo Kolundzija , Raul A. Casas , Samer Lutfi Hijazi , Christopher Rowen
IPC: G10L19/022 , G10L19/00 , G10L19/038
Abstract: A method comprises: storing n dimension (nD) dictionaries (nD dictionaries) where n decreases from a highest dimension to a lowest dimension, each nD dictionary including codewords for sequences of n symbols that are of a limited number that is less than all possible sequences of n symbols; storing key blocks for corresponding ones of the nD dictionaries, each key block configured with keys that map sequences of n−1 symbols to dictionaries of a corresponding one of the nD dictionaries that includes the codewords; receiving a sequence of symbols that represent indices of codevectors of a vector quantizer codebook that are representative of audio; determining a codeword using the key blocks and the nD dictionaries; and encoding a current symbol of the sequence of symbols using the codeword.
-
公开(公告)号:US12230262B2
公开(公告)日:2025-02-18
申请号:US17504726
申请日:2021-10-19
Applicant: Cisco Technology, Inc.
Inventor: Ali Mouline , Christopher Rowen , David Guoqing Zhang , Francis Anthony Kurupacheril
IPC: G10L15/22 , G10L15/08 , G10L15/30 , H04L65/401 , G06F40/30 , H04L65/403 , H04L65/80
Abstract: Presented herein are techniques in which a device detecting a phrase spoken in an online collaboration session between a plurality of users, the phrase being spoken by a first user to one or more second users. The device determines that the phrase indicates an issue with a quality of user experience of the online collaboration session, labels a log of metrics associated with the online collaboration session with a time stamp corresponding to a time when the phrase was spoken, to provide a labeled log of metrics; and performs one or more actions to improve the user experience based on detecting the phrase.
-
公开(公告)号:US20240371392A1
公开(公告)日:2024-11-07
申请号:US18773339
申请日:2024-07-15
Applicant: Cisco Technology, Inc.
Inventor: Samer Hijazi , Xuehong Mao , Raul Alejandro Casas , Kamil Krzysztof Wojcicki , Dror Maydan , Christopher Rowen
IPC: G10L21/0364 , G10L15/22 , G10L15/25 , G10L17/00 , G10L21/055 , G10L25/30
Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.
-
公开(公告)号:US11916686B2
公开(公告)日:2024-02-27
申请号:US17978730
申请日:2022-11-01
Applicant: Cisco Technology, Inc.
Inventor: Raul Alejandro Casas , Marcin Ciolek , Samer Hijazi , Dror Maydan , Hua Mu , Erik Panu , Christopher Rowen
CPC classification number: H04L12/1813 , G06F3/165 , G10L25/78 , H04M3/568 , H04M9/02
Abstract: Systems and methods are disclosed for audio group identification for conferencing. For example, methods may include joining a conference call using a network interface; accessing an audio signal that has been captured using a microphone; detecting a control signal in the audio signal; and, responsive to detection of the control signal, invoking modification of an audio path of the conference call.
-
8.
公开(公告)号:US20250131919A1
公开(公告)日:2025-04-24
申请号:US18539791
申请日:2023-12-14
Applicant: Cisco Technology, Inc.
Inventor: Xuehong Mao , Samer Lutfi Hijazi , Christopher Rowen , Mathew Shaji Kavalekalam , Ivana Balic , Mengjun Leng , Yusuf Ziya Isik , Adam Ali Sabra , Amir Salah Abdelsamie Abdelwahed , Samir Ouelha , Mihailo Kolundzija
Abstract: A neural network audio codec system and related methods are provided. In one example, a method is provided comprising: obtaining speech audio to be encoded; applying the speech audio to an audio encoder that is part of a neural network audio codec system that includes the audio encoder and an audio decoder. The audio encoder and the audio decoder have been trained in an end-to-end manner. The speech audio is encoded with the audio encoder to generate embedding vectors that represent a snapshot of speech audio attributes over successive timeframes of the raw speech audio, and from the embedding vectors, codeword indices are generated to entries in a codebook. The codeword indices are then transmitted or stored for later retrieval and processing by the audio decoder.
-
公开(公告)号:US20220392478A1
公开(公告)日:2022-12-08
申请号:US17471979
申请日:2021-09-10
Applicant: Cisco Technology, Inc.
Inventor: Samer Lutfi Hijazi , Christopher Rowen , Xuehong Mao , Ivana M. Balic , Raul Alejandro Casas , Savita Kini
IPC: G10L21/0364 , H04M3/56 , G10L21/0232 , G10L21/028
Abstract: An endpoint selectively enhances a captured audio signal based on an operating mode. The endpoint obtains an audio input signal of multiple users in a physical location. The audio input signal is captured by a microphone. The endpoint separates voice signals from the audio input signal and determines an operating mode for an audio output signal. The endpoint selectively adjusts each of the voice signals based on the operating mode to generate the audio output signal.
-
-
-
-
-
-
-
-