Abstract:
Some disclosed teleconferencing methods may involve detecting a howl state during a teleconference. The teleconference may involve two or more teleconference client locations and a teleconference server. The teleconference server may be configured for providing full-duplex audio connectivity between the teleconference client locations. The howl state may be a state of acoustic feedback involving two or more teleconference devices in a teleconference client location. Detecting the howl state may involve an analysis of both spectral and temporal characteristics of teleconference audio data. Some disclosed teleconferencing methods may involve determining which client location is causing the howl state. Some such methods may involve mitigating the howl state and/or sending a howl state detection message.
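The abstract describes detecting a howl state via both spectral and temporal analysis, without fixing an algorithm. A minimal sketch of one plausible approach: flag a howl when a tonal spectral peak (low spectral flatness) stays at the same frequency bin for several consecutive frames. The thresholds and the flatness-plus-persistence criterion here are assumptions, not the claimed method.

```python
import numpy as np

def detect_howl(frames, flatness_threshold=0.1, min_sustained_frames=5):
    """Flag a howl-like state when a narrowband spectral peak persists
    over time. `frames` is an iterable of 1-D sample arrays.
    Thresholds are illustrative assumptions, not the patented values."""
    sustained = 0
    prev_peak_bin = None
    for frame in frames:
        spectrum = np.abs(np.fft.rfft(frame * np.hanning(len(frame)))) + 1e-12
        # Spectral flatness: geometric mean / arithmetic mean (low => tonal)
        flatness = np.exp(np.mean(np.log(spectrum))) / np.mean(spectrum)
        peak_bin = int(np.argmax(spectrum))
        if flatness < flatness_threshold and peak_bin == prev_peak_bin:
            sustained += 1
            if sustained >= min_sustained_frames:
                return True  # temporal persistence of a spectral peak
        else:
            sustained = 0
        prev_peak_bin = peak_bin
    return False
```

A sustained sine (feedback-like tone) trips the detector; broadband noise does not, since its flatness stays high and its peak bin wanders.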
Abstract:
Some implementations involve analyzing audio packets received during a time interval that corresponds with a conversation analysis segment to determine network jitter dynamics data and conversational interactivity data. The network jitter dynamics data may provide an indication of jitter in a network that relays the audio data packets. The conversational interactivity data may provide an indication of interactivity between participants of a conversation represented by the audio data. A jitter buffer size may be controlled according to the network jitter dynamics data and the conversational interactivity data. The time interval may include a plurality of talkspurts.
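The abstract pairs a jitter statistic with a conversational-interactivity measure to size the buffer, but leaves the policy open. A hedged sketch of one such policy: start from a high jitter percentile, then add headroom when the conversation is mostly one-way (latency is cheap) and keep the buffer lean when turn-taking is rapid. The percentile, the headroom factor, and the 0..1 interactivity scale are assumptions.

```python
def target_jitter_buffer_ms(jitter_samples_ms, interactivity):
    """Pick a jitter-buffer target from jitter statistics and interactivity.

    jitter_samples_ms: per-packet jitter measurements over the segment.
    interactivity: 0.0 (one-way monologue) .. 1.0 (rapid turn-taking).
    Illustrative policy only; the actual control rule is not specified
    in the abstract."""
    ordered = sorted(jitter_samples_ms)
    p95 = ordered[min(len(ordered) - 1, int(0.95 * len(ordered)))]
    # Interactive conversation: keep buffer lean; monologue: add headroom.
    headroom = 1.0 + 0.5 * (1.0 - interactivity)
    return p95 * headroom
```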
Abstract:
Some implementations involve controlling a jitter buffer size during a teleconference according to a jitter buffer size estimation algorithm based, at least in part, on a cumulative distribution function (CDF). The CDF may be based, at least in part, on a network jitter parameter. The CDF may be initialized according to a parametric model. At least one parameter of the parametric model may be based, at least in part, on legacy network jitter information.
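The abstract describes initializing a jitter CDF from a parametric model seeded with legacy jitter information, then sizing the buffer from the CDF. A minimal sketch, assuming an exponential model whose rate comes from a historical mean-jitter figure and a 95% coverage target; both choices are illustrative, not the claimed parametric model.

```python
import math

def init_cdf_from_legacy(mean_jitter_ms, delays_ms):
    """Initialize a jitter CDF from an exponential parametric model whose
    rate is derived from legacy (historical) mean network jitter.
    The exponential form is an assumption for illustration."""
    lam = 1.0 / mean_jitter_ms
    return [1.0 - math.exp(-lam * d) for d in delays_ms]

def buffer_size_from_cdf(delays_ms, cdf, quantile=0.95):
    """Smallest candidate delay whose CDF value covers the target quantile."""
    for d, p in zip(delays_ms, cdf):
        if p >= quantile:
            return d
    return delays_ms[-1]
```

As packets arrive, the initialized CDF could be updated with measured jitter, so the buffer estimate converges from the legacy prior to live conditions.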
Abstract:
Embodiments of a client device and a method for audio or video conferencing are described. An embodiment includes an offset detecting unit, a configuring unit, an estimator and an output unit. The offset detecting unit detects an offset of speech input to the client device. The configuring unit determines a voice latency from the client device to every far end. The estimator estimates, based on the voice latency, a time when a user at the far end perceives the offset. The output unit outputs, based on the time estimated for the far end, a perceivable signal indicating that the user at the far end has perceived the offset. The perceivable signal helps to avoid collisions between parties.
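The estimator's job reduces to simple clock arithmetic: add each far end's one-way latency to the local speech-offset time, and signal "clear to speak" once every far end has perceived the offset. A minimal sketch; the per-far-end latency map and function names are hypothetical.

```python
def far_end_perception_times(offset_time_s, latencies_ms):
    """Estimate when each far end perceives the near-end speech offset.

    offset_time_s: local clock time at which speech input ended.
    latencies_ms: hypothetical map of far-end id -> one-way voice latency.
    """
    return {end: offset_time_s + ms / 1000.0 for end, ms in latencies_ms.items()}

def all_clear_time(offset_time_s, latencies_ms):
    """Time after which every far end has perceived the offset, i.e. when
    a 'safe to speak' indicator can be shown without risking collision."""
    return max(far_end_perception_times(offset_time_s, latencies_ms).values())
```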
Abstract:
A system for real-time monitoring of user-generated audio content for audio anomalies and a related method are disclosed. In some embodiments, the system is programmed to receive, in real time, audio data generated by a first mobile device, such as a smartphone. The system is programmed to determine, in real time, whether an audio anomaly has occurred from the audio data. The system is programmed to cause, in real time, a presentation of an alert to a second mobile device, which could be the same smartphone, in response to detecting an occurrence of the audio anomaly.
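The abstract does not define what counts as an "audio anomaly." One simple stand-in for the real-time check: flag frames whose RMS level deviates sharply from the recent running average. The window length and deviation factor are assumptions for illustration.

```python
def detect_anomalies(frame_rms_values, window=10, factor=3.0):
    """Flag frame indices whose RMS level deviates sharply from the
    recent average; a simple stand-in for the system's real-time
    anomaly check (the actual criterion is unspecified)."""
    anomalies = []
    history = []
    for i, rms in enumerate(frame_rms_values):
        if len(history) >= window:
            mean = sum(history[-window:]) / window
            # Anomaly: far above or far below the recent level.
            if rms > factor * mean or (mean > 0 and rms < mean / factor):
                anomalies.append(i)
        history.append(rms)
    return anomalies
```

In a streaming deployment each flagged index would trigger the alert to the second device immediately, rather than being collected in a list.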
Abstract:
Example embodiments disclosed herein relate to the estimation of reverberant energy components from audio sources. A method of estimating a reverberant energy component from an active audio source (100) is disclosed. The method comprises determining a correspondence between the active audio source and a plurality of sample sources by comparing one or more spatial features of the active audio source with one or more spatial features of the plurality of sample sources, each of the sample sources being associated with an adaptive filtering model (101); obtaining an adaptive filtering model for the active audio source based on the determined correspondence (102); and estimating the reverberant energy component from the active audio source over time based on the adaptive filtering model (103). A corresponding system (800) and computer program product (900) are also disclosed.
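The two stages of the claimed method can be sketched compactly: match the active source to the closest sample source by spatial features, then run that source's filtering model over the observed energies. Here the match is nearest-neighbor Euclidean distance and the "adaptive filtering model" is a one-pole decay; both are illustrative assumptions.

```python
import math

def nearest_sample_source(active_features, sample_sources):
    """Match the active source to the sample source with the closest
    spatial feature vector (Euclidean distance, as an assumed metric)."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return min(sample_sources, key=lambda s: dist(active_features, s["features"]))

def estimate_reverb_energy(direct_energies, decay):
    """Estimate reverberant energy over time with a one-pole decay model
    standing in for the matched source's adaptive filtering model."""
    reverb, out = 0.0, []
    for e in direct_energies:
        reverb = decay * reverb + (1.0 - decay) * e
        out.append(reverb)
    return out
```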
Abstract:
A method of encoding audio information for forward error correction reconstruction of a transmitted audio stream over a lossy packet switched network, the method including the steps of: (a) dividing the audio stream into audio frames; (b) determining a series of corresponding audio frequency bands for the audio frames; (c) determining a series of power envelopes for the frequency bands; (d) encoding the envelopes as a low bit rate version of the audio frame in a redundant transmission frame.
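Steps (b)–(d) above amount to summarizing each frame as a handful of per-band power values cheap enough to piggyback in a later packet. A minimal sketch, assuming an FFT split into equal-width bands; the band count and splitting scheme are illustrative.

```python
import numpy as np

def encode_redundant_frame(frame, num_bands=8):
    """Compute a coarse per-band power envelope for one audio frame; this
    envelope serves as the low-bit-rate redundant copy carried in a
    later packet for FEC reconstruction. Equal-width bands are an
    assumption; a real codec would likely use perceptual band edges."""
    spectrum = np.abs(np.fft.rfft(frame)) ** 2
    bands = np.array_split(spectrum, num_bands)
    return [float(np.sum(b)) for b in bands]  # one power value per band
```

On packet loss, the receiver can shape noise or a decoded neighbor frame to this envelope, recovering an intelligible approximation of the lost frame.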
Abstract:
A method for channel identification of a multi-channel audio signal comprising X>1 channels is provided. The method comprises the steps of: identifying, among the X channels, any empty channels, thus resulting in a subset of Y≤X non-empty channels; determining whether a low frequency effect (LFE) channel is present among the Y channels, and upon determining that an LFE channel is present, identifying the determined channel among the Y channels as the LFE channel; dividing the remaining channels among the Y channels not being identified as the LFE channel into any number of pairs of channels by matching symmetrical channels; and identifying any remaining unpaired channel among the Y channels not being identified as the LFE channel or divided into pairs as a center channel.
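The four steps above can be sketched end to end. The concrete tests used here are assumptions: "empty" means near-zero RMS, "LFE" means energy concentrated below ~120 Hz, and symmetric channels are paired greedily by signal correlation; the abstract does not commit to these criteria.

```python
import numpy as np

def classify_channels(channels, fs=48000, lfe_cutoff_hz=120.0, empty_thresh=1e-6):
    """Rough channel identification following the claimed steps: drop empty
    channels, tag a channel whose energy sits below ~120 Hz as LFE, pair
    symmetric channels by correlation, and call any remaining single
    channel the center. All thresholds are illustrative assumptions."""
    result = {"empty": [], "lfe": None, "pairs": [], "center": None}
    active = []
    for i, x in enumerate(channels):
        if np.sqrt(np.mean(np.square(x))) < empty_thresh:
            result["empty"].append(i)
        else:
            active.append(i)
    # LFE: dominant energy below the cutoff frequency.
    remaining = []
    for i in active:
        spec = np.abs(np.fft.rfft(channels[i])) ** 2
        freqs = np.fft.rfftfreq(len(channels[i]), 1.0 / fs)
        if result["lfe"] is None and spec[freqs < lfe_cutoff_hz].sum() > 0.9 * spec.sum():
            result["lfe"] = i
        else:
            remaining.append(i)
    # Pair symmetric channels greedily by absolute correlation.
    while len(remaining) > 1:
        i = remaining.pop(0)
        best = max(remaining, key=lambda j: abs(np.dot(channels[i], channels[j])))
        remaining.remove(best)
        result["pairs"].append((i, best))
    if remaining:
        result["center"] = remaining[0]  # last unpaired non-LFE channel
    return result
```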