-
11.
Publication number: US20160065967A1
Publication date: 2016-03-03
Application number: US14834624
Application date: 2015-08-25
Applicant: Google Inc.
Inventor: Ermin Kozica , Dave Zachariah , Willem Bastiaan Kleijn
IPC: H04N19/136 , H04N19/172 , H04N19/44
CPC classification number: H04N19/136 , H04N19/12 , H04N19/172 , H04N19/44 , H04N19/46 , H04N19/61 , H04N21/23439 , H04N21/26275 , H04N21/4305
Abstract: Implementations of independent temporally concurrent video stream coding may include encoding a plurality of input frames from an input video sequence, wherein the plurality of input frames includes a first input frame. Encoding the plurality of input frames may include generating a first plurality of encoded frames based on the plurality of input frames such that the first plurality of encoded frames includes a first encoded I-frame corresponding to the first input frame, and generating a second plurality of encoded frames based on the plurality of input frames such that the second plurality of encoded frames includes a first encoded P-frame corresponding to the first input frame. Implementations may further include placing the first plurality of encoded frames and the second plurality of encoded frames in an output, and transmitting the output to a decoder.
-
Publication number: US10015618B1
Publication date: 2018-07-03
Application number: US15666220
Application date: 2017-08-01
Applicant: Google Inc.
Inventor: Willem Bastiaan Kleijn , Andrew Allen , Jan Skoglund , Sze Chie Lim
CPC classification number: H04S7/303 , H04S3/008 , H04S3/02 , H04S2400/01 , H04S2400/11 , H04S2420/11
Abstract: Techniques of rendering sound for a listener involve producing, as the amplitude of each of the source driving signals, a sum of two terms: a first term based on a solution s† to the equation b=A·s, and a second term based on a projection of a specified vector ŝ onto the nullspace of A, ŝ not being a solution to the equation b=A·s. Along these lines, in one example, the first term is given by the Moore-Penrose pseudoinverse, e.g., A^H (A A^H)^{-1}·b. In general, any solution to the equation b=A·s is satisfactory. The specified vector that is projected onto the nullspace of A is defined to reduce the coherence of the net sound field. Advantageously, the resulting operator is both linear time-invariant and idempotent, so that the sound field may be faithfully reproduced both inside the RSF and at a sufficient range outside the RSF to cover a human head.
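The two-term sum in the abstract can be illustrated with a small numeric sketch. The code below is only an illustration under stated assumptions, not the patented renderer: it assumes A has exactly two rows and full row rank, and the helper names (conj_transpose, matmul, inv2, driving_signals) are hypothetical. It forms s = A^H (A A^H)^{-1}·b + (I - A^H (A A^H)^{-1} A)·ŝ, where the second term is the projection of ŝ onto the nullspace of A.

```python
def conj_transpose(M):
    """Return the conjugate transpose (Hermitian transpose) of matrix M."""
    return [[M[r][c].conjugate() for r in range(len(M))]
            for c in range(len(M[0]))]

def matmul(X, Y):
    """Plain matrix product of two list-of-lists matrices."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def inv2(M):
    """Inverse of a 2 x 2 matrix (assumed non-singular)."""
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def driving_signals(A, b, s_hat):
    """Minimum-norm solution of b = A s plus the nullspace projection of s_hat.

    Assumption for this sketch: A is 2 x N with full row rank.
    """
    AH = conj_transpose(A)
    pinv = matmul(AH, inv2(matmul(A, AH)))        # A^H (A A^H)^{-1}, N x 2
    b_col = [[x] for x in b]
    s_hat_col = [[x] for x in s_hat]
    min_norm = matmul(pinv, b_col)                # first term
    in_row_space = matmul(pinv, matmul(A, s_hat_col))
    # second term: s_hat minus its row-space component = nullspace projection
    return [min_norm[i][0] + s_hat_col[i][0] - in_row_space[i][0]
            for i in range(len(s_hat))]

A = [[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]
b = [1.0, 2.0]
s = driving_signals(A, b, [1.0, 1.0, 1.0])
residual = [sum(A[i][k] * s[k] for k in range(3)) - b[i] for i in range(2)]
print(s, residual)
```

Because the second term lies in the nullspace of A, the result still satisfies b=A·s for any choice of ŝ, which is why ŝ is free to be chosen for another goal such as reducing coherence.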
-
Publication number: US10014961B2
Publication date: 2018-07-03
Application number: US14249870
Application date: 2014-04-10
Applicant: GOOGLE INC.
Inventor: Willem Bastiaan Kleijn , Richard C. Hendriks
IPC: H04B15/00 , G10L21/0364
CPC classification number: H04B15/00 , G10L21/0364
Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm acts (e.g., manipulates the audio signal) only when needed; if no noise is present, the algorithm does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).
-
Publication number: US09721582B1
Publication date: 2017-08-01
Application number: US15014481
Application date: 2016-02-03
Applicant: Google Inc.
Inventor: Yiteng Huang , Alejandro Luebs , Jan Skoglund , Willem Bastiaan Kleijn
IPC: H04R3/00 , G10L21/0216 , G10L21/0264 , G10L21/02 , G10L25/21 , G10L21/0308
CPC classification number: G10L21/0264 , G10L21/0205 , G10L21/0216 , G10L21/0308 , G10L21/0364 , G10L25/21 , G10L2021/02166 , H04R3/005
Abstract: Existing post-filtering methods for microphone array speech enhancement have two common deficiencies. First, they assume that noise is either white or diffuse and cannot deal with point interferers. Second, they estimate the post-filter coefficients using only two microphones at a time, averaging over all the microphone pairs, which yields a suboptimal solution. The provided method describes a post-filtering solution that implements signal models which handle white noise, diffuse noise, and point interferers. The method also implements a globally optimized least-squares approach across the microphones in a microphone array, providing a better solution than existing conventional methods. Experimental results demonstrate the described method outperforming conventional methods in various acoustic scenarios.
-
Publication number: US09716901B2
Publication date: 2017-07-25
Application number: US13856233
Application date: 2013-04-03
Applicant: GOOGLE INC.
Inventor: Willem Bastiaan Kleijn
IPC: G10L19/26 , H04N19/85 , H04N19/176 , H04N19/117 , H04N19/154 , H04N19/147 , H04N19/124
CPC classification number: H04N19/85 , G10L19/26 , H04N19/117 , H04N19/124 , H04N19/147 , H04N19/154 , H04N19/176
Abstract: Methods and systems are provided for separating signal-correlated and signal-uncorrelated error components in quantization noise. Such separation leads to a generalization of the conventional rate-distortion optimization problem. For the commonly used assumption of a Gaussian process, a quantizer according to this principle is implemented in a straightforward manner using a dithered quantizer and appropriate pre-filters and post-filters. If the penalization of the signal-uncorrelated error component is increased over that of the signal-correlated error component, then the pre-filter emphasizes the signal spectrum more, reducing the differential entropy rate of the pre-filtered signal. Accordingly, the signal-uncorrelated noise is reduced for a given rate.
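The dithered quantizer mentioned in the abstract can be sketched as a subtractive dithered uniform quantizer. This is a minimal illustration under assumptions, not the patented scheme: the pre-filter and post-filter are omitted, the function name dithered_quantize and the step size are hypothetical, and encoder and decoder are assumed to share the same dither value (here, by sharing a seeded random generator).

```python
import random

def dithered_quantize(x, step, rng):
    """Subtractive dithered uniform quantizer.

    Adds uniform dither before rounding to the quantizer grid and subtracts
    the same dither after reconstruction. Returns (index, reconstruction).
    """
    dither = rng.uniform(-step / 2, step / 2)
    index = round((x + dither) / step)      # integer index that would be coded
    reconstruction = index * step - dither  # decoder removes the shared dither
    return index, reconstruction

rng = random.Random(0)  # stands in for a dither source shared with the decoder
for sample in [-3.7, 0.0, 0.26, 5.1]:
    idx, x_hat = dithered_quantize(sample, 0.5, rng)
    print(sample, idx, x_hat - sample)
```

With subtractive dither the reconstruction error never exceeds half a step in magnitude and is statistically independent of the input, which is what makes the signal-uncorrelated error component easy to reason about separately.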
-
Publication number: US09584909B2
Publication date: 2017-02-28
Application number: US13867814
Application date: 2013-04-22
Applicant: Google Inc.
Inventor: Richard Heusdens , Guoqiang Zhang , Richard Hendriks , Yuan Zeng , Willem Bastiaan Kleijn
IPC: H01Q3/00 , H04R3/00 , G10L21/0364 , G10L21/0216
CPC classification number: H04R3/005 , G10L21/0364 , G10L2021/02166 , H04R2420/07
Abstract: Methods and systems are provided for implementing a distributed algorithm for beam-forming (e.g., MVDR beam-forming) using a message-passing algorithm. The message-passing algorithm provides for computations to be performed in a distributed manner across a network, rather than in a centralized processing center or “fusion center”. The message-passing algorithm may also function for any network topology, and may continue operations when various changes are made in the network (e.g., nodes appearing, nodes disappearing, etc.). Additionally, the message-passing algorithm may minimize the transmission power per iteration and, depending on the particular network, also may minimize the transmission power required for communication between network nodes.
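For orientation, the MVDR beam-former named in the abstract has the closed form w = R^{-1} d / (d^H R^{-1} d). The sketch below computes it centrally for the special case of a diagonal noise covariance R; this is an assumption made to keep the example short, and the message-passing scheme of the patent is not reproduced here. The function names mvdr_weights_diag and beamform are hypothetical.

```python
def mvdr_weights_diag(d, noise_var):
    """MVDR weights w = R^{-1} d / (d^H R^{-1} d) for R = diag(noise_var).

    d is the steering vector (one entry per microphone node), and noise_var
    holds the per-node noise variances (assumed positive).
    """
    r_inv_d = [di / v for di, v in zip(d, noise_var)]
    denom = sum(di.conjugate() * ri for di, ri in zip(d, r_inv_d))
    return [ri / denom for ri in r_inv_d]

def beamform(w, x):
    """Beam-former output y = w^H x for one snapshot x."""
    return sum(wi.conjugate() * xi for wi, xi in zip(w, x))

d = [1 + 0j, 1j, -1 + 0j]        # example steering vector
noise_var = [1.0, 2.0, 4.0]      # example per-node noise variances
w = mvdr_weights_diag(d, noise_var)
print(w, beamform(w, d))
```

In this diagonal case both the numerator and the denominator are sums of per-node terms, which hints at why such a computation can be spread over a network rather than collected at a fusion center.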
-