Patent search ap:("Dolby Laboratories Licensing Corporation") AND inv:"Roy M. FEJGIN" Page 1

1.

发明申请
AUDIO DISCONTINUITY DETECTION AND CORRECTION 审中-公开

公开(公告)号：US20180218749A1

公开(公告)日：2018-08-02

申请号：US15745824

申请日：2016-07-26

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Roy M. FEJGIN , Freddie SANCHEZ , Vinay MELKOTE , Michael WARD

IPC: G10L25/48 , G10L25/18 , G11B27/034 , G11B27/10 , G11B27/038

CPC classification number: G10L25/48 , G10L25/18 , G11B27/034 , G11B27/038 , G11B27/105 , G11B2220/2541

Abstract: Methods for detecting whether a rendered version of a specified seamless connection (“SSC”) at a connection point between two audio segment sequences results in an audible discontinuity, and methods for analyzing at least one SSC between audio segment sequences to determine whether a renderable version of each SSC would have an audible discontinuity at the connection point when rendered, and in appropriate cases, for a SSC having a renderable version which is determined to have an audible discontinuity when rendered, correcting at least one audio segment of at least one segment sequence to be connected in accordance with the SSC in an effort to ensure that rendering of the SSC will result in seamless connection without an audible discontinuity. Other aspects are editing systems configured to implement any of the methods, and storage media and rendering systems which store audio data generated in accordance with any of the methods.

2.

发明申请
Method and System for Inter-Channel Coding 审中-公开

公开(公告)号：US20190103119A1

公开(公告)日：2019-04-04

申请号：US16150112

申请日：2018-10-02

Applicant: Dolby Laboratories Licensing Corporation , DOLBY INTERNATIONAL AB

Inventor： Janusz KLEJSA, SR. , Roy M. FEJGIN , Mark S. VINTON

IPC: G10L19/008 , G10L19/00

CPC classification number: G10L19/008 , G10L19/0017 , G10L19/22

Abstract: A method for performing inter-channel encoding of a multi-channel audio signal comprising channel signals for N channels, with N being an integer, with N>1, is described. The method comprises determining a basic graph comprising the N channels as nodes and comprising directed edges between at least some of the N channels. Furthermore, the method comprises determining an inter-channel coding graph from the basic graph, such that the inter-channel coding graph is a directed acyclic graph, and such that a cumulated a cumulated cost of the signals of the nodes of the inter-channel coding graph is reduced.

3.

发明公开
PERCEPTUALLY-BASED LOSS FUNCTIONS FOR AUDIO ENCODING AND DECODING BASED ON MACHINE LEARNING 审中-公开

公开(公告)号：US20240079019A1

公开(公告)日：2024-03-07

申请号：US18507824

申请日：2023-11-13

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Roy M. FEJGIN , Grant A. DAVIDSON , Chih-Wei WU , Vivek KUMAR

IPC: G10L19/022 , G06F3/16 , G06N3/048 , G06N3/084

CPC classification number: G10L19/022 , G06F3/16 , G06N3/048 , G06N3/084

Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.

4.

发明公开
METHOD AND APPARATUS FOR PROCESSING OF AUDIO USING A NEURAL NETWORK 审中-公开

公开(公告)号：US20230395086A1

公开(公告)日：2023-12-07

申请号：US18031790

申请日：2021-10-14

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Mark S. VINTON , Cong ZHOU , Roy M. FEJGIN , Grant A. DAVIDSON

IPC: G10L19/032 , G10L19/06

CPC classification number: G10L19/032 , G10L19/06

Abstract: Described herein is a method of processing an audio signal using a neural network or using a first and a second neural network. Described is further a method of training said neural network or of jointly training a set of said first and said second neural network. Moreover, described is a method of obtaining and transmitting a latent feature space representation of a perceptual domain audio signal using a neural network and a method of obtaining an audio signal from a latent feature space representation of a perceptual domain audio signal using a neural network. Described are also respective apparatuses and computer program products.

5.

发明公开
DEEP-LEARNING BASED SPEECH ENHANCEMENT 审中-公开

公开(公告)号：US20230368807A1

公开(公告)日：2023-11-16

申请号：US18250393

申请日：2021-10-29

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Xiaoyu LIU , Michael Getty HORGAN , Roy M. FEJGIN , Paul HOLMBERG

IPC: G10L21/0232 , G10L19/022

CPC classification number: G10L21/0232 , G10L19/022

Abstract: A system for suppressing noise and enhancing speech and a related method are disclosed. The system trains a neural network model that takes banded energies corresponding to an original noisy waveform and produces a speech value indicating the amount of speech present in each band at each frame. The neural model comprises a feature extraction block that implements some lookahead. The feature extraction block is followed by an encoder with steady down-sampling along the frequency domain forming a contracting path. The encoder is followed by a corresponding decoder with steady up-sampling along the frequency domain forming an expanding path. The decoder receives scaled output feature maps from the encoder at a corresponding level. The decoder is followed by a classification block that generates a speech value indicating an amount of speech present for each frequency band of the plurality of frequency bands at each frame of the plurality of frames.

6.

发明申请
PERCEPTUALLY-BASED LOSS FUNCTIONS FOR AUDIO ENCODING AND DECODING BASED ON MACHINE LEARNING 有权

公开(公告)号：US20210082444A1

公开(公告)日：2021-03-18

申请号：US17046284

申请日：2019-04-10

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Roy M. FEJGIN , Grant A. DAVIDSON , Chih-Wei WU , Vivek KUMAR

IPC: G10L19/022 , G06F3/16 , G06N3/08 , G06N3/04

Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.

7.

发明申请
Audio Segmentation Based on Spatial Metadata 审中-公开
Title translation: 基于空间元数据的音频分割

公开(公告)号：US20170047071A1

公开(公告)日：2017-02-16

申请号：US15306051

申请日：2015-04-23

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Vinay MELKOTE , Malcolm James LAW , Roy M. FEJGIN

IPC: G10L19/00 , G10L19/20 , G10L19/008

CPC classification number: G10L19/0017 , G10L19/008 , G10L19/167 , G10L19/20 , H04S2400/11

Abstract: A method of encoding adaptive audio, comprising receiving N objects and associated spatial metadata that describes the continuing motion of these objects, and partitioning the audio into segments based on the spatial metadata. The method encodes adaptive audio having objects and channel beds by capturing a continuing motion of a number N objects in a time-varying matrix trajectory comprising a sequence of matrices, coding coefficients of the time-varying matrix trajectory in spatial metadata to be transmitted via a high-definition audio format for rendering the adaptive audio through a number M output channels, and segmenting the sequence of matrices into a plurality of sub-segments based on the spatial metadata, wherein the plurality of sub segments are configured to facilitate coding of one or more characteristics of the adaptive audio.

Abstract translation: 一种编码自适应音频的方法，包括接收N个对象和描述这些对象的持续运动的相关联的空间元数据，以及基于空间元数据将音频分割成段。该方法通过捕获包括矩阵序列的时变矩阵轨迹中的N个对象的持续运动来编码具有对象和信道床的自适应音频，将空间元数据中的时变矩阵轨迹的编码系数经由高清晰度音频格式，用于通过M个输出通道渲染自适应音频，以及基于空间元数据将矩阵序列分割成多个子段，其中多个子段被配置为便于编码一个或多个更多特点的自适应音频。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification