-
Publication No.: US11523245B2
Publication Date: 2022-12-06
Application No.: US17248857
Filing Date: 2021-02-10
Abstract: Some implementations may involve receiving, via an interface system, personnel location data indicating a location of at least one person and receiving, from an orientation system, headset orientation data corresponding with the orientation of a headset. First environmental element location data, indicating a location of at least a first environmental element, may be determined. Based at least in part on the headset orientation data, the personnel location data and the first environmental element location data, headset coordinate locations of at least one person and at least the first environmental element in a headset coordinate system corresponding with the orientation of the headset may be determined. An apparatus may be caused to provide spatialization indications of the headset coordinate locations. Providing the spatialization indications may involve controlling a speaker system to provide environmental element sonification corresponding with at least the first environmental element location data.
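The step of expressing locations in a headset coordinate system can be illustrated with a small sketch. It is not taken from the patent: it assumes a 2D world frame, a headset orientation reduced to a single yaw angle measured counter-clockwise from the world +x axis, and a headset frame with +x pointing forward and +y pointing to the wearer's left. The function names `to_headset_coords` and `azimuth_deg` are hypothetical.

```python
import math


def to_headset_coords(point_xy, headset_xy, headset_yaw_rad):
    """Express a world-frame 2D point in a headset-centred frame.

    Assumption (not from the source): yaw is counter-clockwise from world +x,
    and the headset frame has +x forward and +y to the wearer's left.
    """
    # Translate so the headset sits at the origin.
    dx = point_xy[0] - headset_xy[0]
    dy = point_xy[1] - headset_xy[1]
    # Rotate by -yaw so the facing direction becomes the +x (forward) axis.
    c, s = math.cos(headset_yaw_rad), math.sin(headset_yaw_rad)
    forward = c * dx + s * dy
    left = -s * dx + c * dy
    return forward, left


def azimuth_deg(forward, left):
    """Bearing of a headset-frame point: 0 is straight ahead, positive is left."""
    return math.degrees(math.atan2(left, forward))


if __name__ == "__main__":
    # Headset at the origin, facing along world +y (yaw of 90 degrees).
    person = to_headset_coords((0.0, 5.0), (0.0, 0.0), math.pi / 2)
    hazard = to_headset_coords((5.0, 0.0), (0.0, 0.0), math.pi / 2)
    print(person, azimuth_deg(*person))
    print(hazard, azimuth_deg(*hazard))
```

A spatial audio renderer could use the resulting bearing to place a sound for each person or environmental element; a full implementation would need the complete 3D orientation (for example a rotation matrix or quaternion) rather than yaw alone.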
-
Publication No.: US11778403B2
Publication Date: 2023-10-03
Application No.: US17263125
Filing Date: 2019-07-25
Inventors: McGregor Steele Joyner, Alex Brandmeyer, Scott Daly, Jeffrey Ross Baker, Andrea Fanelli, Poppy Anne Carrie Crum
CPC Classification: H04S7/301, G06F18/214, G06T7/11, G06T7/70, G06V40/10, H04S1/002, H04S7/303, G06T2207/20081, G06T2207/20084, G06T2207/20132, G06T2207/30196, H04S2400/15, H04S2420/01
Abstract: An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their demographic information, and then from a series of images of the user, the system detects and measures various anthropometric characteristics. The system then applies the prepared model to the anthropometric and demographic data as part of generating a personalized HRTF. In this manner, the personalized HRTF can be generated with more convenience than by performing a high-resolution scan or an acoustic measurement of the user, and with less computational complexity than by numerically simulating their HRTF.
-
Publication No.: US11477525B2
Publication Date: 2022-10-18
Application No.: US17281946
Filing Date: 2019-09-30
IPC Classification: H04N21/442, H04N21/422, H04N21/4223, H04N21/84, H04N21/41
Abstract: Creative intent input describing emotion expectations and narrative information relating to media content is received. Expected physiologically observable states relating to the media content are generated based on the creative intent input. An audiovisual content signal with the media content and media metadata comprising the physiologically observable states is provided to a playback apparatus. The audiovisual content signal causes the playback device to use physiological monitoring signals to determine, with respect to a viewer, assessed physiologically observable states relating to the media content and generate, based on the expected physiologically observable states and the assessed physiologically observable states, modified media content to be rendered to the viewer.
-
Publication No.: US20240160849A1
Publication Date: 2024-05-16
Application No.: US18550429
Filing Date: 2022-04-27
Inventors: Andrea FANELLI, Mingqing YUN, Satej Suresh PANKEY, Nicholas Laurence ENGEL, Poppy Anne Carrie Crum
IPC Classification: G06F40/30
CPC Classification: G06F40/30
Abstract: Embodiments are disclosed for speaker diarization supporting episodical content. In an embodiment, a method comprises: receiving media data including one or more utterances; dividing the media data into a plurality of blocks; identifying segments of each block of the plurality of blocks associated with a single speaker; extracting embeddings for the identified segments in accordance with a machine learning model, wherein extracting embeddings for identified segments further comprises statistically combining extracted embeddings for identified segments that correspond to a respective continuous utterance associated with a single speaker; clustering the embeddings for the identified segments into clusters; and assigning a speaker label to each of the embeddings for the identified segments in accordance with a result of the clustering. In some embodiments, a voiceprint is used to identify a speaker and the speaker identity for a speaker label.
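The combine-then-cluster portion of the method can be illustrated with a toy sketch. It is not the patented implementation: it assumes the segment embeddings have already been extracted (the machine learning model is out of scope), uses the arithmetic mean as the statistical combination, and swaps in a simple greedy cosine-similarity clustering because the abstract does not name a clustering algorithm. The names `mean_embedding`, `cosine`, `label_utterances` and the `threshold` value are hypothetical.

```python
import math


def mean_embedding(vectors):
    """Combine the segment embeddings of one continuous utterance by averaging."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity of two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def label_utterances(utterances, threshold=0.8):
    """Assign a speaker label to each utterance.

    utterances: list of utterances, each a list of segment embeddings.
    Greedy clustering (an assumption): join the most similar existing cluster
    if its similarity reaches the threshold, otherwise start a new cluster.
    """
    centroids, counts, labels = [], [], []
    for segments in utterances:
        emb = mean_embedding(segments)
        best, best_sim = None, threshold
        for i, centroid in enumerate(centroids):
            sim = cosine(emb, centroid)
            if sim >= best_sim:
                best, best_sim = i, sim
        if best is None:
            centroids.append(emb)
            counts.append(1)
            best = len(centroids) - 1
        else:
            # Update the running mean of the matched cluster.
            n = counts[best]
            centroids[best] = [(c * n + e) / (n + 1)
                               for c, e in zip(centroids[best], emb)]
            counts[best] = n + 1
        labels.append(f"speaker_{best}")
    return labels


if __name__ == "__main__":
    utterances = [
        [[1.0, 0.0], [0.9, 0.1]],   # two segments of one utterance
        [[0.0, 1.0], [0.1, 0.9]],
        [[1.0, 0.05]],
    ]
    print(label_utterances(utterances))
```

The labels here are anonymous cluster indices; mapping them to named speakers, as in the voiceprint variant the abstract mentions, would require a separate enrollment step.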
-
Publication No.: US20230007343A1
Publication Date: 2023-01-05
Application No.: US17930357
Filing Date: 2022-09-07
IPC Classification: H04N21/442, H04N21/422, H04N21/4223, H04N21/84, H04N21/41
Abstract: Creative intent input describing emotion expectations and narrative information relating to media content is received. Expected physiologically observable states relating to the media content are generated based on the creative intent input. An audiovisual content signal with the media content and media metadata comprising the physiologically observable states is provided to a playback apparatus. The audiovisual content signal causes the playback device to use physiological monitoring signals to determine, with respect to a viewer, assessed physiologically observable states relating to the media content and generate, based on the expected physiologically observable states and the assessed physiologically observable states, modified media content to be rendered to the viewer.
-
Publication No.: US20220345845A1
Publication Date: 2022-10-27
Application No.: US17763124
Filing Date: 2020-09-22
Inventors: Nicolas R. Tsingos, Satej Suresh Pankey, Vimal Puthanveed, Poppy Anne Carrie Crum, Jeffrey Ross Baker, Ian Eric Esten, Scott Daly, Daniel Paul Darcy
IPC Classification: H04S7/00
Abstract: Embodiments are disclosed for hybrid near/far-field speaker virtualization. In an embodiment, a method comprises: receiving a source signal including channel-based audio or audio objects; generating near-field gain(s) and far-field gain(s) based on the source signal and a blending mode; generating a far-field signal based, at least in part, on the source signal and the far-field gain(s); rendering, using a speaker virtualizer, the far-field signal for playback of far-field acoustic audio through far-field speakers into an audio reproduction environment; generating a near-field signal based at least in part on the source signal and the near-field gain(s); prior to providing the far-field signal to the far-field speakers, sending the near-field signal to a near-field playback device or an intermediate device coupled to the near-field playback device; providing the far-field signal to the far-field speakers; and providing the near-field signal to the near-field speakers to synchronously overlay the far-field acoustic audio.
-
Publication No.: US20210195362A1
Publication Date: 2021-06-24
Application No.: US17248857
Filing Date: 2021-02-10
Abstract: Some implementations may involve receiving, via an interface system, personnel location data indicating a location of at least one person and receiving, from an orientation system, headset orientation data corresponding with the orientation of a headset. First environmental element location data, indicating a location of at least a first environmental element, may be determined. Based at least in part on the headset orientation data, the personnel location data and the first environmental element location data, headset coordinate locations of at least one person and at least the first environmental element in a headset coordinate system corresponding with the orientation of the headset may be determined. An apparatus may be caused to provide spatialization indications of the headset coordinate locations. Providing the spatialization indications may involve controlling a speaker system to provide environmental element sonification corresponding with at least the first environmental element location data.
-
Publication No.: US20200045492A1
Publication Date: 2020-02-06
Application No.: US16539929
Filing Date: 2019-08-13
Abstract: Some implementations may involve receiving, via an interface system, personnel location data indicating a location of at least one person and receiving, from an orientation system, headset orientation data corresponding with the orientation of a headset. First environmental element location data, indicating a location of at least a first environmental element, may be determined. Based at least in part on the headset orientation data, the personnel location data and the first environmental element location data, headset coordinate locations of at least one person and at least the first environmental element in a headset coordinate system corresponding with the orientation of the headset may be determined. An apparatus may be caused to provide spatialization indications of the headset coordinate locations. Providing the spatialization indications may involve controlling a speaker system to provide environmental element sonification corresponding with at least the first environmental element location data.
-
Publication No.: US12096200B2
Publication Date: 2024-09-17
Application No.: US18455565
Filing Date: 2023-08-24
Inventors: McGregor Steele Joyner, Alex Brandmeyer, Scott Daly, Jeffrey Ross Baker, Andrea Fanelli, Poppy Anne Carrie Crum
CPC Classification: H04S7/301, G06F18/214, G06T7/11, G06T7/70, G06V40/10, H04S1/002, H04S7/303, G06T2207/20081, G06T2207/20084, G06T2207/20132, G06T2207/30196, H04S2400/15, H04S2420/01
Abstract: An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their demographic information, and then from a series of images of the user, the system detects and measures various anthropometric characteristics. The system then applies the prepared model to the anthropometric and demographic data as part of generating a personalized HRTF. In this manner, the personalized HRTF can be generated with more convenience than by performing a high-resolution scan or an acoustic measurement of the user, and with less computational complexity than by numerically simulating their HRTF.
-
Publication No.: US11678014B2
Publication Date: 2023-06-13
Application No.: US17930357
Filing Date: 2022-09-07
IPC Classification: H04N21/442, H04N21/422, H04N21/4223, H04N21/84, H04N21/41
CPC Classification: H04N21/44218, H04N21/4126, H04N21/4223, H04N21/42201, H04N21/84
Abstract: Creative intent input describing emotion expectations and narrative information relating to media content is received. Expected physiologically observable states relating to the media content are generated based on the creative intent input. An audiovisual content signal with the media content and media metadata comprising the physiologically observable states is provided to a playback apparatus. The audiovisual content signal causes the playback device to use physiological monitoring signals to determine, with respect to a viewer, assessed physiologically observable states relating to the media content and generate, based on the expected physiologically observable states and the assessed physiologically observable states, modified media content to be rendered to the viewer.