-
Publication No.: US20240365078A1
Publication date: 2024-10-31
Application No.: US18764296
Filing date: 2024-07-04
Applicant: FUJIFILM CORPORATION
IPC classification: H04S7/00, H04N13/117, H04N13/282, H04N13/332, H04N13/383, H04R1/40, H04R3/00
CPC classification: H04S7/30, H04N13/117, H04N13/282, H04N13/332, H04N13/383, H04R1/406, H04R3/005, H04R2201/401, H04S2400/15
Abstract: An information processing apparatus acquires a plurality of pieces of sound information, sound collection device position information, and target subject position information. In addition, the information processing apparatus specifies a target sound of a region corresponding to a position of a target subject from the plurality of pieces of sound information based on the acquired sound collection device position information and the acquired target subject position information. Further, the information processing apparatus generates target subject emphasis sound information indicating a sound including a target subject emphasis sound in which the specified target sound is emphasized more than a sound emitted from a region different from the region corresponding to the position of the target subject indicated by the acquired target subject position information in a case in which a virtual viewpoint video is generated.
-
Publication No.: US12125468B2
Publication date: 2024-10-22
Application No.: US17895319
Filing date: 2022-08-25
Inventor(s): Tomofumi Yamanashi, Yutaka Banba
IPC classification: G10K11/178, G10L21/0208, H04R1/40, H04R3/00, H04S7/00
CPC classification: G10K11/17854, H04R1/406
Abstract: An audio processing system includes at least one first microphone, at least one adaptive filter, and a processor. The at least one first microphone acquires a first audio signal and outputs a first signal based on the first audio signal. The first audio signal includes at least one of a first audio component generated at a first position and a second audio component generated at a second position different from the first position. The first signal is input to the at least one adaptive filter. The at least one adaptive filter outputs a passing signal based on the first signal. The processor, when executing a program stored in a memory, performs: making a determination of which of the first audio component and the second audio component the first audio signal includes more; and controlling a filter coefficient of the adaptive filter based on a result of the determination.
-
Publication No.: US20240348976A1
Publication date: 2024-10-17
Application No.: US18628982
Filing date: 2024-04-08
Inventor(s): RYOHEI KANDO
CPC classification: H04R1/406, G03B29/00, H04R1/028, H04R1/2876, H04R2201/401, H04R2410/01, H04R2420/07, H04R2499/11
Abstract: An image pickup apparatus in which three microphone units are arranged with high space efficiency. The image pickup apparatus comprises a lens barrel having an optical axis, a first microphone unit disposed to a left of the optical axis in a left-right direction when viewed from an optical axis direction, a second microphone unit disposed to a right of the optical axis in the left-right direction, and a third microphone unit disposed between the first microphone unit and the second microphone unit in the left-right direction, wherein when viewed from the optical axis direction, a lower end position of the first microphone unit and a lower end position of the second microphone unit are lower than an upper end position of the lens barrel and a lower end position of the third microphone unit is higher than the upper end position of the lens barrel.
-
Publication No.: US12120273B2
Publication date: 2024-10-15
Application No.: US17843296
Filing date: 2022-06-17
Inventor(s): Peter L. Chu, Jay McArdle
CPC classification: H04M3/568, H04R1/025, H04R1/342, H04R1/403, H04R1/406, H04R3/005, H04R3/12, H04R29/005, H04R2201/021, H04R2201/401
Abstract: A method and apparatus for capturing audio including a ceiling mountable second-order differential microphone module. The module including a solid planar baffle having a generally centered aperture, at least one mounting foot to suspend the solid planar baffle approximately parallel with a rear reflecting plane and to space the solid planar baffle at a predetermined distance below the rear reflecting plane, a differential microphone sealably coupled to the planar baffle with a first side of the differential microphone acoustically exposed to an area above the planar baffle and a second side of the differential microphone acoustically exposed to an area below the planar baffle, and a mounting means for mounting the solid rear reflecting panel to the room ceiling.
-
Publication No.: US12119005B2
Publication date: 2024-10-15
Application No.: US18323496
Filing date: 2023-05-25
Inventor(s): Yi Gao
IPC classification: G10L15/22, G10L17/02, G10L17/06, G10L17/20, G10L17/22, G10L21/0208, G10L21/0232, G10L25/18, H04R1/40, H04R3/00, G10L21/0216
CPC classification: G10L17/20, G10L15/22, G10L17/02, G10L17/06, G10L17/22, G10L21/0232, G10L25/18, H04R1/406, H04R3/005, G10L2021/02082, G10L2021/02166
Abstract: An audio data processing method is provided. The method includes: obtaining multi-path audio data in an environmental space, obtaining a speech data set based on the multi-path audio data, and separately generating, in a plurality of enhancement directions, enhanced speech information corresponding to the speech data set; matching a speech hidden feature in the enhanced speech information with a target matching word, and determining an enhancement direction corresponding to the enhanced speech information having a highest degree of matching with the target matching word as a target audio direction; obtaining speech spectrum features in the enhanced speech information, and obtaining, from the speech spectrum features, a speech spectrum feature in the target audio direction; and performing speech authentication on the speech hidden feature and the speech spectrum feature that are in the target audio direction based on the target matching word, to obtain a target authentication result.
-
Publication No.: US12114118B2
Publication date: 2024-10-08
Application No.: US17572178
Filing date: 2022-01-10
CPC classification: H04R1/083, H04R1/023, H04R1/406, H04R2201/021
Abstract: An audio device includes a circular cover comprising a top and a bottom, a circular screen rotationally engaged with the bottom of the cover, and a circular shroud removably engaged with top of the cover. The top of the cover includes a plurality of mounting holes. The mounting holes may include a plurality of holes in a VESA pole mounting pattern. The mounting holes may also include a plurality of cable mounting holes configured in a square pattern having greater spacing than the VESA mounting pattern. The audio device may include a plurality of microphones and/or one or more loudspeakers.
-
Publication No.: US12112750B2
Publication date: 2024-10-08
Application No.: US17630895
Filing date: 2020-07-28
CPC classification: G10L15/22, G06F3/167, G10L15/08, G10L21/0264, H04R1/406, H04R3/005, H04S7/303, G10L2015/223, H04R2430/21
Abstract: A method for estimating a user's location in an environment may involve receiving output signals from each microphone of a plurality of microphones in the environment. At least two microphones of the plurality of microphones may be included in separate devices at separate locations in the environment and the output signals may correspond to a current utterance of a user. The method may involve determining multiple current acoustic features from the output signals of each microphone and applying a classifier to the multiple current acoustic features. Applying the classifier may involve applying a model trained on previously-determined acoustic features derived from a plurality of previous utterances made by the user in a plurality of user zones in the environment. The method may involve determining, based at least in part on output from the classifier, an estimate of the user zone in which the user is currently located.
-
Publication No.: US12111409B2
Publication date: 2024-10-08
Application No.: US17635304
Filing date: 2019-08-28
Inventor(s): Makoto Koizumi
CPC classification: G01S5/20, G01S5/0294, G06T7/20, H04R1/406, H04R3/005
Abstract: An image processing apparatus includes a first reception section, a second reception section, an association processing section, an object detection section, and a process execution section. The first reception section receives image information acquired by an image sensor. The second reception section receives sound information that is acquired by one or plural directional microphones and that is generated for at least a partial region in a field of the image sensor. The association processing section associates the sound information with a pixel address of the image information indicating a position in the field. The object detection section detects, from the image information, at least a part of an object that is present in the field. The process execution section executes a predetermined process on the object on the basis of a result of the association performed by the association processing section.
-
Publication No.: US20240323630A1
Publication date: 2024-09-26
Application No.: US18676347
Filing date: 2024-05-28
IPC classification: H04S7/00, G06T7/70, G10L15/06, G10L15/22, G10L19/00, G10L19/008, G10L19/16, G10L21/0208, G10L21/0216, H04R1/40, H04R3/00, H04R5/027, H04S3/00
CPC classification: H04S7/30, G06T7/70, G10L15/063, G10L15/22, G10L19/008, G10L19/167, G10L21/0208, H04R1/406, H04R3/005, H04R5/027, H04S3/008, G10L2019/0001, G10L2019/0002, G10L2021/02166, H04R2201/401, H04S2400/01, H04S2400/15
Abstract: A method, computer program product, and computing system for encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information. Location information may be estimated, via a machine vision system, for an acoustic source within an acoustic environment. One or more acoustic relative transfer functions may be selected from a plurality of acoustic relative transfer functions for the plurality of audio acquisition devices of the audio recording system based upon, at least in part, the location information. The encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer function may be transmitted.
-
Publication No.: US12101599B1
Publication date: 2024-09-24
Application No.: US17952806
Filing date: 2022-09-26
Inventor(s): Mohamed Mansour
Abstract: Disclosed are techniques for an improved method for performing sound source localization (SSL) to determine a direction of arrival of an audible sound using a combination of timing information and amplitude information. For example, a device may decompose an observed sound field into directional components, then estimate a time-delay likelihood value and an energy-based likelihood value for each of the directional components. Using a combination of these likelihood values, the device can determine the direction of arrival corresponding to a maximum likelihood value. In some examples, the device may perform Acoustic Wave Decomposition processing to determine the directional components. In order to reduce a processing consumption associated with performing AWD processing, the device splits this process into two phases: a search phase that selects a subset of a device dictionary to reduce a complexity, and a decomposition phase that solves an optimization problem using the subset of the device dictionary.
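Illustrative note (not part of the patent record): the abstract above describes scoring each directional component with a time-delay likelihood and an energy-based likelihood, then picking the direction with the maximum combined value. The sketch below shows one possible way to do that selection. The abstract does not state how the two values are combined, so the weighted sum of log-likelihoods, the `weight` parameter, the function name `combine_likelihoods`, and the sample numbers are all assumptions made for illustration.

```python
def combine_likelihoods(time_delay_ll, energy_ll, weight=0.5):
    """Return (best_index, combined_scores) for per-direction log-likelihoods.

    Assumption: the two scores are combined as a weighted sum of
    log-likelihoods. The patent abstract only says "a combination".
    """
    if len(time_delay_ll) != len(energy_ll):
        raise ValueError("both lists must cover the same directional components")
    combined = [weight * t + (1.0 - weight) * e
                for t, e in zip(time_delay_ll, energy_ll)]
    # Index of the directional component with the maximum combined score.
    best = max(range(len(combined)), key=combined.__getitem__)
    return best, combined


# Hypothetical candidate directions and scores, for illustration only.
directions_deg = [0, 90, 180, 270]
time_delay_ll = [-2.0, -0.5, -3.0, -1.0]
energy_ll = [-1.5, -0.7, -2.5, -0.4]

best, scores = combine_likelihoods(time_delay_ll, energy_ll)
print(directions_deg[best])
```

With equal weighting, the 270-degree candidate has the best energy score but loses to the 90-degree candidate once the timing score is included, which is the kind of disagreement a combined estimate is meant to resolve.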