Patent search cpc:"H04R1/406" Page 1

1.

Published application
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM [Pending, published]

Publication No.: US20240365078A1

Publication date: 2024-10-31

Application No.: US18764296

Filing date: 2024-07-04

Applicant: FUJIFILM CORPORATION

Inventors: Takashi AOKI, Fuminori IRIE, Kazunori TAMURA, Masahiko MIYATA, Yasunori MURAKAMI

IPC classification: H04S7/00, H04N13/117, H04N13/282, H04N13/332, H04N13/383, H04R1/40, H04R3/00

CPC classification: H04S7/30, H04N13/117, H04N13/282, H04N13/332, H04N13/383, H04R1/406, H04R3/005, H04R2201/401, H04S2400/15

Abstract: An information processing apparatus acquires a plurality of pieces of sound information, sound collection device position information, and target subject position information. In addition, the information processing apparatus specifies a target sound of a region corresponding to a position of a target subject from the plurality of pieces of sound information based on the acquired sound collection device position information and the acquired target subject position information. Further, the information processing apparatus generates target subject emphasis sound information indicating a sound including a target subject emphasis sound in which the specified target sound is emphasized more than a sound emitted from a region different from the region corresponding to the position of the target subject indicated by the acquired target subject position information in a case in which a virtual viewpoint video is generated.
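One common way to emphasize sound from a known position using known microphone positions is delay-and-sum beamforming. The abstract does not say which technique the application uses, so the following is only a minimal sketch under that assumption. The function name, the integer-sample alignment, and the fixed speed of sound are all illustrative choices, not details from the patent.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for room-temperature air


def delay_and_sum(signals, mic_positions, target_position, sample_rate):
    """Align each microphone signal on the target position and average them.

    Hypothetical helper: sound from the target adds coherently, while sound
    from other regions is misaligned and therefore attenuated on average.
    """
    distances = [math.dist(p, target_position) for p in mic_positions]
    nearest = min(distances)
    # Extra propagation delay of each microphone relative to the nearest one,
    # rounded to whole samples.
    delays = [round((d - nearest) / SPEED_OF_SOUND * sample_rate) for d in distances]
    length = min(len(s) - k for s, k in zip(signals, delays))
    return [
        sum(s[n + k] for s, k in zip(signals, delays)) / len(signals)
        for n in range(length)
    ]


# A pulse emitted at the origin reaches the farther microphone one sample later.
mics = [(0.343, 0.0), (0.686, 0.0)]
recordings = [[0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
emphasized = delay_and_sum(recordings, mics, (0.0, 0.0), sample_rate=1000)
```

Rounding delays to whole samples keeps the sketch short; a practical system would likely use fractional delays or frequency-domain steering.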

2.

Granted patent
Audio processing system, audio processing device, and audio processing method [In force]

Publication No.: US12125468B2

Publication date: 2024-10-22

Application No.: US17895319

Filing date: 2022-08-25

Applicant: Panasonic Intellectual Property Management Co., Ltd.

Inventors: Tomofumi Yamanashi, Yutaka Banba

IPC classification: G10K11/178, G10L21/0208, H04R1/40, H04R3/00, H04S7/00

CPC classification: G10K11/17854, H04R1/406

Abstract: An audio processing system includes at least one first microphone, at least one adaptive filter, and a processor. The at least one first microphone acquires a first audio signal and outputs a first signal based on the first audio signal. The first audio signal includes at least one of a first audio component generated at a first position and a second audio component generated at a second position different from the first position. The first signal is input to the at least one adaptive filter. The at least one adaptive filter outputs a passing signal based on the first signal. The processor, when executing a program stored in a memory, performs: making a determination of which of the first audio component and the second audio component the first audio signal includes more; and controlling a filter coefficient of the adaptive filter based on a result of the determination.
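The idea of steering an adaptive filter's coefficients by a dominance decision can be illustrated with a normalized LMS (NLMS) filter whose update is switched on or off per sample. The abstract does not specify the adaptation algorithm or how the determination is made. NLMS, the `adapt_flags` input, and the function name below are assumptions made purely for illustration.

```python
def gated_nlms(primary, reference, adapt_flags, taps=4, step_size=0.5, eps=1e-8):
    """Run an NLMS filter whose coefficients only change when a flag allows it.

    adapt_flags[n] stands in for the result of the determination: True means
    "update the coefficients on this sample", False means "freeze them".
    Returns the error signal (primary minus filter estimate) and final weights.
    """
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    output = []
    for desired, sample, adapt in zip(primary, reference, adapt_flags):
        history = [sample] + history[:-1]
        estimate = sum(w * h for w, h in zip(weights, history))
        error = desired - estimate
        if adapt:
            # Normalized LMS update, scaled by the energy of the filter input.
            norm = sum(h * h for h in history) + eps
            weights = [w + step_size * error * h / norm
                       for w, h in zip(weights, history)]
        output.append(error)
    return output, weights


# When adaptation is allowed, the filter learns to cancel the reference.
reference = [1.0] * 20
adapted_out, adapted_w = gated_nlms(reference, reference, [True] * 20, taps=1)
# When adaptation is frozen, the coefficients stay at their initial values.
frozen_out, frozen_w = gated_nlms(reference, reference, [False] * 20, taps=1)
```

Freezing the update while the wanted component dominates is a common way to keep an adaptive canceller from removing the signal it is supposed to preserve; whether the patent does this is not stated in the abstract.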

3.

Published application
IMAGE PICKUP APPARATUS HAVING MICROPHONES [Pending, published]

Publication No.: US20240348976A1

Publication date: 2024-10-17

Application No.: US18628982

Filing date: 2024-04-08

Applicant: CANON KABUSHIKI KAISHA

Inventors: RYOHEI KANDO

IPC classification: H04R1/40, G03B29/00, H04R1/02, H04R1/28

CPC classification: H04R1/406, G03B29/00, H04R1/028, H04R1/2876, H04R2201/401, H04R2410/01, H04R2420/07, H04R2499/11

Abstract: An image pickup apparatus in which three microphone units are arranged with high space efficiency. The image pickup apparatus comprises a lens barrel having an optical axis, a first microphone unit disposed to a left of the optical axis in a left-right direction when viewed from an optical axis direction, a second microphone unit disposed to a right of the optical axis in the left-right direction, and a third microphone unit disposed between the first microphone unit and the second microphone unit in the left-right direction, wherein when viewed from the optical axis direction, a lower end position of the first microphone unit and a lower end position of the second microphone unit are lower than an upper end position of the lens barrel and a lower end position of the third microphone unit is higher than the upper end position of the lens barrel.

4.

Granted patent
Distributed network of ceiling image-derived directional microphones [In force]

Publication No.: US12120273B2

Publication date: 2024-10-15

Application No.: US17843296

Filing date: 2022-06-17

Applicant: Hewlett-Packard Development Company, L.P.

Inventors: Peter L. Chu, Jay McArdle

IPC classification: H04M3/56, H04R1/02, H04R1/34, H04R1/40, H04R3/00, H04R3/12, H04R29/00

CPC classification: H04M3/568, H04R1/025, H04R1/342, H04R1/403, H04R1/406, H04R3/005, H04R3/12, H04R29/005, H04R2201/021, H04R2201/401

Abstract: A method and apparatus for capturing audio including a ceiling mountable second-order differential microphone module. The module including a solid planar baffle having a generally centered aperture, at least one mounting foot to suspend the solid planar baffle approximately parallel with a rear reflecting plane and to space the solid planar baffle at a predetermined distance below the rear reflecting plane, a differential microphone sealably coupled to the planar baffle with a first side of the differential microphone acoustically exposed to an area above the planar baffle and a second side of the differential microphone acoustically exposed to an area below the planar baffle, and a mounting means for mounting the solid rear reflecting panel to the room ceiling.

5.

Granted patent
Audio data processing method for wake-up speech detection, apparatus, and storage medium [In force]

Publication No.: US12119005B2

Publication date: 2024-10-15

Application No.: US18323496

Filing date: 2023-05-25

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yi Gao

IPC classification: G10L15/22, G10L17/02, G10L17/06, G10L17/20, G10L17/22, G10L21/0208, G10L21/0232, G10L25/18, H04R1/40, H04R3/00, G10L21/0216

CPC classification: G10L17/20, G10L15/22, G10L17/02, G10L17/06, G10L17/22, G10L21/0232, G10L25/18, H04R1/406, H04R3/005, G10L2021/02082, G10L2021/02166

Abstract: An audio data processing method is provided. The method includes: obtaining multi-path audio data in an environmental space, obtaining a speech data set based on the multi-path audio data, and separately generating, in a plurality of enhancement directions, enhanced speech information corresponding to the speech data set; matching a speech hidden feature in the enhanced speech information with a target matching word, and determining an enhancement direction corresponding to the enhanced speech information having a highest degree of matching with the target matching word as a target audio direction; obtaining speech spectrum features in the enhanced speech information, and obtaining, from the speech spectrum features, a speech spectrum feature in the target audio direction; and performing speech authentication on the speech hidden feature and the speech spectrum feature that are in the target audio direction based on the target matching word, to obtain a target authentication result.

6.

Granted patent
Audio device housing [In force]

Publication No.: US12114118B2

Publication date: 2024-10-08

Application No.: US17572178

Filing date: 2022-01-10

Applicant: Shure Acquisition Holdings, Inc.

Inventors: Benjamin Neal Huyck, Gregory William Lantz

IPC classification: H04R1/00, H04R1/02, H04R1/08, H04R1/32, H04R1/40

CPC classification: H04R1/083, H04R1/023, H04R1/406, H04R2201/021

Abstract: An audio device includes a circular cover comprising a top and a bottom, a circular screen rotationally engaged with the bottom of the cover, and a circular shroud removably engaged with top of the cover. The top of the cover includes a plurality of mounting holes. The mounting holes may include a plurality of holes in a VESA pole mounting pattern. The mounting holes may also include a plurality of cable mounting holes configured in a square pattern having greater spacing than the VESA mounting pattern. The audio device may include a plurality of microphones and/or one or more loudspeakers.

7.

Granted patent
Acoustic zoning with distributed microphones [In force]

Publication No.: US12112750B2

Publication date: 2024-10-08

Application No.: US17630895

Filing date: 2020-07-28

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Mark R. P. Thomas, Richard J. Cartwright

IPC classification: G10L15/22, G06F3/16, G10L15/08, G10L21/0264, H04R1/40, H04R3/00, H04S7/00

CPC classification: G10L15/22, G06F3/167, G10L15/08, G10L21/0264, H04R1/406, H04R3/005, H04S7/303, G10L2015/223, H04R2430/21

Abstract: A method for estimating a user's location in an environment may involve receiving output signals from each microphone of a plurality of microphones in the environment. At least two microphones of the plurality of microphones may be included in separate devices at separate locations in the environment and the output signals may correspond to a current utterance of a user. The method may involve determining multiple current acoustic features from the output signals of each microphone and applying a classifier to the multiple current acoustic features. Applying the classifier may involve applying a model trained on previously-determined acoustic features derived from a plurality of previous utterances made by the user in a plurality of user zones in the environment. The method may involve determining, based at least in part on output from the classifier, an estimate of the user zone in which the user is currently located.
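The train-then-classify flow described above can be sketched with a nearest-centroid classifier. This is a deliberately simple stand-in: the abstract does not name the model or the acoustic features. The zone labels, the two-element feature vectors (imagined here as one level-like value per device), and both function names are hypothetical.

```python
import math
from collections import defaultdict


def train_zone_centroids(labelled_features):
    """Average the feature vectors of previous utterances for each zone."""
    grouped = defaultdict(list)
    for zone, features in labelled_features:
        grouped[zone].append(features)
    return {
        zone: [sum(column) / len(rows) for column in zip(*rows)]
        for zone, rows in grouped.items()
    }


def estimate_zone(centroids, features):
    """Return the zone whose centroid is closest to the current features."""
    return min(centroids, key=lambda zone: math.dist(centroids[zone], features))


# Hypothetical training data: (zone label, features from a past utterance).
history = [
    ("kitchen", [0.9, 0.1]),
    ("kitchen", [0.7, 0.3]),
    ("sofa", [0.2, 0.8]),
]
centroids = train_zone_centroids(history)
current_zone = estimate_zone(centroids, [0.85, 0.2])
```

A nearest-centroid model needs no external libraries and makes the data flow easy to follow; any classifier that maps feature vectors to zone labels would fit the same interface.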

8.

Granted patent
Image processing apparatus, system, image processing method, and image processing program [In force]

Publication No.: US12111409B2

Publication date: 2024-10-08

Application No.: US17635304

Filing date: 2019-08-28

Applicant: Sony Interactive Entertainment Inc.

Inventors: Makoto Koizumi

IPC classification: G01S5/20, G01S5/02, G06T7/20, H04R1/40, H04R3/00

CPC classification: G01S5/20, G01S5/0294, G06T7/20, H04R1/406, H04R3/005

Abstract: An image processing apparatus includes a first reception section, a second reception section, an association processing section, an object detection section, and a process execution section. The first reception section receives image information acquired by an image sensor. The second reception section receives sound information that is acquired by one or plural directional microphones and that is generated for at least a partial region in a field of the image sensor. The association processing section associates the sound information with a pixel address of the image information indicating a position in the field. The object detection section detects, from the image information, at least a part of an object that is present in the field. The process execution section executes a predetermined process on the object on the basis of a result of the association performed by the association processing section.
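Associating a sound direction with a pixel address can be pictured with a pinhole-camera mapping from azimuth to image column. The abstract does not describe how the association is computed, so this is an assumed model. It further assumes that the microphone and image sensor share an origin and optical axis, and the function name is illustrative.

```python
import math


def azimuth_to_pixel_column(azimuth_deg, image_width, horizontal_fov_deg):
    """Map a sound direction to a pixel column under a pinhole-camera model.

    azimuth_deg is measured from the optical axis, positive to the right, and
    is expected to lie within the field of view; results are clamped to the
    image bounds.
    """
    # Focal length in pixels, derived from the assumed horizontal field of view.
    focal_px = (image_width / 2) / math.tan(math.radians(horizontal_fov_deg) / 2)
    column = image_width / 2 + focal_px * math.tan(math.radians(azimuth_deg))
    return min(image_width - 1, max(0, round(column)))


center_column = azimuth_to_pixel_column(0.0, image_width=640, horizontal_fov_deg=90.0)
```

A vertical elevation angle could be mapped to a pixel row in the same way, giving a full pixel address for each directional sound measurement.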

9.

Published application
Multi-Channel Speech Compression System and Method [Pending, published]

Publication No.: US20240323630A1

Publication date: 2024-09-26

Application No.: US18676347

Filing date: 2024-05-28

Applicant: Microsoft Technology Licensing, LLC

Inventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost

IPC classification: H04S7/00, G06T7/70, G10L15/06, G10L15/22, G10L19/00, G10L19/008, G10L19/16, G10L21/0208, G10L21/0216, H04R1/40, H04R3/00, H04R5/027, H04S3/00

CPC classification: H04S7/30, G06T7/70, G10L15/063, G10L15/22, G10L19/008, G10L19/167, G10L21/0208, H04R1/406, H04R3/005, H04R5/027, H04S3/008, G10L2019/0001, G10L2019/0002, G10L2021/02166, H04R2201/401, H04S2400/01, H04S2400/15

Abstract: A method, computer program product, and computing system for encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information. Location information may be estimated, via a machine vision system, for an acoustic source within an acoustic environment. One or more acoustic relative transfer functions may be selected from a plurality of acoustic relative transfer functions for the plurality of audio acquisition devices of the audio recording system based upon, at least in part, the location information. The encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer function may be transmitted.

10.

Granted patent
Sound source localization using acoustic wave decomposition [In force]

Publication No.: US12101599B1

Publication date: 2024-09-24

Application No.: US17952806

Filing date: 2022-09-26

Applicant: Amazon Technologies, Inc.

Inventors: Mohamed Mansour

IPC classification: H04R3/00, H04R1/40

CPC classification: H04R1/406, H04R3/005

Abstract: Disclosed are techniques for an improved method for performing sound source localization (SSL) to determine a direction of arrival of an audible sound using a combination of timing information and amplitude information. For example, a device may decompose an observed sound field into directional components, then estimate a time-delay likelihood value and an energy-based likelihood value for each of the directional components. Using a combination of these likelihood values, the device can determine the direction of arrival corresponding to a maximum likelihood value. In some examples, the device may perform Acoustic Wave Decomposition processing to determine the directional components. In order to reduce a processing consumption associated with performing AWD processing, the device splits this process into two phases: a search phase that selects a subset of a device dictionary to reduce a complexity, and a decomposition phase that solves an optimization problem using the subset of the device dictionary.
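The final selection step, combining two likelihood values per directional component and taking the maximum, can be sketched as a weighted sum of log-likelihoods. The abstract only says "a combination". The log-domain sum, the `energy_weight` parameter, and the function name are assumptions for illustration, and the decomposition that produces the likelihoods is not modeled here.

```python
import math


def pick_direction(time_delay_likelihoods, energy_likelihoods, energy_weight=1.0):
    """Return the index of the component with the highest combined score.

    Both inputs hold one strictly positive likelihood per directional
    component. With energy_weight=1.0 the score equals the log of the
    product of the two likelihoods.
    """
    scores = [
        math.log(t) + energy_weight * math.log(e)
        for t, e in zip(time_delay_likelihoods, energy_likelihoods)
    ]
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical likelihoods for three candidate directions.
timing = [0.2, 0.5, 0.3]
energy = [0.6, 0.3, 0.1]
best = pick_direction(timing, energy)
```

Raising `energy_weight` shifts the decision toward the amplitude cue; with the values above, a weight of 3.0 moves the choice from the second direction to the first.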
