-
公开(公告)号:US20230335149A1
公开(公告)日:2023-10-19
申请号:US18210702
申请日:2023-06-16
Inventor: Masanari MIYAMOTO
IPC: G10L21/0232 , G10L15/06 , H04R3/00 , H04R1/40
CPC classification number: G10L21/0232 , G10L15/063 , H04R3/005 , H04R1/406 , G10L2021/02166
Abstract: A speech processing device includes a processor. The processor performs operations including: detecting a single-talk state based on a speech signal collected by each of microphones, the single-talk state in which any one of persons speaks; estimating a mixing rate indicating a ratio of a speech signal of the main speaking person to a speech signal of another person based on a sound pressure ratio of the speech signals collected by the microphones in the single-talk state of the main speaking person and a sound pressure ratio of the speech signals collected by the plurality of microphones in the single-talk state of the another person; and determining whether suppression of a crosstalk component due to speaking of the another person contained in the speech signal of the main speaking person is necessary based on an estimation result of the mixing rate.
-
公开(公告)号:US20210043198A1
公开(公告)日:2021-02-11
申请号:US16979714
申请日:2018-12-11
Inventor: Naoya TANAKA , Tomofumi YAMANASHI , Masanari MIYAMOTO
Abstract: This voice processing device is provided with: an utterer's position detection unit which specifies, as position microphones of an utterer, microphones that receive a voice signal of WuW on the basis of the characteristics of each voice signal for a prescribed time, when the WuW voice is detected, the voice signal being held in a voice signal buffer unit; and a CTC unit (one example of a voice processing unit) which outputs a voice uttered by the utterer and suppress a voice uttered by an occupant, who is not the utterer, by using the voice signal for the prescribed time, which is held in the voice signal buffer unit, and information relating to the utterer's position microphones.
-
公开(公告)号:US20180158446A1
公开(公告)日:2018-06-07
申请号:US15572047
申请日:2016-04-19
Inventor: Masanari MIYAMOTO , Hiroyuki MATSUMOTO , Ryouichi YUGE , Shintaro YOSHIKUNI , Toshimichi TOKUDA , Naoya TANAKA
CPC classification number: G10K11/17827 , G01S3/80 , G01S3/8083 , G10K11/17885 , G10K11/346 , G10K2210/102 , G10K2210/12 , G10K2210/3045 , H04N5/225 , H04N7/183 , H04R1/406 , H04R3/005 , H04R3/12 , H04R27/00 , H04R29/002 , H04R2430/01 , H04R2430/23 , H04R2499/11
Abstract: In a directionality control system, a camera device captures a video of image capture area (SA). A microphone array device collects a sound in image capture area (SA). A signal processing section detects a sound source of the sound in image capture area (SA) which is collected by the microphone array device. In a case where the detected sound source is within a range of privacy area (PRA), an output control section controls the sound in image capture area (SA) which is collected by the microphone array device and is output from speaker device (37).
-
公开(公告)号:US20170280108A1
公开(公告)日:2017-09-28
申请号:US15454722
申请日:2017-03-09
Inventor: Hiroyuki MATSUMOTO , Shintaro YOSHIKUNI , Masanari MIYAMOTO
CPC classification number: H04N7/183 , G01H9/002 , G01S5/20 , G06K9/00771 , G08G5/0026 , G08G5/0069 , G08G5/0082 , G10L21/10 , H04R1/406 , H04R3/005 , H04R29/005 , H04R2420/07 , H04R2430/20
Abstract: In a pilotless flying object detection system, a masking area setter sets a masking area to be excluded from detection of a pilotless flying object which appears in a captured image of a monitoring area, based on audio collected by a microphone array. An object detector detects the pilotless flying object based on the audio collected by the microphone array and the masking area set by the masking area setter. An output controller superimpose sound source visual information, which indicates the volume of a sound at a sound source position, at the sound source position of the pilotless flying object in the captured image and displays the result on a first monitor in a case where the pilotless flying object is detected in an area other than the masking area.
-
公开(公告)号:US20250029615A1
公开(公告)日:2025-01-23
申请号:US18715556
申请日:2022-12-01
Inventor: Teppei FUKUDA , Shintaro OKADA , Masanari MIYAMOTO
Abstract: A voice registration device includes an acquisition unit that acquires a voice signal of an utterance voice of a speaker, a detection unit that detects, from the voice signal, a first utterance section of the speaker and a second utterance section different from the first utterance section, a sensing unit that compares a voice signal of the first utterance section with a voice signal of the second utterance section and senses switching from the speaker to another speaker different from the speaker, and a registration unit that registers the voice signal of the speaker in a database based on the sensing of the switching by the sensing unit.
-
公开(公告)号:US20240354389A1
公开(公告)日:2024-10-24
申请号:US18680553
申请日:2024-05-31
Inventor: Shintaro OKADA , Teppei FUKUDA , Masanari MIYAMOTO
Abstract: An authentification device includes an acquisition unit configured to acquire and detect a voice signal of an utterance voice of a speaker, an authentication unit configured to authenticate whether the speaker is the person himself/herself based on collation between the voice signal detected by the acquisition unit and a database, and a display interface configured to display, on a terminal device, an authentication status indicating whether the speaker is the person himself/herself based on an authentication result of the authentication unit, in which the display interface updates a display content of the authentication status of the speaker by the authentication unit every time the authentication status changes.
-
公开(公告)号:US20240005919A1
公开(公告)日:2024-01-04
申请号:US18370162
申请日:2023-09-19
Inventor: Naoya TANAKA , Tomofumi YAMANASHI , Masanari MIYAMOTO
CPC classification number: G10L15/20 , B60R11/0217 , B60R11/0247 , B60R2011/0005 , G10L15/30 , H04R1/025 , G10L15/10
Abstract: A voice processing device includes plural microphones arranged so as to correspond to a plurality of positions. The voice processing device includes at least one memory that stores instructions and voice signals from the plural microphones, and a processor. The voice signals collected by the plural microphones, respectively, during a prescribed period before a present time, are repeatedly stored in the at least one memory as buffered voice signals. The processor detects whether a prescribed word is uttered by a speaker based on the voice signals collected by the plural microphones, determines a microphone corresponding to the speaker by referring to the buffered voice signals, and suppresses the voice signals collected by the plural microphones other than the microphone corresponding to the speaker.
-
公开(公告)号:US20220328059A1
公开(公告)日:2022-10-13
申请号:US17851945
申请日:2022-06-28
Inventor: Masanari MIYAMOTO
IPC: G10L21/0232 , G10L15/06 , H04R3/00 , H04R1/40
Abstract: A speech processing device includes a processor. The processor performs operations including: detecting a single-talk state based on a speech signal collected by each of microphones, the single-talk state in which any one of persons speaks; estimating a mixing rate indicating a ratio of a speech signal of the main speaking person to a speech signal of another person based on a sound pressure ratio of the speech signals collected by the microphones in the single-talk state of the main speaking person and a sound pressure ratio of the speech signals collected by the plurality of microphones in the single-talk state of the another person; and determining whether suppression of a crosstalk component due to speaking of the another person contained in the speech signal of the main speaking person is necessary based on an estimation result of the mixing rate.
-
公开(公告)号:US20210264936A1
公开(公告)日:2021-08-26
申请号:US17179985
申请日:2021-02-19
Inventor: Masanari MIYAMOTO
IPC: G10L21/0232 , G10L15/06 , H04R1/40 , H04R3/00
Abstract: A speech processing device includes a processor. The processor performs operations including: detecting a single-talk state based on a speech signal collected by each of microphones, the single-talk state in which any one of persons speaks; estimating a mixing rate indicating a ratio of a speech signal of the main speaking person to a speech signal of another person based on a sound pressure ratio of the speech signals collected by the microphones in the single-talk state of the main speaking person and a sound pressure ratio of the speech signals collected by the plurality of microphones in the single-talk state of the another person; and determining whether suppression of a crosstalk component due to speaking of the another person contained in the speech signal of the main speaking person is necessary based on an estimation result of the mixing rate.
-
公开(公告)号:US20200007690A1
公开(公告)日:2020-01-02
申请号:US16489897
申请日:2018-01-29
Inventor: Masanari MIYAMOTO , Hiromasa OHASHI , Naoya TANAKA
Abstract: A microphone picks-up voice of a driver. A first echo suppression unit outputs a voice signal after first echo suppression based on a voice signal of the driver and a voice signal after echo suppression in the past (first reference signal) stored in a buffer memory. A second echo suppression unit outputs a voice signal after second echo suppression based on a voice signal of the driver and a voice signal after the echo suppression in the past (second reference signal) stored in a buffer memory. An output signal selector selects one of the voice signals after the first echo suppression or the voice signal after the second echo suppression according to a detection result of the presence or absence of a system variation by a system variation detector, and causes a speaker to output the selected voice signal.
-
-
-
-
-
-
-
-
-