-
公开(公告)号:US12039993B2
公开(公告)日:2024-07-16
申请号:US18210702
申请日:2023-06-16
Inventor: Masanari Miyamoto
IPC: G10L21/0232 , G10L15/06 , G10L21/0216 , H04R1/40 , H04R3/00
CPC classification number: G10L21/0232 , G10L15/063 , H04R1/406 , H04R3/005 , G10L2015/0631 , G10L2021/02166
Abstract: A speech processing device includes a processor. The processor performs operations including: detecting a single-talk state based on a speech signal collected by each of microphones, the single-talk state in which any one of persons speaks; estimating a mixing rate indicating a ratio of a speech signal of the main speaking person to a speech signal of another person based on a sound pressure ratio of the speech signals collected by the microphones in the single-talk state of the main speaking person and a sound pressure ratio of the speech signals collected by the plurality of microphones in the single-talk state of the another person; and determining whether suppression of a crosstalk component due to speaking of the another person contained in the speech signal of the main speaking person is necessary based on an estimation result of the mixing rate.
-
公开(公告)号:US09641928B2
公开(公告)日:2017-05-02
申请号:US14797597
申请日:2015-07-13
Inventor: Koshi Tanaka , Shinichi Shigenaga , Ryota Fujii , Masanari Miyamoto , Kazuyuki Horio , Yuji Abe
IPC: H04R1/40 , H04R3/00 , G10L21/0216
CPC classification number: H04R1/406 , G10L2021/02166 , H04R3/005 , H04R2201/401
Abstract: A sound collecting control apparatus includes: a vehicle stop detector; a noise source direction specifier to specify a direction from the sound collector to a noise source of the vehicle stopped at the predetermined position; a search beam former that forms a plurality of search beams in the direction of the noise source specified by the noise source direction specifier and around the direction of the noise source so as to search for a sound source of a voice of a speaker in the vehicle; a search beam selector that selects a search beam corresponding to the sound source of the voice of the speaker in the vehicle from the plurality of search beams formed by the search beam former; and a directivity former that forms directivity of the sound collected by the sound collector in the direction corresponding to the search beam selected by the search beam selector.
-
公开(公告)号:US11410671B2
公开(公告)日:2022-08-09
申请号:US17179985
申请日:2021-02-19
Inventor: Masanari Miyamoto
IPC: G10L21/0232 , G10L15/06 , H04R3/00 , H04R1/40 , G10L21/0216
Abstract: A speech processing device includes a processor. The processor performs operations including: detecting a single-talk state based on a speech signal collected by each of microphones, the single-talk state in which any one of persons speaks; estimating a mixing rate indicating a ratio of a speech signal of the main speaking person to a speech signal of another person based on a sound pressure ratio of the speech signals collected by the microphones in the single-talk state of the main speaking person and a sound pressure ratio of the speech signals collected by the plurality of microphones in the single-talk state of the another person; and determining whether suppression of a crosstalk component due to speaking of the another person contained in the speech signal of the main speaking person is necessary based on an estimation result of the mixing rate.
-
公开(公告)号:US20210407528A1
公开(公告)日:2021-12-30
申请号:US17472171
申请日:2021-09-10
Inventor: Masanari Miyamoto , Naoya Tanaka , Hiromasa Ohashi
IPC: G10L21/0208 , G10K11/16
Abstract: An acoustic noise suppressing apparatus includes a sound pickup circuit, a first and second suppression circuits, and an output signal selection circuit. The sound pickup circuit picks up sound. The first suppression circuit processes the sound, in which the first suppression circuit is configured to calculate a first suppression sound signal in which acoustic noise is suppressed from the sound by using a first algorithm suitable for multiple sound sources. The second suppression circuit processes the audio signal in parallel with the first suppression circuit, in which the second suppression circuit is configured to calculate a second suppression sound signal in which acoustic noise is suppressed from the sound signal by using a second algorithm suitable for a single sound source. The output signal selection circuit outputs only one of the first suppression audio signal and the second suppression audio
-
公开(公告)号:US11152010B2
公开(公告)日:2021-10-19
申请号:US16841199
申请日:2020-04-06
Inventor: Masanari Miyamoto , Naoya Tanaka , Hiromasa Ohashi
IPC: G10L15/20 , G10L21/0208 , G10K11/16
Abstract: An acoustic noise suppressing apparatus outputs a first suppression audio signal in which the acoustic noise is suppressed by subtracting a first pseudo noise signal from the picked up audio signal, the first pseudo noise signal being generated based on a first delay signal and a first filter updated by a first algorithm which is valid when a plurality of talkers are talking, and outputs a second suppression audio signal in which the acoustic noise is suppressed by subtracting a second pseudo noise signal from the picked up audio signal, the second pseudo noise signal being generated based on a second delay signal and a second filter updated by a second algorithm which is valid when one talker is talking. The apparatus outputs a suppressed one of the first suppressed audio signal or the second suppressed audio signal.
-
公开(公告)号:US11804220B2
公开(公告)日:2023-10-31
申请号:US16979714
申请日:2018-12-11
Inventor: Naoya Tanaka , Tomofumi Yamanashi , Masanari Miyamoto
CPC classification number: G10L15/20 , B60R11/0217 , B60R11/0247 , G10L15/10 , G10L15/30 , H04R1/025 , B60R2011/0005 , B60R2011/0021
Abstract: This voice processing device is provided with: an utterer's position detection unit which specifies, as position microphones of an utterer, microphones that receive a voice signal of WuW on the basis of the characteristics of each voice signal for a prescribed time, when the WuW voice is detected, the voice signal being held in a voice signal buffer unit; and a CTC unit (one example of a voice processing unit) which outputs a voice uttered by the utterer and suppress a voice uttered by an occupant, who is not the utterer, by using the voice signal for the prescribed time, which is held in the voice signal buffer unit, and information relating to the utterer's position microphones.
-
公开(公告)号:US10706448B2
公开(公告)日:2020-07-07
申请号:US15513622
申请日:2015-09-14
Inventor: Hisahiro Tanaka , Akitoshi Izumi , Masanari Miyamoto , Shinichi Shigenaga , Ryota Fujii , Koshi Tanaka , Hisashi Tsuji
IPC: G06Q10/00 , G06Q30/06 , G06Q10/06 , G07G1/12 , G10L15/08 , G10L17/00 , G10L17/22 , G10L25/78 , G10L21/0216 , G10L15/22 , G10L15/02
Abstract: A service monitoring system includes a voice collector that collects a voice of an employee in a predetermined voice collection region, a storage unit that stores service event data including determination conditions for each predetermined service event, terminal operation history data indicating an operation history of an employee on a predetermined business terminal and voice data of the employee in correlation with each other, a detector that detects the service event of the employee based on the service event data and the terminal operation history data, a calculator that calculates a service speech evaluation value corresponding to a predetermined speech keyword on the basis of the voice data of the employee during the service event, and an output that stores the service speech evaluation value in correlation with identification information of the employee, and voice data of the employee specified by a service position and time point of the employee.
-
公开(公告)号:US10397525B2
公开(公告)日:2019-08-27
申请号:US15454722
申请日:2017-03-09
Inventor: Hiroyuki Matsumoto , Shintaro Yoshikuni , Masanari Miyamoto
IPC: H04N7/18 , H04R29/00 , G10L21/10 , G01H9/00 , G01S5/20 , G06K9/00 , G08G5/00 , H04R3/00 , H04R1/40
Abstract: In a pilotless flying object detection system, a masking area setter sets a masking area to be excluded from detection of a pilotless flying object which appears in a captured image of a monitoring area, based on audio collected by a microphone array. An object detector detects the pilotless flying object based on the audio collected by the microphone array and the masking area set by the masking area setter. An output controller superimpose sound source visual information, which indicates the volume of a sound at a sound source position, at the sound source position of the pilotless flying object in the captured image and displays the result on a first monitor in a case where the pilotless flying object is detected in an area other than the masking area.
-
公开(公告)号:US12119013B2
公开(公告)日:2024-10-15
申请号:US17778277
申请日:2020-11-16
Inventor: Masanari Miyamoto , Naoya Tanaka , Hiromasa Ohashi
IPC: G10L21/00 , G10L15/20 , G10L21/02 , G10L21/0208 , H04R3/02
CPC classification number: G10L21/0208 , G10L15/20 , H04R3/02
Abstract: An acoustic crosstalk suppression device includes a speaker estimation unit configured to estimate a main speaker based on voice signals collected by n units of microphones corresponding to n number of persons (n: an integer equal to or larger than 3); n units of filter update units each of which is configured to update a parameter of a filter configured to generate a suppression signal of a crosstalk component included in a voice signal of the main speaker; and a crosstalk suppression unit configured to suppress the crosstalk component by using a synthesis suppression signal generated by the maximum (n-1) units of filter update units corresponding to reference signals collected by the maximum (n-1) units of microphones.
-
公开(公告)号:US12118990B2
公开(公告)日:2024-10-15
申请号:US18370162
申请日:2023-09-19
Inventor: Naoya Tanaka , Tomofumi Yamanashi , Masanari Miyamoto
CPC classification number: G10L15/20 , B60R11/0217 , B60R11/0247 , G10L15/10 , G10L15/30 , H04R1/025 , B60R2011/0005 , B60R2011/0021
Abstract: A voice processing device includes plural microphones arranged so as to correspond to a plurality of positions. The voice processing device includes at least one memory that stores instructions and voice signals from the plural microphones, and a processor. The voice signals collected by the plural microphones, respectively, during a prescribed period before a present time, are repeatedly stored in the at least one memory as buffered voice signals. The processor detects whether a prescribed word is uttered by a speaker based on the voice signals collected by the plural microphones, determines a microphone corresponding to the speaker by referring to the buffered voice signals, and suppresses the voice signals collected by the plural microphones other than the microphone corresponding to the speaker.
-
-
-
-
-
-
-
-
-