-
Publication No.: US20190096431A1
Publication date: 2019-03-28
Application No.: US16136487
Filing date: 2018-09-20
Applicant: FUJITSU LIMITED
Inventors: Sayuri Nakayama, TARO TOGAWA, Takeshi OTANI
Abstract: A speech processing method for estimating a pitch frequency includes: executing a conversion process that includes acquiring an input spectrum from an input signal by converting the input signal from a time domain to a frequency domain; executing a feature amount acquisition process that includes acquiring a feature amount of speech likeness for each band included in a target band based on the input spectrum; executing a selection process that includes selecting a selection band from the target band based on the feature amount of speech likeness for each band; and executing a detection process that includes detecting a pitch frequency based on the input spectrum and the selection band.
-
2.
Publication No.: US20190027165A1
Publication date: 2019-01-24
Application No.: US16035153
Filing date: 2018-07-13
Applicant: FUJITSU LIMITED
Inventors: TARO TOGAWA, Sayuri Nakayama, Takeshi OTANI
CPC classification: G10L25/87, G10L15/222, G10L17/02, G10L21/02, G10L25/60, G10L25/78, G10L25/84, G10L2025/783
Abstract: An information processing apparatus includes a memory, and a processor coupled to the memory and configured to specify a first signal level of a first voice signal, specify a second signal level of a second voice signal, and execute evaluation of at least one of the first voice signal and the second voice signal based on at least one of a sum of the first signal level and the second signal level and an average of the first signal level and the second signal level.
-
Publication No.: US20150371662A1
Publication date: 2015-12-24
Application No.: US14723907
Filing date: 2015-05-28
Applicant: FUJITSU LIMITED
Inventors: TARO TOGAWA, Chisato Shioda, Takeshi OTANI
CPC classification: G10L25/48, G10L21/0216, G10L25/06, G10L25/93
Abstract: A voice processing device includes a memory; and a processor configured to execute a plurality of instructions stored in the memory, the instructions including: acquiring a transmitted voice; first detecting a first utterance segment of the transmitted voice; second detecting a response segment from the first utterance segment; determining a frequency of the response segment included in the transmitted voice; and estimating an utterance time period of a received voice on a basis of the frequency.
-
Publication No.: US20140163979A1
Publication date: 2014-06-12
Application No.: US14074511
Filing date: 2013-11-07
Applicant: FUJITSU LIMITED
Inventors: Masanao SUZUKI, Takeshi OTANI, Taro TOGAWA
IPC classification: G10L15/20
CPC classification: G10L21/04
Abstract: A voice processing device includes: a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: receiving a first signal including a plurality of voice segments; controlling such that a non-voice segment with a length equal to or greater than a predetermined first threshold value exists between at least one of the plurality of voice segments; and outputting a second signal including the plurality of voice segments and the controlled non-voice segment.
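The control step in the abstract of US20140163979A1 (ensuring a non-voice segment of at least a threshold length between voice segments) can be illustrated with a minimal sketch. This is not the patented implementation: the segment representation, the function name enforce_min_pause, and the choice to insert a pause where none exists are all assumptions made for illustration. It also assumes that adjacent pauses have already been merged.

```python
from typing import List, Tuple

# Hypothetical representation: (kind, duration in seconds),
# where kind is either "voice" or "pause".
Segment = Tuple[str, float]


def enforce_min_pause(segments: List[Segment], min_pause: float) -> List[Segment]:
    """Return a new segment list in which every pair of consecutive voice
    segments is separated by a pause of at least `min_pause` seconds."""
    out: List[Segment] = []
    for kind, duration in segments:
        if kind == "voice" and out:
            if out[-1][0] == "voice":
                # No pause at all between two voice segments: insert one.
                out.append(("pause", min_pause))
            elif (
                len(out) >= 2
                and out[-2][0] == "voice"
                and out[-1][1] < min_pause
            ):
                # Pause between two voice segments is too short: lengthen it.
                out[-1] = ("pause", min_pause)
        out.append((kind, duration))
    return out


example = [
    ("voice", 1.2), ("pause", 0.1), ("voice", 0.8),
    ("voice", 0.5), ("pause", 0.6), ("voice", 1.0),
]
adjusted = enforce_min_pause(example, min_pause=0.3)
```

In this sketch the 0.1 s pause is lengthened to 0.3 s, a 0.3 s pause is inserted between the two adjacent voice segments, and the 0.6 s pause is left unchanged. Leading and trailing pauses are not touched because they do not lie between voice segments.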
-
Publication No.: US20140153743A1
Publication date: 2014-06-05
Application No.: US14080373
Filing date: 2013-11-14
Applicant: FUJITSU LIMITED
Inventors: Takeshi OTANI, Taro TOGAWA, Chisato ISHIKAWA, Masanao SUZUKI
IPC classification: G10K11/16
CPC classification: H04R3/04, G10L21/0208, H03G5/18, H04R3/007, H04R29/001, H04R2430/01, H04R2430/03, H04R2499/11
Abstract: An audio processing device includes a setting section that sets a reproduction sampling frequency Fplay and a recording sampling frequency Frec higher than Fplay, a digital-to-analogue converter that based on Fplay converts a sound source signal that is a digital signal into a reproduction signal that is an analogue signal, an analogue-to-digital converter that based on Frec converts a recording signal that is an analogue signal into an input signal that is a digital signal, a signal separator that separates the input signal into a low region signal contained in a band of less than Fplay and a high region signal contained in a band of Fplay and higher, and a breakup detector that detects whether or not breakup is occurring in the reproduced sound based on power of the high region signal.
-
Publication No.: US20140142943A1
Publication date: 2014-05-22
Application No.: US14054266
Filing date: 2013-10-15
Applicant: FUJITSU LIMITED
Inventors: Chisato Ishikawa, Taro TOGAWA, Takeshi OTANI, Masanao SUZUKI
IPC classification: G10L17/00
Abstract: A signal processing device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: receiving speech of a speaker as a first signal; detecting an expiration period included in the first signal; extracting a number of phonemes included in the expiration period; and controlling a second signal, which is an output to the speaker, on the basis of the number of phonemes and a length of the expiration period.
-
Publication No.: US20190027158A1
Publication date: 2019-01-24
Application No.: US16143537
Filing date: 2018-09-27
Applicant: FUJITSU LIMITED
Inventors: Taro Togawa, Sayuri Nakayama, Takeshi OTANI
IPC classification: G10L21/013, G10L25/63, G10L25/78
Abstract: A non-transitory computer-readable recording medium that records a program for causing a computer to execute an utterance impression determination process of: specifying a current fundamental frequency from a voice signal which is received, calculating a relaxation value by changing the current fundamental frequency in chronological order so that the change in the current fundamental frequency becomes moderate, and evaluating the voice signal based on a degree of a magnitude of a difference between at least one feature amount associated with the current fundamental frequency and the relaxation value corresponding to the feature amount.
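The relaxation value in the abstract of US20190027158A1 is a slowly changing version of the fundamental frequency track, and the evaluation looks at how far a feature deviates from it. A minimal sketch of that idea follows. It assumes, purely for illustration, that the relaxation is an exponential moving average, that the feature amount is the fundamental frequency itself, and that the deviation is summarized as a mean absolute difference; the names relaxation_values, mean_deviation, and alpha are hypothetical and do not come from the patent.

```python
from typing import List


def relaxation_values(f0: List[float], alpha: float = 0.9) -> List[float]:
    """Smooth a fundamental-frequency track in chronological order.

    Assumption: exponential moving average with weight `alpha` on the
    previous relaxed value (a larger alpha gives a more moderate change).
    """
    if not f0:
        return []
    relaxed: List[float] = []
    prev = f0[0]  # start the relaxed track at the first observed value
    for value in f0:
        prev = alpha * prev + (1.0 - alpha) * value
        relaxed.append(prev)
    return relaxed


def mean_deviation(f0: List[float], alpha: float = 0.9) -> float:
    """Mean absolute difference between the F0 track and its relaxed track."""
    if not f0:
        raise ValueError("f0 must contain at least one frame")
    relaxed = relaxation_values(f0, alpha)
    return sum(abs(v - r) for v, r in zip(f0, relaxed)) / len(f0)


flat = [120.0] * 5                            # monotone contour, in Hz
lively = [120.0, 180.0, 110.0, 190.0, 125.0]  # strongly varying contour, in Hz
flat_score = mean_deviation(flat)
lively_score = mean_deviation(lively)
```

Under these assumptions a flat contour stays close to its relaxed track and scores near zero, while a contour with large swings scores higher. How such a score maps to an utterance impression is not specified here and would need a threshold or scale taken from the patent itself.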
-
Publication No.: US20180059155A1
Publication date: 2018-03-01
Application No.: US15645011
Filing date: 2017-07-10
Applicant: FUJITSU LIMITED
Inventors: Sayuri Nakayama, TARO TOGAWA, Takeshi OTANI
CPC classification: G01R23/16, G10L19/02, G10L21/0232, G10L2021/02087, G10L2021/02165, H03G3/32, H03G5/165, H04H60/04, H04R1/222, H04R3/005, H04R2430/03
Abstract: A sound processing device performs obtaining a first frequency spectrum that corresponds to a first sound signal and a second frequency spectrum that corresponds to a second sound signal, calculating a level difference between a level of each of frequency components in the first frequency spectrum and a level of each of frequency components in the second frequency spectrum, calculating a spread of a distribution of the level difference during a prescribed period for each of the frequency components, and determining a gain to be multiplied to the frequency component in the first frequency spectrum and a gain to be multiplied to the frequency component in the second frequency spectrum in accordance with the spread of the distribution of the level difference.
-
9.
Publication No.: US20170086779A1
Publication date: 2017-03-30
Application No.: US15259149
Filing date: 2016-09-08
Applicant: FUJITSU LIMITED
Inventors: Akira Kamano, Takeshi OTANI, Yohei KISHI
Abstract: The eating and drinking action detection apparatus: acquires vibration produced from inside of a body of a subject and generates a vibration signal corresponding to the vibration; divides the vibration signal into frames to calculate power of the vibration signal for each frame; determines, for each frame, whether the frame is a stationary signal having a periodicity or a non-stationary signal having no periodicity; detects, based on the power of each frame and a determination result for each frame whether the frame is the stationary signal or the non-stationary signal, a period of the non-stationary signal being continued while the power of the vibration signal is equal to or larger than a power threshold; acquires a continuation time of the period; and determines, based on the continuation time, whether the subject performed swallowing or mastication in the period of the non-stationary signal being continued.
-
Publication No.: US20170061991A1
Publication date: 2017-03-02
Application No.: US15247887
Filing date: 2016-08-25
Applicant: FUJITSU LIMITED
Inventors: Sayuri KOHMURA, TARO TOGAWA, Takeshi OTANI
Abstract: An utterance condition determination device includes a memory configured to store a voice signal of a first speaker and a voice signal of a second speaker, and a processor configured to estimate an average backchannel frequency that represents a backchannel frequency of the second speaker in a period of time from a voice start time of the voice signal of the second speaker to a predetermined time based on the voice signal of the first speaker and the voice signal of the second speaker, to calculate the backchannel frequency of the second speaker for each unit of time based on the voice signal of the first speaker and the voice signal of the second speaker, and to determine a satisfaction level of the second speaker based on the estimated average backchannel frequency and the calculated backchannel frequency.