Transcription support system and transcription support method

    公开(公告)号:US10304457B2

    公开(公告)日:2019-05-28

    申请号:US13420827

    申请日:2012-03-15

    IPC分类号: G10L15/26

    摘要: According to one embodiment, a transcription support system supports transcription work to convert voice data to text. The system includes a first storage unit configured to store therein the voice data; a playback unit configured to play back the voice data; a second storage unit configured to store therein voice indices, each of which associates a character string obtained from a voice recognition process with voice positional information, for which the voice positional information is indicative of a temporal position in the voice data and corresponds to the character string; a text creating unit that creates the text in response to an operation input of a user; and an estimation unit configured to estimate already-transcribed voice positional information indicative of a position at which the creation of the text is completed in the voice data based on the voice indices.

    RETRIEVING DEVICE, RETRIEVING METHOD, AND COMPUTER PROGRAM PRODUCT
    3.
    发明申请
    RETRIEVING DEVICE, RETRIEVING METHOD, AND COMPUTER PROGRAM PRODUCT 审中-公开
    检索设备,检索方法和计算机程序产品

    公开(公告)号:US20130080174A1

    公开(公告)日:2013-03-28

    申请号:US13527763

    申请日:2012-06-20

    IPC分类号: G10L13/00

    CPC分类号: G10L15/22 G10L2015/221

    摘要: In an embodiment, a retrieving device includes: a text input unit, a first extracting unit, a retrieving unit, a second extracting unit, an acquiring unit, and a selecting unit. The text input unit inputs a text including unknown word information representing a phrase that a user was unable to transcribe. The first extracting unit extracts related words representing a phrase related to the unknown word information among phrases other than the unknown word information included in the text. The retrieving unit retrieves a related document representing a document including the related words. The second extracting unit extracts candidate words representing candidates for the unknown word information from a plurality of phrases included in the related document. The acquiring unit acquires reading information representing estimated pronunciation of the unknown word information. The selecting unit selects at least one of candidate word of which pronunciation is similar to the reading information.

    摘要翻译: 在一个实施例中,检索装置包括:文本输入单元,第一提取单元,检索单元,第二提取单元,获取单元和选择单元。 文本输入单元输入包括表示用户不能转录的短语的未知单词信息的文本。 第一提取单元提取表示与包含在文本中的未知单词信息以外的短语中的与未知单词信息相关的短语的相关单词。 检索单元检索表示包括相关词的文档的相关文档。 第二提取单元从相关文档中包括的多个短语中提取表示未知单词信息候选的候选词。 获取单元获取表示未知单词信息的估计发音的读取信息。 选择单元选择与读取信息类似的发音的候选词中的至少一个。

    INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD AND COMPUTER PROGRAM PRODUCT
    4.
    发明申请
    INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD AND COMPUTER PROGRAM PRODUCT 有权
    信息处理设备,信息处理方法和计算机程序产品

    公开(公告)号:US20130080163A1

    公开(公告)日:2013-03-28

    申请号:US13533091

    申请日:2012-06-26

    IPC分类号: G10L15/26

    CPC分类号: G06F17/30743 G10L15/26

    摘要: According to an embodiment, an information processing apparatus includes a storage unit, a detector, an acquisition unit, and a search unit. The storage unit configured to store therein voice indices, each of which associates a character string included in voice text data obtained from a voice recognition process with voice positional information, the voice positional information indicating a temporal position in the voice data and corresponding to the character string. The acquisition unit acquires reading information being at least a part of a character string representing a reading of a phrase to be transcribed from the voice data played back. The search unit specifies, as search targets, character strings whose associated voice positional information is included in the played-back section information among the character strings included in the voice indices, and retrieves a character string including the reading represented by the reading information from among the specified character strings.

    摘要翻译: 根据实施例,信息处理设备包括存储单元,检测器,采集单元和搜索单元。 所述存储单元被配置为存储语音索引,每个声音索引将包括在从语音识别处理获得的语音文本数据中的字符串与语音位置信息相关联,所述语音位置信息指示语音数据中的时间位置并对应于所述字符 串。 获取单元获取读取信息,该信息是表示从回放的语音数据中转录的短语的阅读的字符串的至少一部分。 搜索单元将包括在语音索引中的字符串中的相关联的语音位置信息包括在回放部分信息中的字符串指定为检索单元,并从 指定的字符串。

    TRANSCRIPTION SUPPORT SYSTEM AND TRANSCRIPTION SUPPORT METHOD
    5.
    发明申请
    TRANSCRIPTION SUPPORT SYSTEM AND TRANSCRIPTION SUPPORT METHOD 审中-公开
    转录支持系统和转录支持方法

    公开(公告)号:US20130030805A1

    公开(公告)日:2013-01-31

    申请号:US13420827

    申请日:2012-03-15

    IPC分类号: G10L15/26

    CPC分类号: G10L15/26

    摘要: According to one embodiment, a transcription support system supports transcription work to convert voice data to text. The system includes a first storage unit configured to store therein the voice data; a playback unit configured to play back the voice data; a second storage unit configured to store therein voice indices, each of which associates a character string obtained from a voice recognition process with voice positional information, for which the voice positional information is indicative of a temporal position in the voice data and corresponds to the character string; a text creating unit that creates the text in response to an operation input of a user; and an estimation unit configured to estimate already-transcribed voice positional information indicative of a position at which the creation of the text is completed in the voice data based on the voice indices.

    摘要翻译: 根据一个实施例,转录支持系统支持将语音数据转换为文本的转录工作。 该系统包括配置成在其中存储语音数据的第一存储单元; 播放单元,被配置为回放语音数据; 第二存储单元,被配置为在其中存储语音索引,每个声音索引将从语音识别处理获得的字符串与语音位置信息相关联,语音位置信息指示语音数据中的时间位置并对应于该字符 串; 文本创建单元,其响应于用户的操作输入创建文本; 以及估计单元,被配置为基于所述语音索引来估计在所述语音数据中指示完成文本的创建的位置的已经转录的语音位置信息。

    APPARATUS AND METHOD FOR CLUSTERING SPEAKERS, AND A NON-TRANSITORY COMPUTER READABLE MEDIUM THEREOF
    6.
    发明申请
    APPARATUS AND METHOD FOR CLUSTERING SPEAKERS, AND A NON-TRANSITORY COMPUTER READABLE MEDIUM THEREOF 有权
    用于聚集扬声器的装置和方法,以及非终端计算机可读介质

    公开(公告)号:US20130030794A1

    公开(公告)日:2013-01-31

    申请号:US13412694

    申请日:2012-03-06

    IPC分类号: G10L15/02 G06F17/27

    CPC分类号: G10L21/028 G10L15/26

    摘要: According to one embodiment, a speaker clustering apparatus includes a clustering unit, an extraction unit, and an error detection unit. The clustering unit is configured to extract acoustic features for speakers from an acoustic signal, and to cluster utterances included in the acoustic signal into the speakers by using the acoustic features. The extraction unit is configured to acquire character strings representing contents of the utterances, and to extract linguistic features of the speakers by using the character strings. The error detection unit is configured to decide that, when one of the character strings does not fit with a linguistic feature of a speaker into which an utterance of the one is clustered, the utterance is erroneously clustered by the clustering unit.

    摘要翻译: 根据一个实施例,扬声器群集设备包括聚类单元,提取单元和错误检测单元。 聚类单元被配置为从声学信号提取扬声器的声学特征,并且通过使用声学特征将包括在声学信号中的声音聚类到扬声器中。 提取单元被配置为获取表示话语内容的字符串,并且通过使用字符串来提取扬声器的语言特征。 错误检测单元被配置为当其中一个字符串不符合一个发音的扬声器的语言特征被聚集时,该话音被该聚类单元错误地聚集。

    Apparatus and method for clustering speakers, and a non-transitory computer readable medium thereof
    7.
    发明授权
    Apparatus and method for clustering speakers, and a non-transitory computer readable medium thereof 有权
    用于聚类扬声器的装置和方法及其非暂时计算机可读介质

    公开(公告)号:US09251808B2

    公开(公告)日:2016-02-02

    申请号:US13412694

    申请日:2012-03-06

    CPC分类号: G10L21/028 G10L15/26

    摘要: According to one embodiment, a speaker clustering apparatus includes a clustering unit, an extraction unit, and an error detection unit. The clustering unit is configured to extract acoustic features for speakers from an acoustic signal, and to cluster utterances included in the acoustic signal into the speakers by using the acoustic features. The extraction unit is configured to acquire character strings representing contents of the utterances, and to extract linguistic features of the speakers by using the character strings. The error detection unit is configured to decide that, when one of the character strings does not fit with a linguistic feature of a speaker into which an utterance of the one is clustered, the utterance is erroneously clustered by the clustering unit.

    摘要翻译: 根据一个实施例,扬声器群集设备包括聚类单元,提取单元和错误检测单元。 聚类单元被配置为从声学信号提取扬声器的声学特征,并且通过使用声学特征将包括在声学信号中的声音聚类到扬声器中。 提取单元被配置为获取表示话语内容的字符串,并且通过使用字符串来提取扬声器的语言特征。 错误检测单元被配置为当其中一个字符串不符合一个发音的扬声器的语言特征被聚集时,该话音被该聚类单元错误地聚集。

    Apparatus and method for editing speech synthesis, and computer readable medium
    8.
    发明授权
    Apparatus and method for editing speech synthesis, and computer readable medium 有权
    用于编辑语音合成的装置和方法,以及计算机可读介质

    公开(公告)号:US09020821B2

    公开(公告)日:2015-04-28

    申请号:US13235656

    申请日:2011-09-19

    申请人: Osamu Nishiyama

    发明人: Osamu Nishiyama

    IPC分类号: G10L13/033 G10L13/08

    CPC分类号: G10L13/033 G10L13/08

    摘要: An acquisition unit analyzes a text, and acquires phonemic and prosodic information. An editing unit edits a part of the phonemic and prosodic information. A speech synthesis unit converts the phonemic and prosodic information before editing the part to a first speech waveform, and converts the phonemic and prosodic information after editing the part to a second speech waveform. A period calculation unit calculates a contrast period corresponding to the part in the first speech waveform and the second speech waveform. A speech generation unit generates an output waveform by connecting a first partial waveform and a second partial waveform. The first partial waveform contains the contrast period of the first speech waveform. The second partial waveform contains the contrast period of the second speech waveform.

    摘要翻译: 收购单位分析文本,并获取音韵信息。 编辑单元编辑音素和韵律信息的一部分。 语音合成单元将编辑该部分之前的音素和韵律信息转换为第一语音波形,并且将编辑该部分之后的音素和韵律信息转换为第二语音波形。 周期计算单元计算与第一语音波形和第二语音波形中的部分对应的对比度周期。 语音生成单元通过连接第一部分波形和第二部分波形来生成输出波形。 第一部分波形包含第一语音波形的对比度周期。 第二部分波形包含第二语音波形的对比度周期。

    Wide range monitor apparatus for output from nuclear reactor
    9.
    发明授权
    Wide range monitor apparatus for output from nuclear reactor 失效
    用于从核反应堆输出的宽范围监测装置

    公开(公告)号:US4652419A

    公开(公告)日:1987-03-24

    申请号:US655447

    申请日:1984-09-28

    IPC分类号: G21C17/00

    CPC分类号: G21C17/00

    摘要: A wide range monitor apparatus for the output from a nuclear reactor has a logarithmic count rate measuring circuit and a Campbel measuring circuit corresponding to different neutron flux density ranges of the neutron flux output from a neutron detector. The apparatus monitors the nuclear reactor output as a single output which has a linearity with the neutron flux density over a wide range thereof. A logic circuit combines two comparison discrimination signals obtained by comparing low and high comparison voltages corresponding to the hysteresis width in an overlap region of the outputs from the two measuring circuits with detection output voltages from the two measuring circuits. One of the outputs from the two measuring circuits is selected in accordance with the logical level signal obtained by combining the two comparison discrimination signals by the logic circuit.

    摘要翻译: 用于核反应堆输出的宽范围监视装置具有对数计数速率测量电路和对应于从中子检测器输出的中子通量的不同中子通量密度范围的坎贝尔测量电路。 该装置将核反应堆输出监视为与其宽范围内的中子通量密度具有线性关系的单个输出。 逻辑电路将通过比较来自两个测量电路的输出的重叠区域中的与迟滞宽度相对应的低比较电压和高比较电压与来自两个测量电路的检测输出电压相组合的两个比较判别信号。 根据由逻辑电路组合两个比较鉴别信号而获得的逻辑电平信号,选择两个测量电路的输出之一。

    APPARATUS AND METHOD FOR EDITING SPEECH SYNTHESIS, AND COMPUTER READABLE MEDIUM
    10.
    发明申请
    APPARATUS AND METHOD FOR EDITING SPEECH SYNTHESIS, AND COMPUTER READABLE MEDIUM 审中-公开
    用于编辑语音合成的装置和方法,以及计算机可读介质

    公开(公告)号:US20120239404A1

    公开(公告)日:2012-09-20

    申请号:US13235656

    申请日:2011-09-19

    申请人: Osamu Nishiyama

    发明人: Osamu Nishiyama

    IPC分类号: G10L13/08

    CPC分类号: G10L13/033 G10L13/08

    摘要: An acquisition unit analyzes a text, and acquires phonemic and prosodic information. An editing unit edits a part of the phonemic and prosodic information. A speech synthesis unit converts the phonemic and prosodic information before editing the part to a first speech waveform, and converts the phonemic and prosodic information after editing the part to a second speech waveform. A period calculation unit calculates a contrast period corresponding to the part in the first speech waveform and the second speech waveform. A speech generation unit generates an output waveform by connecting a first partial waveform and a second partial waveform. The first partial waveform contains the contrast period of the first speech waveform. The second partial waveform contains the contrast period of the second speech waveform.

    摘要翻译: 收购单位分析文本,并获取音韵信息。 编辑单元编辑音素和韵律信息的一部分。 语音合成单元将编辑该部分之前的音素和韵律信息转换为第一语音波形,并且将编辑该部分之后的音素和韵律信息转换为第二语音波形。 周期计算单元计算与第一语音波形和第二语音波形中的部分对应的对比度周期。 语音生成单元通过连接第一部分波形和第二部分波形来生成输出波形。 第一部分波形包含第一语音波形的对比度周期。 第二部分波形包含第二语音波形的对比度周期。