Dictionary lookup for mobile devices using spelling recognition
    3.
    发明申请
    Dictionary lookup for mobile devices using spelling recognition 审中-公开
    使用拼写识别的移动设备的字典查找

    公开(公告)号:US20070016420A1

    公开(公告)日:2007-01-18

    申请号:US11176154

    申请日:2005-07-07

    IPC分类号: G10L15/04 G10L15/00

    CPC分类号: G10L15/19

    摘要: A method for querying an electronic dictionary using letters of an alphabet enunciated by a user includes accepting a speech input from the user. The speech input includes a sequence of spelled letters enunciated by the user that spell a query word. The speech input is analyzed to determine one or more sequences of the letters that approximate the sequence of spelled letters. The one or more sequences of the letters are post-processed so as to produce a plurality of recognized words approximating the query word. The electronic dictionary is queried with the plurality of recognized words so as to retrieve a respective plurality of dictionary entries. A list of results including the plurality of recognized words and the respective plurality of dictionary entries is presented to the user.

    摘要翻译: 一种使用用户名字母字母查询电子词典的方法包括接受来自用户的语音输入。 语音输入包括由用户发出拼写查询词的拼写字母序列。 分析语音输入以确定近似拼写字母序列的一个或多个字母序列。 对字母的一个或多个序列进行后处理,以产生近似于查询词的多个识别词。 使用多个识别的字查询电子词典,以便检索相应的多个字典条目。 向用户呈现包括多个识别字和相应的多个字典条目的结果列表。

    Voice transformation with encoded information
    4.
    发明授权
    Voice transformation with encoded information 有权
    具有编码信息的语音变换

    公开(公告)号:US08930182B2

    公开(公告)日:2015-01-06

    申请号:US13049924

    申请日:2011-03-17

    CPC分类号: G10L21/003 G10L19/018

    摘要: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.

    摘要翻译: 提供语音转换的方法,系统和计算机程序产品。 该方法包括使用变换参数来变换源语言,以及使用隐写术对输入语音中的变换参数对信息进行编码,其中可以使用输出语音和关于变换参数的信息来重构源语音。 还提供了一种用于重建语音变换的方法,包括:接收语音转换系统的输出语音,其中输出语音是使用隐写术编码关于变换参数的信息的变换语音; 提取变换参数信息; 并执行输出语音的逆变换以获得原始源语音的近似。

    SPEECH OUTPUT WITH CONFIDENCE INDICATION
    5.
    发明申请
    SPEECH OUTPUT WITH CONFIDENCE INDICATION 审中-公开
    语音输出与信心指示

    公开(公告)号:US20110313762A1

    公开(公告)日:2011-12-22

    申请号:US12819203

    申请日:2010-06-20

    IPC分类号: G10L13/08 G10L21/00 G10L15/00

    CPC分类号: G10L13/08

    摘要: A method, system, and computer program product are provided for speech output with confidence indication. The method includes receiving a confidence score for segments of speech or text to be synthesized to speech. The method includes modifying a speech segment by altering one or more parameters of the speech proportionally to the confidence score.

    摘要翻译: 提供了一种用于具有置信指示的语音输出的方法,系统和计算机程序产品。 该方法包括接收将要合成为语音的语音段或文本段的置信度分数。 该方法包括通过根据置信度分数改变语音的一个或多个参数来修改语音段。

    VOICE TRANSFORMATION WITH ENCODED INFORMATION
    6.
    发明申请
    VOICE TRANSFORMATION WITH ENCODED INFORMATION 有权
    语音转换与编码信息

    公开(公告)号:US20120239387A1

    公开(公告)日:2012-09-20

    申请号:US13049924

    申请日:2011-03-17

    IPC分类号: G10L19/02

    CPC分类号: G10L21/003 G10L19/018

    摘要: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.

    摘要翻译: 提供语音转换的方法,系统和计算机程序产品。 该方法包括使用变换参数来变换源语言,以及使用隐写术对输入语音中的变换参数对信息进行编码,其中可以使用输出语音和关于变换参数的信息来重构源语音。 还提供了一种用于重建语音变换的方法,包括:接收语音转换系统的输出语音,其中输出语音是使用隐写术编码关于变换参数的信息的变换语音; 提取变换参数信息; 并执行输出语音的逆变换以获得原始源语音的近似。

    VOCAL SOURCE EXTRACTION BY MAXIMUM PHASE DETECTION
    8.
    发明申请
    VOCAL SOURCE EXTRACTION BY MAXIMUM PHASE DETECTION 有权
    通过最大相位检测提取VOCAL SOURCE

    公开(公告)号:US20130325455A1

    公开(公告)日:2013-12-05

    申请号:US13487275

    申请日:2012-06-04

    IPC分类号: G10L11/04

    CPC分类号: G10L25/75 G10L25/03 G10L25/45

    摘要: Methods, apparatus and computer program products implement embodiments of the present invention that include receiving a time domain voice signal, and extracting a single pitch cycle from the received signal. The extracted single pitch cycle is transformed to a frequency domain, and the misclassified roots of the frequency domain are identified and corrected. Using the corrected roots, an indication of a maximum phase of the frequency domain is generated.

    摘要翻译: 方法,装置和计算机程序产品实现本发明的实施例,其包括接收时域语音信号,并从接收到的信号中提取单个音调周期。 提取的单音调周期被转换为频域,并且识别和校正频域的错误分类的根。 使用校正的根,产生频域的最大相位的指示。

    Feature-domain concatenative speech synthesis
    9.
    发明授权
    Feature-domain concatenative speech synthesis 有权
    特征域级联语音合成

    公开(公告)号:US07035791B2

    公开(公告)日:2006-04-25

    申请号:US09901031

    申请日:2001-07-10

    申请人: Dan Chazan Ron Hoory

    发明人: Dan Chazan Ron Hoory

    IPC分类号: G10L11/04

    CPC分类号: G10L13/07 G10L25/18

    摘要: A method for speech synthesis includes receiving an input speech signal containing a set of speech segments, and estimating spectral envelopes of the input speech signal in a succession of time intervals during each of the speech segments. The spectral envelopes are integrated over a plurality of window functions in a frequency domain so as to determine elements of feature vectors corresponding to the speech segments. An output speech signal is reconstructed by concatenating the feature vectors corresponding to a sequence of the speech segments.

    摘要翻译: 一种用于语音合成的方法包括接收包含一组语音段的输入语音信号,并且在每个语音段期间以一连串的时间间隔估计输入语音信号的频谱包络。 频谱包络被集成在频域中的多个窗口函数上,以便确定与语音段对应的特征向量的元素。 通过连接对应于语音片段序列的特征向量来重构输出语音信号。

    Fast frequency-domain pitch estimation
    10.
    发明授权
    Fast frequency-domain pitch estimation 有权
    快速频域间距估计

    公开(公告)号:US06587816B1

    公开(公告)日:2003-07-01

    申请号:US09617582

    申请日:2000-07-14

    IPC分类号: G10L1104

    CPC分类号: G10L25/90

    摘要: A method for estimating a pitch frequency of an audio signal includes computing a first transform of the signal to a frequency domain over a first time interval, and computing a second transform of the signal to the frequency domain over a second time interval, which contains the first time interval. A line spectrum of the signal is found, based on the first and second transforms, the spectrum including spectral lines having respective line amplitudes and line frequencies. A utility function that is periodic in the frequencies of the lines in the spectrum is then computed. This function is indicative, for each candidate pitch frequency in a given pitch frequency range, of a compatibility of the spectrum with the candidate pitch frequency. The pitch frequency of the speech signal is estimated responsive to the utility function.

    摘要翻译: 一种用于估计音频信号的音调频率的方法包括:在第一时间间隔上计算信号到频域的第一变换,以及在第二时间间隔上计算信号到频域的第二变换,该第二时间间隔包含 第一时间间隔。 基于第一和第二变换,发现包括具有各自线路幅度和线路频率的谱线的频谱的信号线谱。 然后计算在频谱中的线的频率中周期性的效用函数。 该功能针对给定音调频率范围内的每个候选音调频率指示频谱与候选音调频率的兼容性。 响应于效用函数来估计语音信号的音调频率。