Speech recognition faciliation method and apparatus
    1.
    Patent application
    Speech recognition faciliation method and apparatus (status: in force)

    Publication number: US20040034526A1

    Publication date: 2004-02-19

    Application number: US10218548

    Filing date: 2002-08-14

    Applicant: Motorola, Inc.

    Inventor: Changxue Ma

    CPC classification number: G10L15/20

    Abstract: In a speech recognition platform, a masking unit 17 can be utilized to mask noisy content within an audio sample. By masking such noise in a dynamic but predictable manner, valid content can be preserved while largely overcoming the random and detrimental presence of noise. In one embodiment, speech recognition features are extracted pursuant to a hierarchical process that localizes, at least to some extent, some of the resultant features from other resultant features. As a result, noisy or otherwise unreliable information corresponding to the audio sample will not be leveraged unduly across the entire feature set. In another embodiment, an average energy value for processed samples is calculated with individual energy values that are downwardly weighted when such individual energy values are likely representative of noise.
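The second embodiment in the abstract (an average energy in which likely-noise values are weighted downward) can be illustrated with a minimal sketch. The abstract does not say how a value is judged "likely representative of noise" or what weights are used; the fixed `noise_floor` threshold, the `noise_weight` factor, and both function names below are assumptions made purely for illustration, not the method claimed in the application.

```python
def frame_energies(samples, frame_len):
    """Mean squared amplitude of each full, non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def weighted_average_energy(energies, noise_floor, noise_weight=0.1):
    """Weighted mean of per-frame energies.

    ASSUMPTION: an energy at or below `noise_floor` is treated as likely
    noise and receives the reduced weight `noise_weight`; every other
    energy receives weight 1.0. The source does not specify this rule.
    """
    weights = [noise_weight if e <= noise_floor else 1.0 for e in energies]
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(w * e for w, e in zip(weights, energies)) / total
```

With `noise_weight=1.0` the function reduces to a plain mean, so the effect of the down-weighting can be seen by comparing the two results on the same list of energies.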


    Method and apparatus to facilitate correlating symbols to sounds
    2.
    Patent application
    Method and apparatus to facilitate correlating symbols to sounds (status: in force)

    Publication number: US20040059574A1

    Publication date: 2004-03-25

    Application number: US10251354

    Filing date: 2002-09-20

    Applicant: Motorola, Inc.

    CPC classification number: G10L13/08

    Abstract: A dictionary is comprised of a dendroid hierarchy of branches and nodes, wherein each node represents no more than one symbol (which symbol is to be converted to a corresponding sound) and wherein each such symbol as is represented at a given node has only one corresponding sound associated with that symbol at that node. In addition, many of the branches include a plurality of nodes representing a string of the symbols in a particular sequence. The dictionary is used to translate an input comprising a given integral sequence of the symbols into a corresponding integral sequence of sounds. This permits both method and apparatus to convert, for example, text to representative phonemes. Such phonemes can be used, amongst other purposes, to support synthesized speech production.
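The tree-shaped dictionary described in the abstract can be sketched as follows. Each node holds exactly one symbol and exactly one sound, so a symbol pronounced differently in different words is stored as separate sibling nodes. The class and function names, the use of an empty string for a silent letter, and the backtracking lookup are assumptions for illustration only; the abstract does not describe the lookup procedure.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    symbol: str = ""      # at most one symbol per node
    sound: str = ""       # exactly one sound for that symbol at this node
    children: dict = field(default_factory=dict)  # keyed by (symbol, sound)
    terminal: bool = False  # True if a dictionary entry ends here


def add_entry(root, symbols, sounds):
    """Store one entry as a branch; symbols and sounds align one to one."""
    if len(symbols) != len(sounds):
        raise ValueError("need exactly one sound per symbol")
    node = root
    for sym, snd in zip(symbols, sounds):
        node = node.children.setdefault((sym, snd), Node(sym, snd))
    node.terminal = True


def translate(root, symbols):
    """Return the sound sequence for a complete entry, or None if absent.

    ASSUMPTION: siblings may share a symbol with different sounds, so the
    walk backtracks until one branch matches the whole input.
    """
    def walk(node, i):
        if i == len(symbols):
            return [] if node.terminal else None
        for (sym, _), child in node.children.items():
            if sym == symbols[i]:
                rest = walk(child, i + 1)
                if rest is not None:
                    return [child.sound] + rest
        return None

    return walk(root, 0)


root = Node()
add_entry(root, "cat", ["k", "ae", "t"])
add_entry(root, "cell", ["s", "eh", "l", ""])  # "" marks a silent letter (assumed convention)
```

Here `translate(root, "cell")` first tries the branch where "c" sounds as "k", fails at "e", and then succeeds on the sibling branch where "c" sounds as "s".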

