Speech coding apparatus with single-dimension acoustic prototypes for a
speech recognizer
    1.
    Granted invention patent
    Status: Expired

    Publication number: US5280562A

    Publication date: 1994-01-18

    Application number: US770495

    Filing date: 1991-10-03

    CPC classification: G10L19/038 H03M7/3082

    Abstract: In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals having only one parameter value are stored. At least two single-dimension prototype vector signals have parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values and comprise one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vector signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
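
The coding step in this abstract can be illustrated with a minimal Python sketch. It assumes two features per frame and uses squared Euclidean distance as the prototype match score; the abstract does not specify the score, and every name and value below (`FIRST_DIM`, `SECOND_DIM`, `COMPOUND`, `encode`) is hypothetical.

```python
# Hypothetical single-dimension prototypes: each holds exactly one parameter value.
FIRST_DIM = {"a1": 0.0, "a2": 1.0}    # values of the first feature
SECOND_DIM = {"b1": 0.0, "b2": 5.0}   # values of the second feature

# Compound prototypes: unique identification value -> (first-dim key, second-dim key).
# "P1" and "P2" share the same first-dimension prototype "a1".
COMPOUND = {
    "P1": ("a1", "b1"),
    "P2": ("a1", "b2"),
    "P3": ("a2", "b2"),
}


def match_score(feature, proto_id):
    """Squared Euclidean distance to a compound prototype (assumed score; lower is better)."""
    first_key, second_key = COMPOUND[proto_id]
    return (feature[0] - FIRST_DIM[first_key]) ** 2 + (feature[1] - SECOND_DIM[second_key]) ** 2


def encode(features):
    """Replace each feature vector by the id of its best-matching compound prototype."""
    return [min(COMPOUND, key=lambda pid: match_score(f, pid)) for f in features]


labels = encode([(0.1, 0.2), (0.2, 4.8), (0.9, 5.1)])
print(labels)
```

Because compound prototypes only reference shared single-dimension values, three compound prototypes here need four stored parameters instead of six.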

    System and method for user driven interactive application integration
    2.
    Granted invention patent
    Status: In force

    Publication number: US07856600B2

    Publication date: 2010-12-21

    Application number: US11774603

    Filing date: 2007-07-08

    IPC classification: G06F3/048 G06F3/00 G06F15/16

    CPC classification: G06F17/3089

    Abstract: A system and method are provided for integrating portlets. When viewing portlets within a portal container, a user is presented with a choice of one or more sources of data and, for each source, one or more actions that the user can take regarding the source. When an action is selected, it causes the source data to be transferred to one or more “target” portlets that have also been activated by the user. The set of actions available from a given source is automatically provided given the available target portlets. As each portlet is initialized, it informs a “broker” of the actions that the portlet supports along with the type of data that is used by the action. When a portal page is being constructed, each portlet identifies to the broker the sources of data within the portlet along with the values and data types corresponding to the sources.
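
The broker mechanism in this abstract can be sketched in a few lines of Python. This is an illustration only, not the patented implementation or any real portal API: the `Broker` class, its method names, and the portlet and action names are all assumptions.

```python
class Broker:
    """Hypothetical broker: maps a data type to the actions that accept it."""

    def __init__(self):
        self._actions = {}  # data_type -> list of (portlet, action, handler)

    def register_action(self, portlet, action, data_type, handler):
        # Called as each portlet is initialized.
        self._actions.setdefault(data_type, []).append((portlet, action, handler))

    def actions_for(self, data_type):
        # Called while the page is built, once per data source a portlet reports.
        return [(p, a) for p, a, _ in self._actions.get(data_type, [])]

    def invoke(self, portlet, action, data_type, value):
        # Transfer the source value to the chosen target portlet.
        for p, a, handler in self._actions.get(data_type, []):
            if (p, a) == (portlet, action):
                return handler(value)
        raise LookupError(f"no action {action!r} on {portlet!r} for {data_type!r}")


broker = Broker()
broker.register_action("MapPortlet", "show_on_map", "address",
                       lambda v: f"map centered on {v}")
broker.register_action("OrdersPortlet", "list_orders", "customer_id",
                       lambda v: f"orders for {v}")

# A source of type "address" is offered only the actions that accept addresses.
menu = broker.actions_for("address")
result = broker.invoke("MapPortlet", "show_on_map", "address", "1 Main St")
```

Keying the registry by data type is one way to make the action menu follow automatically from whichever target portlets are present on the page.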

    MVC (model-view-controller) based multi-modal authoring tool and development environment
    3.
    Granted invention patent
    Status: In force

    Publication number: US06996800B2

    Publication date: 2006-02-07

    Application number: US10007037

    Filing date: 2001-12-04

    IPC classification: G06F9/44

    CPC classification: G06F8/38

    Abstract: Application development tools and methods for building multi-channel, multi-device and multi-modal applications, and in particular, systems and methods for developing applications whereby a user can interact in parallel with the same information via a multiplicity of channels and user interfaces, while unified, synchronized views of the information are presented across the various channels or devices deployed by the user to interact with the information. In a preferred embodiment, application frameworks and development tools are based on an MVC (Model-View-Controller) design paradigm that is adapted to provide synchronized multi-modal interactions. Multi-channel authoring can be developed using a similar methodology.
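
The idea of synchronized views over one model can be sketched as follows. This is a generic MVC illustration in Python under assumed names (`Model`, `TextView`, `SpeechView`, `Controller`); the abstract does not describe the actual framework at this level of detail.

```python
class Model:
    """Single source of truth shared by every channel."""

    def __init__(self):
        self.state = {}
        self._views = []

    def attach(self, view):
        self._views.append(view)

    def update(self, field, value):
        self.state[field] = value
        # Every attached view is refreshed, which keeps the channels in sync.
        for view in self._views:
            view.render(field, value)


class TextView:
    """Stand-in for a visual channel."""

    def __init__(self):
        self.shown = []

    def render(self, field, value):
        self.shown.append(f"{field}: {value}")


class SpeechView:
    """Stand-in for a voice channel."""

    def __init__(self):
        self.spoken = []

    def render(self, field, value):
        self.spoken.append(f"The {field} is {value}.")


class Controller:
    def __init__(self, model):
        self.model = model

    def handle_input(self, channel, field, value):
        # Input from any channel is routed through the same model.
        self.model.update(field, value)


model = Model()
gui, voice = TextView(), SpeechView()
model.attach(gui)
model.attach(voice)
Controller(model).handle_input("voice", "city", "Boston")
```

An input spoken on the voice channel updates the visual view as well, because both views observe the same model.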

    Method and system for reducing perplexity in speech recognition via
caller identification
    4.
    Granted invention patent
    Status: Expired

    Publication number: US5802251A

    Publication date: 1998-09-01

    Application number: US523755

    Filing date: 1995-09-05

    Abstract: A method and system are disclosed for reducing perplexity in a speech recognition system within a telephonic network based upon determined caller identity. In a speech recognition system which processes input frames of speech against stored templates representing speech, a core library of speech templates is created and stored representing a basic vocabulary of speech. Multiple caller-specific libraries of speech templates are also created and stored, each library containing speech templates which represent a specialized vocabulary and pronunciations for a specific geographic location and a particular individual. Additionally, the caller-specific libraries of speech templates are preferably processed to reflect the reduced bandwidth, transmission channel variations and other signal variations introduced into the system via a telephonic network. The identification of a caller is determined upon connection to the network via standard caller identification circuitry. Upon detection of a spoken utterance, that utterance is processed against the core library, if the caller's identity cannot be determined, or against a particular caller-specific library, if the caller's identity can be determined, thereby greatly enhancing the efficiency and accuracy of speech recognition by the system.
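
The library selection described in this abstract can be sketched in Python. The sketch assumes each template is a single number and that recognition picks the nearest template; real templates are sequences of speech frames. All names, phone numbers, and values are hypothetical.

```python
# Hypothetical templates: word -> a single scalar feature value.
CORE_LIBRARY = {"yes": 1.0, "no": 2.0}

# Caller-specific libraries, keyed by the number reported by caller ID.
CALLER_LIBRARIES = {
    "+1-555-0100": {"yes": 1.2, "no": 2.1, "refill": 3.0},
}


def select_library(caller_id):
    """Use the caller-specific library when the caller is known, else the core library."""
    return CALLER_LIBRARIES.get(caller_id, CORE_LIBRARY)


def recognize(feature, caller_id=None):
    """Return the word whose template is closest to the input feature."""
    library = select_library(caller_id)
    return min(library, key=lambda word: abs(feature - library[word]))


known = recognize(2.9, "+1-555-0100")   # caller identified
unknown = recognize(2.9, None)          # caller ID unavailable
```

The same input resolves to a specialized word for the identified caller and falls back to the basic vocabulary when no identity is available.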

    MVC (Model-View-Controller) based multi-modal authoring tool and development environment
    5.
    Granted invention patent
    Status: In force

    Publication number: US07900186B2

    Publication date: 2011-03-01

    Application number: US11190572

    Filing date: 2005-07-27

    IPC classification: G06F9/44

    CPC classification: G06F8/38

    Abstract: Application development tools and methods for building multi-channel, multi-device and multi-modal applications, and in particular, systems and methods for developing applications whereby a user can interact in parallel with the same information via a multiplicity of channels and user interfaces, while unified, synchronized views of the information are presented across the various channels or devices deployed by the user to interact with the information. In a preferred embodiment, application frameworks and development tools are based on an MVC (Model-View-Controller) design paradigm that is adapted to provide synchronized multi-modal interactions. Multi-channel authoring can be developed using a similar methodology.

    System and method for user driven interactive application integration
    6.
    Granted invention patent
    Status: Expired

    Publication number: US07281217B2

    Publication date: 2007-10-09

    Application number: US10448968

    Filing date: 2003-05-30

    IPC classification: G06F17/00

    CPC classification: G06F17/3089

    Abstract: A system and method are provided for integrating portlets. When viewing portlets within a portal container, a user is presented with a choice of one or more sources of data and, for each source, one or more actions that the user can take regarding the source. When an action is selected, it causes the source data to be transferred to one or more “target” portlets that have also been activated by the user. The set of actions available from a given source is automatically provided given the available target portlets. As each portlet is initialized, it informs a “broker” of the actions that the portlet supports along with the type of data that is used by the action. When a portal page is being constructed, each portlet identifies to the broker the sources of data within the portlet along with the values and data types corresponding to the sources.

    Method and system for location-specific speech recognition
    7.
    Granted invention patent
    Status: Expired

    Publication number: US5524169A

    Publication date: 1996-06-04

    Application number: US175701

    Filing date: 1993-12-30

    Abstract: A method and system for reducing perplexity in a speech recognition system based upon determined geographic location. In a mobile speech recognition system which processes input frames of speech against stored templates representing speech, a core library of speech templates is created and stored representing a basic vocabulary of speech. Multiple location-specific libraries of speech templates are also created and stored, each library containing speech templates representing a specialized vocabulary for a specific geographic location. The geographic location of the mobile speech recognition system is then periodically determined utilizing a cellular telephone system, a geopositioning satellite system or other similar systems, and a particular one of the location-specific libraries of speech templates is identified for the current location of the system. Input frames of speech are then processed against the combination of the core library and the particular location-specific library to greatly enhance the accuracy and efficiency of speech recognition by the system. Each location-specific library preferably includes speech templates representative of location place names, proper names, and business establishments within a specific geographic location.
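
A minimal Python sketch of combining the core library with one location-specific library follows. It assumes two-value templates and nearest-template matching by squared distance; the library contents, location keys, and function names are hypothetical, and the location lookup itself (cellular or satellite) is left out.

```python
# Hypothetical templates: phrase -> a tiny two-value feature vector.
CORE_LIBRARY = {"call": (0.0, 1.0), "stop": (1.0, 0.0)}

# One specialized vocabulary per geographic location (place names and the like).
LOCATION_LIBRARIES = {
    "austin": {"congress avenue": (0.5, 0.5)},
    "boston": {"fenway": (0.9, 0.9)},
}


def active_templates(location):
    """Combine the core library with the library for the current location, if any."""
    templates = dict(CORE_LIBRARY)
    templates.update(LOCATION_LIBRARIES.get(location, {}))
    return templates


def recognize(frame, location):
    """Return the phrase whose template has the smallest squared distance to the frame."""
    templates = active_templates(location)
    return min(
        templates,
        key=lambda w: sum((a - b) ** 2 for a, b in zip(frame, templates[w])),
    )


in_boston = recognize((0.85, 0.95), "boston")
in_austin = recognize((0.85, 0.95), "austin")
```

Only the place names for the current location compete with the core vocabulary, so the search space stays small as the system moves.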
