System and method for speech trend analytics with objective function and feature constraints
    2.
    发明授权
    System and method for speech trend analytics with objective function and feature constraints 有权
    具有目标函数和特征约束的语音趋势分析的系统和方法

    公开(公告)号:US09213978B2

    公开(公告)日:2015-12-15

    申请号:US12895337

    申请日:2010-09-30

    CPC classification number: G06N7/005 G06F17/30684 G06Q30/01

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing trend analysis of speech. A system practicing the method receives a speech trend analysis request having candidate feature constraints, an objective function with respect to a speech trend to be analyzed, and a set of speech record constraints. The system selects a subset of speech records from the group of speech records based on the set of speech record constraints to yield selected speech records, identifies features in the selected speech records based on the set of candidate feature constraints to yield identified features, and assigns a weight to each of the identified features based on the objective function. Then the system ranks the identified features by their respective weights to yield ranked identified features, and outputs at least one of the ranked identified features associated with a speech-based trend in response to the speech trend analysis request.

    Abstract translation: 这里公开了用于执行语音趋势分析的系统,方法和非暂时的计算机可读存储介质。 实施该方法的系统接收具有候选特征约束的语音趋势分析请求,关于待分析的语音趋势的目标函数和一组语音记录约束。 该系统基于语音记录约束集从语音记录组中选择语音记录的子集,以产生所选择的语音记录,基于候选特征约束集来识别所选语音记录中的特征,以产生所识别的特征,并且分配 基于目标函数对每个识别的特征的权重。 然后,系统通过它们各自的权重对所识别的特征进行排序,以产生排名确定的特征,并响应于语音趋势分析请求输出与基于语音的趋势相关联的排名确定的特征中的至少一个。

    On-demand language translation for television programs
    5.
    发明授权
    On-demand language translation for television programs 有权
    电视节目的按需语言翻译

    公开(公告)号:US08805668B2

    公开(公告)日:2014-08-12

    申请号:US12897149

    申请日:2010-10-04

    CPC classification number: G06F17/289

    Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.

    Abstract translation: 在一个实施例中,提供了提供按需翻译服务的方法。 用户可能被收取减少的费用或不使用按需翻译服务的费用,以便向订户显示商业消息,基于用户信息选​​择商业消息。 可以接收包括源语言的信息的多媒体信号。 可以从多媒体信号中获取源语言中的文本信息。 文本可以从源语言翻译成目标语言。 基于翻译的文本的翻译信息可以被发送到处理设备以呈现给订阅者。 所接收的多媒体信号可以被发送到多媒体设备以供观看。

    Method and apparatus for predicting word accuracy in automatic speech recognition systems
    6.
    发明授权
    Method and apparatus for predicting word accuracy in automatic speech recognition systems 有权
    在自动语音识别系统中预测字精度的方法和装置

    公开(公告)号:US08538752B2

    公开(公告)日:2013-09-17

    申请号:US13465886

    申请日:2012-05-07

    CPC classification number: G10L15/20 G10L15/10

    Abstract: The invention comprises a method and apparatus for predicting word accuracy. Specifically, the method comprises obtaining an utterance in speech data where the utterance comprises an actual word string, processing the utterance for generating an interpretation of the actual word string, processing the utterance to identify at least one utterance frame, and predicting a word accuracy associated with the interpretation according to at least one stationary signal-to-noise ratio and at least one non-stationary signal to noise ratio, wherein the at least one stationary signal-to-noise ratio and the at least one non-stationary signal to noise ratio are determined according to a frame energy associated with each of the at least one utterance frame.

    Abstract translation: 本发明包括一种用于预测词精度的方法和装置。 具体地说,该方法包括获得语音数据中的话语,其中话语包括实际字串,处理用于产生实际字串的解释的话语,处理话语以识别至少一个话语帧,以及预测相关的单词精度 根据至少一个稳定的信噪比和至少一个非平稳的信噪比的解释,其中所述至少一个固定信噪比和所述至少一个非平稳信号与噪声 根据与所述至少一个话音帧中的每一个相关联的帧能量确定比率。

    SYSTEM AND METHOD FOR AN ENHANCED SHOPPING EXPERIENCE
    7.
    发明申请
    SYSTEM AND METHOD FOR AN ENHANCED SHOPPING EXPERIENCE 审中-公开
    用于增强购物体验的系统和方法

    公开(公告)号:US20120290435A1

    公开(公告)日:2012-11-15

    申请号:US13554578

    申请日:2012-07-20

    CPC classification number: G06Q30/02 G06Q30/0601 G06Q30/0643

    Abstract: Disclosed herein are systems, methods, and computer readable-media for creating a virtual shopping area. The method includes receiving a query from a user and an automated input specific to the user from a computing device, generating a list of merchants based on the query and the automated input, generating a virtual shopping area from the list of merchants and based on one or more constraints, and displaying the virtual shopping area on the computing device. One optional step is presenting to the user an interface to purchase query-related items from merchants in the virtual shopping area. The method optionally includes receiving an indication of intent to purchase an item from the user, displaying an image of the item to the user, and dynamically updating the displayed image of the item as the user specifies item-specific details. The list of merchants can be restricted to merchants geographically close to the user.

    Abstract translation: 这里公开了用于创建虚拟购物区域的系统,方法和计算机可读介质。 该方法包括从计算设备接收来自用户的查询和用户特有的自动输入,基于查询和自动输入生成商家列表,从商家列表中生成虚拟购物区域,并基于一个 或更多的约束,以及在计算设备上显示虚拟购物区域。 一个可选步骤向用户呈现从虚拟购物区域中的商家购买查询相关项目的界面。 该方法可选地包括从用户接收用于购买项目的意图的指示,向用户显示该项目的图像,并且在用户指定项目特定细节时动态地更新该项目的显示图像。 商家列表可以限于地理位置靠近用户的商家。

    System and method for increasing accuracy of searches based on communication network
    8.
    发明授权
    System and method for increasing accuracy of searches based on communication network 有权
    基于通信网络提高搜索精度的系统和方法

    公开(公告)号:US08170866B2

    公开(公告)日:2012-05-01

    申请号:US13164347

    申请日:2011-06-20

    CPC classification number: G10L15/193 G10L15/065 G10L2015/228

    Abstract: Disclosed are systems, methods and computer-readable media for using a local communication network to generate a speech model. The method includes retrieving for an individual a list of numbers in a calling history, identifying a local neighborhood associated with each number in the calling history, truncating the local neighborhood associated with each number based on the at least one parameter, retrieving a local communication network associated with each number in the calling history and each phone number in the local neighborhood, and creating a language model for the individual based on the retrieved local communication network. The generated language model may be used for improved automatic speech recognition for audible searches as well as other modules in a spoken dialog system.

    Abstract translation: 公开了用于使用本地通信网络生成语音模型的系统,方法和计算机可读介质。 该方法包括检索个人在呼叫历史中的号码列表,识别与呼叫历史中的每个号码相关联的本地邻域,基于该至少一个参数截断与每个号码相关联的本地邻域,检索本地通信网络 与呼叫历史中的每个号码和本地邻域中的每个电话号码相关联,以及基于所检索的本地通信网络为个人创建语言模型。 生成的语言模型可以用于改善声音搜索的自动语音识别以及口语对话系统中的其他模块。

    System and method for improving robustness of speech recognition using vocal tract length normalization codebooks
    9.
    发明授权
    System and method for improving robustness of speech recognition using vocal tract length normalization codebooks 有权
    使用声道长度归一化码本提高语音识别鲁棒性的系统和方法

    公开(公告)号:US08160875B2

    公开(公告)日:2012-04-17

    申请号:US12869039

    申请日:2010-08-26

    Applicant: Mazin Gilbert

    Inventor: Mazin Gilbert

    CPC classification number: G10L15/07

    Abstract: Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for a each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook.

    Abstract translation: 公开了用于执行语音识别的系统,方法和计算机可读介质。 方法实施例包括从具有对接收到的语音样本的最小声距离的多个码本中选择码本,所述多个码本通过(a)计算多个扬声器中的每一个的声道长度的处理而生成( b)对于所述多个扬声器中的每一个,聚类语音向量,以及(c)为每个说话者创建码本,所述码本包含用于每个语音向量的相应说话者声道长度,语音向量和可选矢量权重的条目, (2)应用与所选码本相关联的相应声道长度,以规范化用于语音识别的接收到的语音样本,以及(3)基于与所选码本相关联的相应声道长度来识别所接收的语音样本。

    Finding the website of a business using the business name
    10.
    发明授权
    Finding the website of a business using the business name 有权
    使用企业名称查找企业的网站

    公开(公告)号:US08065300B2

    公开(公告)日:2011-11-22

    申请号:US12075570

    申请日:2008-03-12

    CPC classification number: G06F17/30864 Y10S707/944

    Abstract: A system and method are provided for augmenting information on business directory databases. Using the business name contained in a business directory database and Web data mining technology, the website of a business is found and validated, prior to enriching the database entries.

    Abstract translation: 提供了一种用于增加业务目录数据库信息的系统和方法。 使用商业目录数据库中包含的业务名称和Web数据挖掘技术,在丰富数据库条目之前,会找到并验证业务的网站。

Patent Agency Ranking