USER INTERACTION FOR CONTENT BASED STORAGE AND RETRIEVAL
    1.
    Invention application
    Status: Under examination (published)

    Publication No.: US20090064008A1

    Publication date: 2009-03-05

    Application No.: US11848781

    Filing date: 2007-08-31

    IPC classification: G06F3/048

    CPC classification: G06F3/04883 G06F16/54

    Abstract: A graphic user interface system for use with a content based retrieval system includes an active display having display areas. For example, the display areas include a main area providing an overview of database contents by displaying representative samples of the database contents. The display areas also include one or more query areas into which one or more of the representative samples can be moved from the main area by a user employing gesture based interaction. A query formulation module employs the one or more representative samples moved into the query area to provide feedback to the content based retrieval system.

    Discriminative training for speaker and speech verification
    2.
    Granted invention patent
    Status: In force

    Publication No.: US07454339B2

    Publication date: 2008-11-18

    Application No.: US11312981

    Filing date: 2005-12-20

    IPC classification: G10L15/06 G10L17/00

    CPC classification: G10L17/04 G10L15/063

    Abstract: A method for discriminatively training acoustic models is provided for automated speaker verification (SV) and speech (or utterance) verification (UV) systems. The method includes: defining a likelihood ratio for a given speech segment, whose speaker identity (for SV system) or linguist identity (for UV system) is known, using a corresponding acoustic model, and an alternative acoustic model which represents all other speakers (in SV) or all other linguist identities (in UV); determining an average likelihood ratio score for the likelihood ratio scores over a set of training utterances (referred to as true data set) whose speaker identities (for SV) or linguist identities (for UV) are the same; determining an average likelihood ratio score for the likelihood ratio scores over a competing set of training utterances which excludes the speech data in the true data set (referred to as competing data set); and optimizing a difference between the average likelihood ratio score over the true data set and the average likelihood ratio score over the competing data set, thereby improving the acoustic model.
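
The quantity that this abstract says is optimized can be sketched in a few lines. This is only an illustration under stated assumptions: scores are taken in the log domain, each segment has already been scored by a target model and an alternative model, and the function names and sample numbers are hypothetical rather than taken from the patent.

```python
from statistics import mean

def log_likelihood_ratio(target_loglik, alternative_loglik):
    """Log-likelihood ratio of one segment: target model vs. alternative model."""
    return target_loglik - alternative_loglik

def average_llr(scored_segments):
    """Average ratio over (target_loglik, alternative_loglik) pairs."""
    return mean(log_likelihood_ratio(t, a) for t, a in scored_segments)

def separation_objective(true_set, competing_set):
    """Average ratio on the true set minus average ratio on the competing set.

    A trainer would adjust the acoustic model parameters to increase this value.
    """
    return average_llr(true_set) - average_llr(competing_set)

# Hypothetical log-likelihoods: (target model, alternative model) per utterance.
true_set = [(-10.0, -14.0), (-9.0, -12.0)]
competing_set = [(-15.0, -13.0), (-16.0, -15.0)]
objective = separation_objective(true_set, competing_set)
print(objective)
```

The sketch only evaluates the objective; the parameter update that increases it is outside its scope.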

    SYSTEM AND METHOD FOR IDENTIFYING OBJECTS IN AN IMAGE USING POSITIONAL INFORMATION
    3.
    Invention application
    Status: Under examination (published)

    Publication No.: US20100195872A1

    Publication date: 2010-08-05

    Application No.: US12678262

    Filing date: 2008-09-18

    IPC classification: G06K9/00 G06K9/46

    CPC classification: G06K9/00771

    Abstract: A computer-implemented method is provided for identifying objects in an image. The method includes: capturing a series of images of a scene using a camera; receiving a topographical map for the scene that defines distances between objects in the scene; determining distances between objects in the scene from a given image; approximating identities of objects in the given image by comparing the distances between objects as determined from the given image in relation to the distances between objects from the map. The identities of objects can be re-estimated using features of the objects extracted from the other images.
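
One way to picture the distance comparison step is an exhaustive search over labellings. This is a minimal sketch, not the patented procedure: it assumes pairwise distances are already available for both the map and the image, uses brute-force permutation search with a sum of absolute differences as the mismatch measure, and all object names and numbers are made up.

```python
from itertools import permutations

def assign_identities(map_dist, image_dist, names):
    """Label detections 0..n-1 so their pairwise distances best match the map."""
    n = len(image_dist)
    best_labels, best_error = None, float("inf")
    for labels in permutations(names, n):
        error = 0.0
        for i in range(n):
            for j in range(i + 1, n):
                pair = frozenset((labels[i], labels[j]))
                # Mismatch between the map distance and the measured distance.
                error += abs(map_dist[pair] - image_dist[i][j])
        if error < best_error:
            best_labels, best_error = labels, error
    return list(best_labels), best_error

# Hypothetical map: distances between named objects.
map_dist = {
    frozenset(("door", "desk")): 2.0,
    frozenset(("door", "window")): 5.0,
    frozenset(("desk", "window")): 4.0,
}
# Hypothetical distances measured between three unlabeled detections.
image_dist = [
    [0.0, 4.1, 5.2],
    [4.1, 0.0, 1.9],
    [5.2, 1.9, 0.0],
]
labels, error = assign_identities(map_dist, image_dist, ["door", "desk", "window"])
print(labels, round(error, 2))
```

Exhaustive search is only practical for a handful of objects; it is used here to keep the comparison explicit.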

    VIRTUAL KEYPAD SYSTEMS AND METHODS
    4.
    Invention application
    Status: Under examination (published)

    Publication No.: US20100164897A1

    Publication date: 2010-07-01

    Application No.: US12666916

    Filing date: 2008-06-26

    IPC classification: G06F3/041

    Abstract: Accordingly, a virtual keypad system for inputting text is provided. A virtual keypad system includes a remote controller having at least one touchpad incorporated therein and divided into a plurality of touch zones. A display device is in data communication with the remote controller and is operable to display a user interface including a keypad, where each key of the keypad is mapped to a touch zone of the touchpad. A prediction module, in response to an operator pressing a given touch zone to select a particular character, performs one or more key prediction methods to predict one or more next plausible keys. A key mapping module remaps the touch zones of the touchpad to the keys of the keypad based on the one or more next plausible keys.
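
The interplay of the prediction module and the key mapping module might look like the sketch below. Everything concrete in it is an assumption made for illustration: the prediction method is a simple bigram count over a toy word list, the touchpad is treated as a single row of zones, and predicted keys are simply given a larger share of zones through a hypothetical `boost` weight.

```python
from collections import Counter

def predict_next_keys(last_char, corpus, top_n=2):
    """Predict likely next characters from bigram counts in a small word list."""
    counts = Counter(
        b for word in corpus for a, b in zip(word, word[1:]) if a == last_char
    )
    return [ch for ch, _ in counts.most_common(top_n)]

def remap_zones(keys, num_zones, plausible, boost=3):
    """Map each touch zone to a key, giving predicted keys `boost` times the weight."""
    weights = [boost if k in plausible else 1 for k in keys]
    total = sum(weights)
    mapping = []
    cumulative = 0   # total weight of the keys already passed
    key_index = 0
    for zone in range(num_zones):
        # Centre of this zone, expressed in weight units.
        position = (zone + 0.5) * total / num_zones
        while position >= cumulative + weights[key_index]:
            cumulative += weights[key_index]
            key_index += 1
        mapping.append(keys[key_index])
    return mapping

corpus = ["the", "that", "this", "then", "to"]
plausible = predict_next_keys("t", corpus)
mapping = remap_zones(["a", "e", "h", "o"], 8, plausible)
print(plausible, mapping)
```

With this weighting, keys that are likely to follow the last character occupy more of the touchpad and are easier to hit, while the remaining keys stay reachable.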

    Virtual keypad systems and methods
    5.
    Invention application
    Status: In force

    Publication No.: US20090007001A1

    Publication date: 2009-01-01

    Application No.: US11977346

    Filing date: 2007-10-24

    IPC classification: G06F3/048

    Abstract: Accordingly, a virtual keypad system for inputting text is provided. A virtual keypad system includes a remote controller having at least one touchpad incorporated therein and divided into a plurality of touch zones. A display device is in data communication with the remote controller and is operable to display a user interface including a keypad, where each key of the keypad is mapped to a touch zone of the touchpad. A prediction module, in response to an operator pressing a given touch zone to select a particular character, performs one or more key prediction methods to predict one or more next plausible keys. A key mapping module remaps the touch zones of the touchpad to the keys of the keypad based on the one or more next plausible keys.

    Discriminative training for speaker and speech verification
    6.
    Invention application
    Status: In force

    Publication No.: US20070143109A1

    Publication date: 2007-06-21

    Application No.: US11312981

    Filing date: 2005-12-20

    IPC classification: G10L15/00

    CPC classification: G10L17/04 G10L15/063

    Abstract: A method for discriminatively training acoustic models is provided for automated speaker verification (SV) and speech (or utterance) verification (UV) systems. The method includes: defining a likelihood ratio for a given speech segment, whose speaker identity (for SV system) or linguist identity (for UV system) is known, using a corresponding acoustic model, and an alternative acoustic model which represents all other speakers (in SV) or all other linguist identities (in UV); determining an average likelihood ratio score for the likelihood ratio scores over a set of training utterances (referred to as true data set) whose speaker identities (for SV) or linguist identities (for UV) are the same; determining an average likelihood ratio score for the likelihood ratio scores over a competing set of training utterances which excludes the speech data in the true data set (referred to as competing data set); and optimizing a difference between the average likelihood ratio score over the true data set and the average likelihood ratio score over the competing data set, thereby improving the acoustic model.

    Discriminative training of HMM models using maximum margin estimation for speech recognition
    7.
    Invention application
    Status: Under examination (published)

    Publication No.: US20070083373A1

    Publication date: 2007-04-12

    Application No.: US11247854

    Filing date: 2005-10-11

    IPC classification: G10L15/14

    CPC classification: G10L15/144

    Abstract: An improved discriminative training method is provided for hidden Markov models. The method includes: defining a measure of separation margin for the data; identifying a subset of training utterances having utterances misrecognized by the models; defining a training criterion for the models based on maximizing the separation margin; formulating the training criterion as a constrained minimax optimization problem; and solving the constrained minimax optimization problem over the subset of training utterances, thereby discriminatively training the models.
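
The separation margin and the inner part of the minimax criterion can be illustrated as follows. This is a hedged sketch, not the patent's formulation: the margin is assumed to be the score of the correct model minus the best competing score, the scores are hypothetical log-likelihoods, and no optimizer or constraint handling is shown.

```python
def separation_margin(scores, true_label):
    """Score of the true model minus the best score among competing models."""
    best_competitor = max(s for label, s in scores.items() if label != true_label)
    return scores[true_label] - best_competitor

def minimum_margin(utterances):
    """Smallest margin over a set of (scores, true_label) pairs.

    A max-margin trainer would adjust model parameters to maximize this value;
    the minimum here is the inner part of a minimax criterion.
    """
    return min(separation_margin(scores, label) for scores, label in utterances)

# Hypothetical per-model log-likelihoods for three training utterances.
utterances = [
    ({"yes": -10.0, "no": -14.0}, "yes"),
    ({"yes": -12.0, "no": -11.5}, "yes"),  # misrecognized: margin is negative
    ({"yes": -20.0, "no": -13.0}, "no"),
]
margins = [separation_margin(scores, label) for scores, label in utterances]
misrecognized = [u for u, m in zip(utterances, margins) if m < 0]
print(margins, minimum_margin(utterances))
```

A negative margin marks an utterance that the current models misrecognize, which is how the subset mentioned in the abstract is selected in this sketch.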

    Virtual keypad systems and methods
    8.
    Granted invention patent
    Status: In force

    Publication No.: US08065624B2

    Publication date: 2011-11-22

    Application No.: US11977346

    Filing date: 2007-10-24

    IPC classification: G06F3/048

    Abstract: Accordingly, a virtual keypad system for inputting text is provided. A virtual keypad system includes a remote controller having at least one touchpad incorporated therein and divided into a plurality of touch zones. A display device is in data communication with the remote controller and is operable to display a user interface including a keypad, where each key of the keypad is mapped to a touch zone of the touchpad. A prediction module, in response to an operator pressing a given touch zone to select a particular character, performs one or more key prediction methods to predict one or more next plausible keys. A key mapping module remaps the touch zones of the touchpad to the keys of the keypad based on the one or more next plausible keys.

    Method for efficient, safe and reliable data entry by voice under adverse conditions
    9.

    Publication No.: US06996528B2

    Publication date: 2006-02-07

    Application No.: US09921766

    Filing date: 2001-08-03

    IPC classification: G10L15/22

    CPC classification: G10L15/065 G10L15/22

    Abstract: A method and apparatus for data entry by voice under adverse conditions is disclosed. More specifically it provides a way for efficient and robust form filling by voice. A form can typically contain one or several fields that must be filled in. The user communicates to a speech recognition system and word spotting is performed upon the utterance. The spotted words of an utterance form a phrase that can contain field-specific values and/or commands. Recognized values are echoed back to the speaker via a text-to-speech system. Unreliable or unsafe inputs for which the confidence measure is found to be low (e.g. ill-pronounced speech or noises) are rejected by the spotter. Speaker adaptation is furthermore performed transparently to improve speech recognition accuracy. Other input modalities can be additionally supported (e.g. keyboard and touch-screen). The system maintains a dialogue history to enable editing and correction operations on all active fields.
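
The rejection of low-confidence input and the dialogue history can be sketched roughly as below. The data layout is an assumption for illustration only: the word spotter is taken to deliver (field, value, confidence) triples, the threshold of 0.6 is arbitrary, and the echo is returned as text instead of being sent to a text-to-speech system.

```python
def apply_utterance(form, history, spotted, threshold=0.6):
    """Apply spotted (field, value, confidence) triples to the form.

    Items below the confidence threshold or naming an unknown field are
    rejected. Accepted values are recorded in the history so they can be
    undone, and are returned as the phrases to echo back to the speaker.
    """
    echoed = []
    for field, value, confidence in spotted:
        if confidence < threshold or field not in form:
            continue
        history.append((field, form[field]))  # remember the previous value
        form[field] = value
        echoed.append(f"{field}: {value}")
    return echoed

def undo(form, history):
    """Revert the most recent accepted entry, if any."""
    if history:
        field, previous = history.pop()
        form[field] = previous

form = {"name": None, "quantity": None}
history = []
first_echo = apply_utterance(
    form, history, [("name", "Smith", 0.92), ("quantity", "forty", 0.35)]
)
second_echo = apply_utterance(form, history, [("quantity", "fourteen", 0.88)])
undo(form, history)  # the speaker corrects the last entry
print(first_echo, second_echo, form)
```

Keeping the previous value of each field in the history is one simple way to support the editing and correction operations the abstract mentions.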

    Joint signal and model based noise matching noise robustness method for automatic speech recognition
    10.
    Granted invention patent
    Status: In force

    Publication No.: US07729908B2

    Publication date: 2010-06-01

    Application No.: US11369936

    Filing date: 2006-03-06

    IPC classification: G10L15/20 G10L15/06 G10L21/02

    CPC classification: G10L15/20 G10L21/0216

    Abstract: A noise robustness method operates jointly in a signal domain and a model domain. For example, energy is added in the signal domain for frequency bands where an actual noise level of an incoming signal is lower than a noise level used to train models, thus obtaining a compensated signal. Also, energy is added in the model domain for frequency bands where noise level of the incoming signal or the compensated signal is higher than the noise level used to train the models. Moreover, energy is never removed, thereby avoiding problems of higher sensitivity of energy removal to estimation errors.
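
The per-band decision described in this abstract reduces to a small rule, sketched here under simplifying assumptions: noise levels are given as one linear energy value per frequency band, compensation is a plain additive offset, and the function name and sample values are hypothetical.

```python
def joint_compensation(signal_noise, training_noise):
    """Per-band energy to add in the signal domain and in the model domain.

    Bands where the incoming noise is below the training noise get energy
    added to the signal; bands where it is above get energy added to the
    models. Energy is never removed, so both lists are non-negative.
    """
    add_to_signal = [max(t - s, 0.0) for s, t in zip(signal_noise, training_noise)]
    add_to_model = [max(s - t, 0.0) for s, t in zip(signal_noise, training_noise)]
    return add_to_signal, add_to_model

# Hypothetical noise energy per frequency band.
signal_noise = [1.0, 4.0, 2.0]
training_noise = [3.0, 2.0, 2.0]
add_to_signal, add_to_model = joint_compensation(signal_noise, training_noise)
print(add_to_signal, add_to_model)
```

For any band, at most one of the two offsets is non-zero, so signal and models are moved toward a common noise level from below only.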
