SYSTEM AND METHOD FOR RECOGNIZING ENVIRONMENTAL SOUND
    3.
    发明申请
    SYSTEM AND METHOD FOR RECOGNIZING ENVIRONMENTAL SOUND 有权
    用于识别环境声音的系统和方法

    公开(公告)号:US20120224706A1

    公开(公告)日:2012-09-06

    申请号:US13285971

    申请日:2011-10-31

    IPC分类号: H04R29/00

    CPC分类号: G10L15/10 G10L15/20 G10L25/00

    摘要: A method for recognizing an environmental sound in a client device in cooperation with a server is disclosed. The client device includes a client database having a plurality of sound models of environmental sounds and a plurality of labels, each of which identifies at least one sound model. The client device receives an input environmental sound and generates an input sound model based on the input environmental sound. At the client device, a similarity value is determined between the input sound model and each of the sound models to identify one or more sound models from the client database that are similar to the input sound model. A label is selected from labels associated with the identified sound models, and the selected label is associated with the input environmental sound based on a confidence level of the selected label.

    摘要翻译: 公开了一种与服务器协作来识别客户端设备中的环境声音的方法。 客户端设备包括具有环境声音的多个声音模型的客户数据库和多个标签,每个标签识别至少一个声音模型。 客户端设备接收输入环境声音,并根据输入的环境声音生成输入声音模型。 在客户机设备处,在输入声音模型和每个声音模型之间确定相似性值,以从客户端数据库识别类似于输入声音模型的一个或多个声音模型。 从与识别的声音模型相关联的标签中选择标签,并且所选择的标签基于所选标签的置信水平与输入的环境声音相关联。

    System and method for recognizing environmental sound
    4.
    发明授权
    System and method for recognizing environmental sound 有权
    识别环境声音的系统和方法

    公开(公告)号:US09443511B2

    公开(公告)日:2016-09-13

    申请号:US13285971

    申请日:2011-10-31

    CPC分类号: G10L15/10 G10L15/20 G10L25/00

    摘要: A method for recognizing an environmental sound in a client device in cooperation with a server is disclosed. The client device includes a client database having a plurality of sound models of environmental sounds and a plurality of labels, each of which identifies at least one sound model. The client device receives an input environmental sound and generates an input sound model based on the input environmental sound. At the client device, a similarity value is determined between the input sound model and each of the sound models to identify one or more sound models from the client database that are similar to the input sound model. A label is selected from labels associated with the identified sound models, and the selected label is associated with the input environmental sound based on a confidence level of the selected label.

    摘要翻译: 公开了一种与服务器协作来识别客户端设备中的环境声音的方法。 客户端设备包括具有环境声音的多个声音模型的客户数据库和多个标签,每个标签识别至少一个声音模型。 客户端设备接收输入环境声音,并根据输入的环境声音生成输入声音模型。 在客户机设备处,在输入声音模型和每个声音模型之间确定相似性值,以从客户端数据库识别类似于输入声音模型的一个或多个声音模型。 从与识别的声音模型相关联的标签中选择标签,并且所选择的标签基于所选标签的置信水平与输入的环境声音相关联。

    SOUND RECOGNITION METHOD AND SYSTEM
    5.
    发明申请
    SOUND RECOGNITION METHOD AND SYSTEM 有权
    声音识别方法和系统

    公开(公告)号:US20120226497A1

    公开(公告)日:2012-09-06

    申请号:US13371966

    申请日:2012-02-13

    IPC分类号: G10L15/00

    CPC分类号: G10L15/08

    摘要: A method for generating an anti-model of a sound class is disclosed. A plurality of candidate sound data is provided for generating the anti-model. A plurality of similarity values between the plurality of candidate sound data and a reference sound model of a sound class is determined. An anti-model of the sound class is generated based on at least one candidate sound data having the similarity value within a similarity threshold range.

    摘要翻译: 公开了一种用于产生声级的反模型的方法。 多个候选声音数据被提供用于产生反模型。 确定多个候选声音数据与声音类别的参考声音模型之间的多个相似度值。 基于具有相似性阈值范围内的相似度值的至少一个候选声音数据,生成声音类别的反模型。

    Sound recognition method and system
    6.
    发明授权
    Sound recognition method and system 有权
    声音识别方法和系统

    公开(公告)号:US09224388B2

    公开(公告)日:2015-12-29

    申请号:US13371966

    申请日:2012-02-13

    IPC分类号: G10L15/00 G10L15/08

    CPC分类号: G10L15/08

    摘要: A method for generating an anti-model of a sound class is disclosed. A plurality of candidate sound data is provided for generating the anti-model. A plurality of similarity values between the plurality of candidate sound data and a reference sound model of a sound class is determined. An anti-model of the sound class is generated based on at least one candidate sound data having the similarity value within a similarity threshold range.

    摘要翻译: 公开了一种用于产生声级的反模型的方法。 多个候选声音数据被提供用于产生反模型。 确定多个候选声音数据与声音类别的参考声音模型之间的多个相似度值。 基于具有相似性阈值范围内的相似度值的至少一个候选声音数据,生成声音类别的反模型。

    SYSTEM AND METHOD FOR PROVIDING CONFERENCE INFORMATION
    7.
    发明申请
    SYSTEM AND METHOD FOR PROVIDING CONFERENCE INFORMATION 审中-公开
    用于提供会议信息的系统和方法

    公开(公告)号:US20120142324A1

    公开(公告)日:2012-06-07

    申请号:US13289437

    申请日:2011-11-04

    IPC分类号: H04M3/42

    摘要: A method for providing information for a conference at one or more locations is disclosed. One or more mobile devices monitor one or more starting requirements of the conference and transmit input sound information to a server when the one or more starting requirements of the conference is detected. The one or more starting requirements may include a starting time of the conference, a location of the conference, and/or acoustic characteristics of a conference environment. The server generates conference information based on the input sound information from each mobile device and transmits the conference information to each mobile device. The conference information may include information on attendees, a current speaker among the attendees, an arrangement of the attendees, and/or a meeting log of attendee participation at the conference.

    摘要翻译: 公开了一种在一个或多个位置为会议提供信息的方法。 当检测到会议的一个或多个启动要求时,一个或多个移动设备监视会议的一个或多个启动要求并将输入声音信息发送到服务器。 一个或多个启动要求可以包括会议的开始时间,会议的位置和/或会议环境的声学特性。 服务器基于来自每个移动设备的输入声音信息生成会议信息,并将会议信息发送到每个移动设备。 会议信息可以包括参加者的信息,参加者中的现任演讲者,与会者的安排和/或参加者在会议中的会议记录。

    Camera OCR with context information
    8.
    发明授权
    Camera OCR with context information 有权
    相机OCR与上下文信息

    公开(公告)号:US09082035B2

    公开(公告)日:2015-07-14

    申请号:US13450016

    申请日:2012-04-18

    摘要: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.

    摘要翻译: 本发明的实施例描述了用于执行上下文敏感的OCR的方法和装置。 设备使用耦合到该设备的照相机来获得图像。 设备识别包括图形对象的图像的一部分。 设备推断与图像相关联的上下文,并且基于与图像相关联的上下文来选择一组图形对象。 使用图形对象组生成改进的OCR结果。 来自包括麦克风,GPS和相机在内的各种传感器的输入以及包括语音,触摸和用户使用模式在内的用户输入可以用于推断用户上下文并选择与所推断的上下文最相关的字典。

    Augmented reality with sound and geometric analysis
    9.
    发明授权
    Augmented reality with sound and geometric analysis 有权
    增强现实与声音和几何分析

    公开(公告)号:US09563265B2

    公开(公告)日:2017-02-07

    申请号:US13585927

    申请日:2012-08-15

    CPC分类号: G06F3/011 G06F3/167

    摘要: A method for responding in an augmented reality (AR) application of a mobile device to an external sound is disclosed. The mobile device detects a target. A virtual object is initiated in the AR application. Further, the external sound is received, by at least one sound sensor of the mobile device, from a sound source. Geometric information between the sound source and the target is determined, and at least one response for the virtual object to perform in the AR application is generated based on the geometric information.

    摘要翻译: 公开了一种用于在移动设备的增强现实(AR)应用中对外部声音进行响应的方法。 移动设备检测目标。 在AR应用程序中启动虚拟对象。 此外,通过移动设备的至少一个声音传感器从声源接收外部声音。 确定声源和目标之间的几何信息,并且基于几何信息生成在AR应用中要执行的虚拟对象的至少一个响应。

    CAMERA OCR WITH CONTEXT INFORMATION
    10.
    发明申请
    CAMERA OCR WITH CONTEXT INFORMATION 有权
    CAMERA OCR与上下文信息

    公开(公告)号:US20130108115A1

    公开(公告)日:2013-05-02

    申请号:US13450016

    申请日:2012-04-18

    IPC分类号: G06K9/20

    摘要: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.

    摘要翻译: 本发明的实施例描述了用于执行上下文敏感的OCR的方法和装置。 设备使用耦合到该设备的照相机来获得图像。 设备识别包括图形对象的图像的一部分。 设备推断与图像相关联的上下文,并且基于与图像相关联的上下文来选择一组图形对象。 使用图形对象组生成改进的OCR结果。 来自包括麦克风,GPS和相机在内的各种传感器的输入以及包括语音,触摸和用户使用模式在内的用户输入可以用于推断用户上下文并选择与所推断的上下文最相关的字典。