-
Publication No.: US20130110521A1
Publication date: 2013-05-02
Application No.: US13483732
Filing date: 2012-05-30
Applicant: Kyu Woong Hwang, Kisun You, Minho Jin, Peter Jivan Shah, Kwokleung Chan, Taesu Kim
Inventor: Kyu Woong Hwang, Kisun You, Minho Jin, Peter Jivan Shah, Kwokleung Chan, Taesu Kim
IPC classification: G10L19/00
CPC classification: H04W52/028, G10L15/02, G10L15/28, G10L19/005, G10L19/008, G10L19/18, G10L2015/088, H04W52/02, Y02D70/00, Y02D70/23
Abstract: A particular method includes transitioning out of a low-power state at a processor. The method also includes retrieving audio feature data from a buffer after transitioning out of the low-power state. The audio feature data indicates features of audio data received during the low-power state of the processor.
-
Publication No.: US09992745B2
Publication date: 2018-06-05
Application No.: US13483732
Filing date: 2012-05-30
Applicant: Kyu Woong Hwang, Kisun You, Minho Jin, Peter Jivan Shah, Kwokleung Chan, Taesu Kim
Inventor: Kyu Woong Hwang, Kisun You, Minho Jin, Peter Jivan Shah, Kwokleung Chan, Taesu Kim
IPC classification: H04M1/00, G10L19/00, H04W52/02, G10L19/005, G10L19/008, G10L15/08, G10L15/02, G10L15/28, G10L19/18
CPC classification: H04W52/028, G10L15/02, G10L15/28, G10L19/005, G10L19/008, G10L19/18, G10L2015/088, H04W52/02, Y02D70/00, Y02D70/23
Abstract: A processor is configured to transition in and out of a low-power state at a first rate and to operate in a first mode or a second mode. In a particular method, the processor while coupled to a coder/decoder (CODEC) retrieves audio feature data from a buffer after transitioning out of the low-power state. The CODEC is configured to operate at a second rate in the first mode and at a third rate in the second mode, the second rate and the third rate each greater than the first rate. The audio feature data indicates features of audio data received during the low-power state of the processor. A ratio of CODEC activity to processor activity in the second mode is less than the ratio in the first mode.
-
Publication No.: US20120224706A1
Publication date: 2012-09-06
Application No.: US13285971
Filing date: 2011-10-31
Applicant: Kyu Woong Hwang, Taesu Kim, Kisun You
Inventor: Kyu Woong Hwang, Taesu Kim, Kisun You
IPC classification: H04R29/00
Abstract: A method for recognizing an environmental sound in a client device in cooperation with a server is disclosed. The client device includes a client database having a plurality of sound models of environmental sounds and a plurality of labels, each of which identifies at least one sound model. The client device receives an input environmental sound and generates an input sound model based on the input environmental sound. At the client device, a similarity value is determined between the input sound model and each of the sound models to identify one or more sound models from the client database that are similar to the input sound model. A label is selected from labels associated with the identified sound models, and the selected label is associated with the input environmental sound based on a confidence level of the selected label.
-
Publication No.: US09443511B2
Publication date: 2016-09-13
Application No.: US13285971
Filing date: 2011-10-31
Applicant: Kyu Woong Hwang, Taesu Kim, Kisun You
Inventor: Kyu Woong Hwang, Taesu Kim, Kisun You
Abstract: A method for recognizing an environmental sound in a client device in cooperation with a server is disclosed. The client device includes a client database having a plurality of sound models of environmental sounds and a plurality of labels, each of which identifies at least one sound model. The client device receives an input environmental sound and generates an input sound model based on the input environmental sound. At the client device, a similarity value is determined between the input sound model and each of the sound models to identify one or more sound models from the client database that are similar to the input sound model. A label is selected from labels associated with the identified sound models, and the selected label is associated with the input environmental sound based on a confidence level of the selected label.
-
Publication No.: US20120226497A1
Publication date: 2012-09-06
Application No.: US13371966
Filing date: 2012-02-13
Applicant: Kisun You, Kyu Woong Hwang, Taesu Kim
Inventor: Kisun You, Kyu Woong Hwang, Taesu Kim
IPC classification: G10L15/00
CPC classification: G10L15/08
Abstract: A method for generating an anti-model of a sound class is disclosed. A plurality of candidate sound data is provided for generating the anti-model. A plurality of similarity values between the plurality of candidate sound data and a reference sound model of a sound class is determined. An anti-model of the sound class is generated based on at least one candidate sound data having the similarity value within a similarity threshold range.
-
Publication No.: US09224388B2
Publication date: 2015-12-29
Application No.: US13371966
Filing date: 2012-02-13
Applicant: Kisun You, Kyu Woong Hwang, Taesu Kim
Inventor: Kisun You, Kyu Woong Hwang, Taesu Kim
CPC classification: G10L15/08
Abstract: A method for generating an anti-model of a sound class is disclosed. A plurality of candidate sound data is provided for generating the anti-model. A plurality of similarity values between the plurality of candidate sound data and a reference sound model of a sound class is determined. An anti-model of the sound class is generated based on at least one candidate sound data having the similarity value within a similarity threshold range.
-
Publication No.: US20120142324A1
Publication date: 2012-06-07
Application No.: US13289437
Filing date: 2011-11-04
Applicant: Taesu Kim, Kisun You, Kyu Woong Hwang, Te-Won Lee
Inventor: Taesu Kim, Kisun You, Kyu Woong Hwang, Te-Won Lee
IPC classification: H04M3/42
CPC classification: H04M3/568, H04L65/403, H04M2201/38, H04M2203/6054, H04M2203/6063, H04M2207/18, H04N7/15
Abstract: A method for providing information for a conference at one or more locations is disclosed. One or more mobile devices monitor one or more starting requirements of the conference and transmit input sound information to a server when the one or more starting requirements of the conference is detected. The one or more starting requirements may include a starting time of the conference, a location of the conference, and/or acoustic characteristics of a conference environment. The server generates conference information based on the input sound information from each mobile device and transmits the conference information to each mobile device. The conference information may include information on attendees, a current speaker among the attendees, an arrangement of the attendees, and/or a meeting log of attendee participation at the conference.
-
Publication No.: US09082035B2
Publication date: 2015-07-14
Application No.: US13450016
Filing date: 2012-04-18
Applicant: Kyuwoong Hwang, Te-Won Lee, Duck Hoon Kim, Kisun You, Minho Jin, Taesu Kim, Hyun-Mook Cho
Inventor: Kyuwoong Hwang, Te-Won Lee, Duck Hoon Kim, Kisun You, Minho Jin, Taesu Kim, Hyun-Mook Cho
CPC classification: G06K9/2054, G06K9/033, G06K9/723, G06K9/726, G06K2209/01, G06K2209/011, G06K2209/27
Abstract: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.
-
Publication No.: US09563265B2
Publication date: 2017-02-07
Application No.: US13585927
Filing date: 2012-08-15
Applicant: Kisun You, Taesu Kim, Kyuwoong Hwang, Minho Jin, Hyun-Mook Cho, Te-Won Lee
Inventor: Kisun You, Taesu Kim, Kyuwoong Hwang, Minho Jin, Hyun-Mook Cho, Te-Won Lee
Abstract: A method for responding in an augmented reality (AR) application of a mobile device to an external sound is disclosed. The mobile device detects a target. A virtual object is initiated in the AR application. Further, the external sound is received, by at least one sound sensor of the mobile device, from a sound source. Geometric information between the sound source and the target is determined, and at least one response for the virtual object to perform in the AR application is generated based on the geometric information.
-
Publication No.: US20130108115A1
Publication date: 2013-05-02
Application No.: US13450016
Filing date: 2012-04-18
Applicant: Kyuwoong HWANG, Te-Won Lee, Duck Hoon Kim, Kisun You, Minho Jin, Taesu Kim, Hyun-Mook Cho
Inventor: Kyuwoong HWANG, Te-Won Lee, Duck Hoon Kim, Kisun You, Minho Jin, Taesu Kim, Hyun-Mook Cho
IPC classification: G06K9/20
CPC classification: G06K9/2054, G06K9/033, G06K9/723, G06K9/726, G06K2209/01, G06K2209/011, G06K2209/27
Abstract: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.