Dynamic image recognition model updates

    公开(公告)号:US10115185B2

    公开(公告)日:2018-10-30

    申请号:US14561353

    申请日:2014-12-05

    Abstract: A method includes receiving first image data at an electronic device, and performing a first image recognition operation on the first image data based on a first image recognition model stored in a memory of the electronic device. The method may include sending an image recognition model update request from the electronic device to a server, in response to determining that a result of the first image recognition operation fails to satisfy a confidence threshold. The method includes receiving image recognition model update information from the server and updating the first image recognition model based on the image recognition model update information to generate a second image recognition model. The method further includes performing a second image recognition operation based on the second image recognition model.

    DYNAMIC IMAGE RECOGNITION MODEL UPDATES

    公开(公告)号:US20220020127A1

    公开(公告)日:2022-01-20

    申请号:US17490709

    申请日:2021-09-30

    Abstract: Aspects of the subject disclosure may include, for example, receiving first image data for a first image from an electronic device, the first image data including first information associated with a first object identified by the electronic device as being in the first image. Second image data for a second image is received from the electronic device subsequent to receiving the first image data. The second image data includes second information associated with a second object identified by the electronic device as being in the second image. Location information is determined based on the first information and the second information and sent to the electronic device. Other embodiments are disclosed.

    Dynamic image recognition model updates

    公开(公告)号:US11164294B2

    公开(公告)日:2021-11-02

    申请号:US16132802

    申请日:2018-09-17

    Abstract: A method includes capturing a first image via a camera of an electronic device using a first set of image capture parameters. The method includes identifying a first object in the first image based on an object recognition model. The method includes sending first image data to a server. The first image data includes first information associated with the first object. The method includes capturing a second image via the camera using a second set of image capture parameters. The method includes identifying a second object in the second image based on the object recognition model. The method includes sending second image data to the server. The second image data includes second information associated with the second object. The method also includes receiving location information from the server. The location information is determined by the server based on the first information and the second information.

    System and method of providing speech processing in user interface

    公开(公告)号:US09530415B2

    公开(公告)日:2016-12-27

    申请号:US14928193

    申请日:2015-10-30

    CPC classification number: G10L15/26 G06F3/0416 G06F3/162 G10L15/22 G10L15/30

    Abstract: Disclosed are systems, methods and computer-readable media for enabling speech processing in a user interface of a device. The method includes receiving an indication of a field and a user interface of a device, the indication also signaling that speech will follow, receiving the speech from the user at the device, the speech being associated with the field, transmitting the speech as a request to public, common network node that receives and processes speech, processing the transmitted speech and returning text associated with the speech to the device and inserting the text into the field. Upon a second indication from the user, the system processes the text in the field as programmed by the user interface. The present disclosure provides a speech mash up application for a user interface of a mobile or desktop device that does not require expensive speech processing technologies.

    System and method for distributed voice models across cloud and device for embedded text-to-speech
    15.
    发明授权
    System and method for distributed voice models across cloud and device for embedded text-to-speech 有权
    跨云的分布式语音模型和嵌入式文本到语音的设备的系统和方法

    公开(公告)号:US09218804B2

    公开(公告)日:2015-12-22

    申请号:US14025344

    申请日:2013-09-12

    CPC classification number: G10L13/04 G10L13/047 G10L13/07

    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify a speech synthesis context, and determine, based on a local cache of text-to-speech units for a text-to-speech voice and based on the speech synthesis context, additional text-to-speech units which are not in the local cache. The system can request from a server the additional text-to-speech units, and store the additional text-to-speech units in the local cache. The system can then synthesize speech using the text-to-speech units and the additional text-to-speech units in the local cache. The system can prune the cache as the context changes, based on availability of local storage, or after synthesizing the speech. The local cache can store a core set of text-to-speech units associated with the text-to-speech voice that cannot be pruned from the local cache.

    Abstract translation: 用于智能缓存用于语音合成的级联语音单元的系统,方法和计算机可读存储介质。 配置为实施该方法的系统可以识别语音合成上下文,并且基于用于文本到语音语音的文本到语音单元的本地高速缓存并且基于语音合成上下文来确定附加的文本 - 不在本地缓存中的语音单元。 系统可以从服务器请求附加的文本到语音单元,并将附加的文本到语音单元存储在本地高速缓存中。 然后,系统可以使用本地高速缓存中的文本到语音单元和附加的文本到语音单元来合成语音。 系统可以根据本地存储的可用性,或合成语音之后随着上下文的变化修剪缓存。 本地缓存可以存储与文本到语音语音相关联的文本到语音单元的核心集合,其不能从本地高速缓存中修剪。

    SYSTEM AND METHOD FOR DISTRIBUTED VOICE MODELS ACROSS CLOUD AND DEVICE FOR EMBEDDED TEXT-TO-SPEECH
    16.
    发明申请
    SYSTEM AND METHOD FOR DISTRIBUTED VOICE MODELS ACROSS CLOUD AND DEVICE FOR EMBEDDED TEXT-TO-SPEECH 有权
    用于分布式语音模型的系统和方法用于嵌入式文本到语音的云和设备

    公开(公告)号:US20150073805A1

    公开(公告)日:2015-03-12

    申请号:US14025344

    申请日:2013-09-12

    CPC classification number: G10L13/04 G10L13/047 G10L13/07

    Abstract: Disclosed herein are systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify a speech synthesis context, and determine, based on a local cache of text-to-speech units for a text-to-speech voice and based on the speech synthesis context, additional text-to-speech units which are not in the local cache. The system can request from a server the additional text-to-speech units, and store the additional text-to-speech units in the local cache. The system can then synthesize speech using the text-to-speech units and the additional text-to-speech units in the local cache. The system can prune the cache as the context changes, based on availability of local storage, or after synthesizing the speech. The local cache can store a core set of text-to-speech units associated with the text-to-speech voice that cannot be pruned from the local cache.

    Abstract translation: 本文公开了用于智能缓存用于语音合成中的级联语音单元的系统,方法和计算机可读存储介质。 配置为实施该方法的系统可以识别语音合成上下文,并且基于用于文本到语音语音的文本到语音单元的本地高速缓存并且基于语音合成上下文来确定附加的文本 - 不在本地缓存中的语音单元。 系统可以从服务器请求附加的文本到语音单元,并将附加的文本到语音单元存储在本地高速缓存中。 然后,系统可以使用本地高速缓存中的文本到语音单元和附加的文本到语音单元来合成语音。 系统可以根据本地存储的可用性,或合成语音之后随着上下文的变化修剪缓存。 本地缓存可以存储与文本到语音语音相关联的文本到语音单元的核心集合,其不能从本地高速缓存中修剪。

Patent Agency Ranking