CUSTOMIZABLE SPEECH RECOGNITION SYSTEM
    22.
    发明申请

    公开(公告)号:US20200327884A1

    公开(公告)日:2020-10-15

    申请号:US16383312

    申请日:2019-04-12

    Applicant: ADOBE INC.

    Abstract: Methods and systems are provided for generating a customized speech recognition neural network system comprised of an adapted automatic speech recognition neural network and an adapted language model neural network. The automatic speech recognition neural network is first trained in a generic domain and then adapted to a target domain. The language model neural network is first trained in a generic domain and then adapted to a target domain. Such a customized speech recognition neural network system can be used to understand input vocal commands.

    Dialog System Training using a Simulated User System

    公开(公告)号:US20200160842A1

    公开(公告)日:2020-05-21

    申请号:US16198302

    申请日:2018-11-21

    Applicant: Adobe Inc.

    Abstract: Dialog system training techniques using a simulated user system are described. In one example, a simulated user system supports multiple agents. The dialog system, for instance, may be configured for use with an application (e.g., digital image editing application). The simulated user system may therefore simulate user actions involving both the application and the dialog system which may be used to train the dialog system. Additionally, the simulated user system is not limited to simulation of user interactions by a single input mode (e.g., natural language inputs), but also supports multimodal inputs. Further, the simulated user system may also support use of multiple goals within a single dialog session

    Domain-specific speech recognizers in a digital medium environment

    公开(公告)号:US10586528B2

    公开(公告)日:2020-03-10

    申请号:US15423429

    申请日:2017-02-02

    Applicant: Adobe Inc.

    Abstract: Domain-specific speech recognizer generation with crowd sourcing is described. The domain-specific speech recognizers are generated for voice user interfaces (VUIs) configured to replace or supplement application interfaces. In accordance with the described techniques, the speech recognizers are generated for a respective such application interface and are domain-specific because they are each generated based on language data that corresponds to the respective application interface. This domain-specific language data is used to build a domain-specific language model. The domain-specific language data is also used to collect acoustic data for building an acoustic model. In particular, the domain-specific language data is used to generate user interfaces that prompt crowd-sourcing participants to say selected words represented by the language data for recording. The recordings of these selected words are then used to build the acoustic model. The domain-specific speech recognizers are generated by combining a respective domain-specific language model and crowd-sourced acoustic model.

    Multitask Machine-Learning Model Training and Training Data Augmentation

    公开(公告)号:US20230419164A1

    公开(公告)日:2023-12-28

    申请号:US17846428

    申请日:2022-06-22

    Applicant: Adobe Inc.

    CPC classification number: G06N20/00

    Abstract: Multitask machine-learning model training and training data augmentation techniques are described. In one example, training is performed for multiple tasks simultaneously as part of training a multitask machine-learning model using question pairs. Examples of the multiple tasks include question summarization and recognizing question entailment. Further, a loss function is described that incorporates a parameter sharing loss that is configured to adjust an amount that parameters are shared between corresponding layers trained for the first and second tasks, respectively. In an implementation, training data augmentation techniques are also employed by synthesizing question pairs, automatically and without user intervention, to improve accuracy in model training.

Patent Agency Ranking