METHOD AND SYSTEM FOR EFFICIENT SPOKEN TERM DETECTION USING CONFUSION NETWORKS
    5.
    发明申请
    METHOD AND SYSTEM FOR EFFICIENT SPOKEN TERM DETECTION USING CONFUSION NETWORKS 有权
    使用混沌网络进行有效检测的方法和系统

    公开(公告)号:US20150279358A1

    公开(公告)日:2015-10-01

    申请号:US14230790

    申请日:2014-03-31

    IPC分类号: G10L15/08

    摘要: Systems and methods for spoken term detection are provided. A method for spoken term detection, comprises receiving phone level out-of-vocabulary (OOV) keyword queries, converting the phone level OOV keyword queries to words, generating a confusion network (CN) based keyword searching (KWS) index, and using the CN based KWS index for both in-vocabulary (IV) keyword queries and the OOV keyword queries.

    摘要翻译: 提供了用于词汇检测的系统和方法。 一种用于口语术语检测的方法,包括接收电话级词汇(OOV)关键字查询,将电话级OOV关键字查询转换为单词,生成基于混合网络(CN)的关键词搜索(KWS)索引,并使用 基于CN的KWS索引用于词汇(IV)关键词查询和OOV关键字查询。

    END TO END SPOKEN LANGUAGE UNDERSTANDING MODEL

    公开(公告)号:US20220319494A1

    公开(公告)日:2022-10-06

    申请号:US17218618

    申请日:2021-03-31

    IPC分类号: G10L15/06 G06K9/62 G10L13/02

    摘要: An approach to training an end-to-end spoken language understanding model may be provided. A pre-trained general automatic speech recognition model may be adapted to a domain specific spoken language understanding model. The pre-trained general automatic speech recognition model may be a recurrent neural network transducer model. The adaptation may provide transcription data annotated with spoken language understanding labels. Adaptation may include audio data may also be provided for in addition to verbatim transcripts annotated with spoken language understanding labels. The spoken language understanding labels may be entity and/or intent based with values associated with each label.

    MULTILINGUAL INTENT RECOGNITION
    9.
    发明申请

    公开(公告)号:US20220148581A1

    公开(公告)日:2022-05-12

    申请号:US17093673

    申请日:2020-11-10

    IPC分类号: G10L15/16 G06N3/04 G06K9/62

    摘要: Embodiments of the present invention provide computer implemented methods, computer program products and computer systems. For example, embodiments of the present invention can access one or more intents and associated entities from limited amount of speech to text training data in a single language. Embodiments of the present invention can locate speech to text training data in one or more other languages using the accessed one or more intents and associated entities to locate speech to text training data in the one or more other languages different than the single language. Embodiments of the present invention can then train a neural network based on the limited amount of speech to text training data in the single language and the located speech to text training data in the one or more other languages.