NEURAL SPEECH-TO-MEANING TRANSLATION
    23.
    发明公开

    公开(公告)号:EP3832644A1

    公开(公告)日:2021-06-09

    申请号:EP20211702.4

    申请日:2020-12-03

    申请人: SoundHound, Inc.

    IPC分类号: G10L15/16 G10L15/18 G06F40/40

    摘要: A neural speech-to-meaning system is trained on speech audio expressing specific intents. The system receives speech audio and produces indications of when the speech in the audio matches the intent. Intents may include variables that can have a large range of values, such as the names of places. The neural speech-to-meaning system simultaneously recognizes enumerated values of variables and general intents. Recognized variable values can serve as arguments to API requests made in response to recognized intents. Accordingly, neural speech-to-meaning supports voice virtual assistants that serve users based on API hits.

    A SPEECH PROCESSING SYSTEM AND A METHOD OF PROCESSING A SPEECH SIGNAL

    公开(公告)号:EP4266306A1

    公开(公告)日:2023-10-25

    申请号:EP22201970.5

    申请日:2022-10-17

    摘要: A computer implemented speech processing method for generating translated speech comprising:
    receiving a first speech signal corresponding to speech spoken in a second language;
    generating first text data from the first speech signal, the first text data corresponding to text in the second language;
    generating second text data from the first text data, the second text data corresponding to text in a first language;
    responsive to obtaining a second speech signal corresponding to the second text spoken in the first language and in a second voice:
    extracting first acoustic data from the second speech signal;
    modifying the first acoustic data based on one or more acoustic data characteristics corresponding to a first voice; and
    generating an output speech signal using a text to speech synthesis model taking the second text data as input and using the modified first acoustic data, the output speech signal corresponding to the second text spoken in the first language.