SPOKEN LANGUAGE RECOGNITION
    2.
    发明公开

    公开(公告)号:US20240257798A1

    公开(公告)日:2024-08-01

    申请号:US18104434

    申请日:2023-02-01

    Applicant: ADOBE INC.

    CPC classification number: G10L15/005 G10L25/30

    Abstract: Some aspects of the technology described herein employ a neural network with an efficient and lightweight architecture to perform spoken language recognition. Given an audio signal comprising speech, features are generated from the audio signal, for instance, by converting the audio signal to a normalized spectrogram. The features are input to the neural network, which has one or more convolutional layers and an output activation layer. Each neuron of the output activation layer corresponds to a language from a set of language and generates an activation value. Based on the activations values, an indication of zero or more languages from the set of languages is provided for the audio signal.

Patent Agency Ranking