Detection of music segment in audio signal

    Publication No.: US11037583B2

    Publication date: 2021-06-15

    Application No.: US16116042

    Filing date: 2018-08-29

    IPC classification: G10L25/81 G10L25/21

    Abstract: A technique for detecting a music segment in an audio signal is disclosed. A time window is set for each section in an audio signal. A maximum and a statistic of the audio signal within the time window are calculated. A density index is computed for the section using the maximum and the statistic. The density index is a measure of the statistic relative to the maximum. The section is estimated as a music segment based, at least in part, on a condition with respect to the density index.
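The density index described in this abstract can be illustrated with a minimal sketch. The abstract does not say which statistic, window size, or condition is used, so the mean absolute amplitude, the `half_window` parameter, and the `threshold` value below are all assumptions chosen for illustration. The function names are hypothetical as well.

```python
from statistics import mean


def density_index(samples, center, half_window):
    """Ratio of mean absolute amplitude to peak absolute amplitude in a window.

    Assumption: the 'statistic' is the mean of |x|; the abstract does not name one.
    """
    start = max(0, center - half_window)
    end = min(len(samples), center + half_window + 1)
    window = [abs(x) for x in samples[start:end]]
    peak = max(window, default=0.0)
    if peak == 0:
        return 0.0  # silent (or empty) window: no meaningful density
    return mean(window) / peak


def estimate_music_sections(samples, section_len, half_window, threshold=0.5):
    """Flag each section as music when its density index meets the threshold.

    Assumption: the 'condition' is a simple lower bound on the density index.
    """
    flags = []
    for start in range(0, len(samples), section_len):
        center = min(start + section_len // 2, len(samples) - 1)
        flags.append(density_index(samples, center, half_window) >= threshold)
    return flags
```

Under these assumptions, a section whose energy is spread evenly across the window scores close to 1, while a section with one isolated peak scores close to 0.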

    GENERATING PHONEMES OF LOAN WORDS USING TWO CONVERTERS

    Publication No.: US20190096388A1

    Publication date: 2019-03-28

    Application No.: US15717194

    Filing date: 2017-09-27

    Abstract: A technique for estimating phonemes for a word written in a different language is disclosed. A sequence of graphemes of a given word in a source language is received. The sequence of the graphemes in the source language is converted into a sequence of phonemes in the source language. One or more sequences of phonemes in a target language are generated from the sequence of the phonemes in the source language by using a neural network model. One sequence of phonemes in the target language is determined for the given word. Also, a technique for estimating graphemes of a word from phonemes in a different language is disclosed.

    ALTERNATIVE SOFT LABEL GENERATION
    Patent application

    Publication No.: US20220188622A1

    Publication date: 2022-06-16

    Application No.: US17118139

    Filing date: 2020-12-10

    Abstract: An approach to identifying alternate soft labels for training a student model may be provided. A teaching model may generate a soft label for labeled training data. The training data can be an acoustic file for speech or a spoken natural language. A pool of soft labels previously generated by teacher models can be searched at the label level to identify soft labels that are similar to the generated soft label. The similar soft labels can have similar length or sequence at the word, phoneme, and/or state level. The identified similar soft labels can be used in conjunction with the generated soft label to train a student model.
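The pool search described in this abstract might look roughly like the sketch below. It assumes each soft label is represented by its top-1 phoneme sequence and that similarity is judged by a length difference plus a `difflib` sequence-match ratio. The abstract specifies neither the representation nor the similarity measure, so the function name `find_similar_soft_labels` and the parameters `min_ratio` and `max_len_diff` are hypothetical.

```python
from difflib import SequenceMatcher


def find_similar_soft_labels(generated, pool, min_ratio=0.8, max_len_diff=1):
    """Return pool entries whose phoneme sequence resembles `generated`.

    Assumption: each entry is a list of phoneme symbols (top-1 labels only);
    real soft labels would also carry posterior probabilities.
    """
    matches = []
    for candidate in pool:
        # Length check: skip candidates whose length differs too much.
        if abs(len(candidate) - len(generated)) > max_len_diff:
            continue
        # Sequence check: ratio of matching symbols, in order.
        ratio = SequenceMatcher(None, generated, candidate).ratio()
        if ratio >= min_ratio:
            matches.append((ratio, candidate))
    # Most similar candidates first.
    matches.sort(key=lambda m: m[0], reverse=True)
    return [candidate for _, candidate in matches]
```

In this sketch the returned entries would be used alongside the generated soft label as additional training targets for the student model. How the two are combined is left open here, as it is in the abstract.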

    Adaptation of a trained neural network

    Publication No.: US11151449B2

    Publication date: 2021-10-19

    Application No.: US15878933

    Filing date: 2018-01-24

    IPC classification: G06N3/08 G06N5/02

    Abstract: A method, computer program product, and apparatus for adapting a trained neural network having one or more batch normalization layers are provided. The method includes adapting only the one or more batch normalization layers using adaptation data. The method also includes adapting the whole of the neural network having the one or more adapted batch normalization layers, using the adaptation data.