TECHNOLOGIES FOR ROBUST CRYING DETECTION USING TEMPORAL CHARACTERISTICS OF ACOUSTIC FEATURES
    1.
    发明申请
    TECHNOLOGIES FOR ROBUST CRYING DETECTION USING TEMPORAL CHARACTERISTICS OF ACOUSTIC FEATURES 审中-公开
    利用声学特征的时间特性实现强大的检测技术

    公开(公告)号:WO2017112261A1

    公开(公告)日:2017-06-29

    申请号:PCT/US2016/063347

    申请日:2016-11-22

    CPC classification number: G10L25/51 G10L15/16 G10L25/24 G10L25/27 G10L25/72

    Abstract: Technologies for identifying sounds are disclosed. A sound identification device may capture sound data, and split the sound data into frames. The sound identification device may then determine an acoustic feature vector for each frame, and determine parameters based on how each acoustic feature varies over the duration of time corresponding to the frames. The sound identification device may then determine if the sound matches a pre-defined sound based on the parameters. In one embodiment, the sound identification device may be a baby monitor, and the pre-defined sound may be a baby crying.

    Abstract translation: 公开了用于识别声音的技术。 声音识别设备可以捕捉声音数据,并将声音数据分成帧。 声音识别设备然后可以为每个帧确定声学特征向量,并且基于每个声学特征在对应于帧的时间段内如何变化来确定参数。 声音识别设备然后可以基于参数来确定声音是否匹配预定义的声音。 在一个实施例中,声音识别设备可以是婴儿监视器,并且预定义的声音可以是婴儿哭闹。

    LOW RESOURCE KEY PHRASE DETECTION FOR WAKE ON VOICE
    2.
    发明申请
    LOW RESOURCE KEY PHRASE DETECTION FOR WAKE ON VOICE 审中-公开
    语音上的低资源关键词相位检测

    公开(公告)号:WO2017091270A1

    公开(公告)日:2017-06-01

    申请号:PCT/US2016/049909

    申请日:2016-09-01

    Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include updating a start state based rejection model and a key phrase model based on scores of sub-phonetic units from an acoustic model to generate a rejection likelihood score and a key phrase likelihood score and determining whether received audio input is associated with a predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.

    Abstract translation: 讨论了与诸如唤醒语音之类的应用的关键短语检测有关的技术。 这样的技术可以包括基于来自声学模型的子语音单元的分数来更新基于开始状态的排斥模型和关键短语模型以生成排斥可能性分数和关键短语可能性分数,并且确定接收到的音频输入是否与 预定的关键短语基于拒绝似然分和关键短语似然分。

    DYNAMIC ENROLLMENT OF USER-DEFINED WAKE-UP KEY-PHRASE FOR SPEECH ENABLED COMPUTER SYSTEM

    公开(公告)号:WO2019133153A1

    公开(公告)日:2019-07-04

    申请号:PCT/US2018/061728

    申请日:2018-11-19

    Abstract: Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.

Patent Agency Ranking