MEDIA SEGMENT REPRESENTATION USING FIXED WEIGHTS

    公开(公告)号:US20240127809A1

    公开(公告)日:2024-04-18

    申请号:US18047562

    申请日:2022-10-18

    摘要: A device includes a memory configured to store a collection of sets of weights, each of the sets of weights representing a respective media segment. The device also includes one or more processors configured to generate data representing the detected first input speech segment and to pass the data representing the detected first input speech segment into a collection of memory units. Each memory unit of the collection of memory units includes a set of weights from the collection of sets of weights. The one or more processors are also configured to generate a first estimate of an associated media segment that represents the detected first input speech segment. The associated media segment corresponds to a first memory unit in the collection of memory units.

    MATCHING AUDIO USING MACHINE LEARNING BASED AUDIO REPRESENTATIONS

    公开(公告)号:US20240127827A1

    公开(公告)日:2024-04-18

    申请号:US18047565

    申请日:2022-10-18

    IPC分类号: G10L19/00 H04L65/70

    CPC分类号: G10L19/00 H04L65/70

    摘要: Systems and techniques are described herein for encoding and/or decoding audio information. For example, a process can process an input audio segment to generate a representation of the input audio segment, and can compare the representation of the input audio segment to representations stored in a memory. The representations represent a plurality of audio segments. The process can determine, based on the comparison, target representation(s) of target audio segment(s) from the representations stored in the memory. The process can determine one or more indices associated with the target audio segment(s). The process can then packetize the one or more indices and transmit the one or more packetized indices (e.g., to a decoder configured to decode the packetized indices).