SOUND SEARCH
    10.
    发明公开
    SOUND SEARCH 审中-公开

    公开(公告)号:US20240134908A1

    公开(公告)日:2024-04-25

    申请号:US18326261

    申请日:2023-05-30

    CPC classification number: G06F16/685 G06F16/632 G06F16/638 G06F16/686

    Abstract: A device includes one or more processors configured to generate one or more query caption embeddings based on a query. The processor(s) are further configured to select one or more caption embeddings from among a set of embeddings associated with a set of media files of a file repository. Each caption embedding represents a corresponding sound caption, and each sound caption includes a natural-language text description of a sound. The caption embedding(s) are selected based on a similarity metric indicative of similarity between the caption embedding(s) and the query caption embedding(s). The processor(s) are further configured to generate search results identifying one or more first media files of the set of media files. Each of the first media file(s) is associated with at least one of the caption embedding(s).

Patent Agency Ranking