JOINT HETEROGENEOUS LANGUAGE-VISION EMBEDDINGS FOR VIDEO TAGGING AND SEARCH

    公开(公告)号:US20170357720A1

    公开(公告)日:2017-12-14

    申请号:US15620232

    申请日:2017-06-12

    IPC分类号: G06F17/30 G06K9/00 G06N3/08

    摘要: Systems, methods and articles of manufacture for modeling a joint language-visual space. A textual query to be evaluated relative to a video library is received from a requesting entity. The video library contains a plurality of instances of video content. One or more instances of video content from the video library that correspond to the textual query are determined, by analyzing the textual query using a data model that includes a soft-attention neural network module that is jointly trained with a language Long Short-term Memory (LSTM) neural network module and a video LSTM neural network module. At least an indication of the one or more instances of video content is returned to the requesting entity.

    INTERACTION PROVIDING METHOD FOR DELETING QUERY

    公开(公告)号:US20170255640A1

    公开(公告)日:2017-09-07

    申请号:US15433611

    申请日:2017-02-15

    申请人: NAVER Corporation

    IPC分类号: G06F17/30 G06F3/16 G06F3/0488

    摘要: Disclosed is an interaction providing method implemented by a computer, for deleting a query input into a user terminal. The interaction providing method includes: receiving the query from the user terminal; reading the query in response to the receiving of the query input; providing the query and a query result concerning the query to the user terminal in response to the reading of the query such that the query and the query result are output; receiving an input of a swipe command over the query output to the user terminal; and deleting the query and the query result in response to the receiving of the input of the swipe command over the query.

    Systems and Methods for Discovering Artists
    7.
    发明申请
    Systems and Methods for Discovering Artists 审中-公开
    发现艺术家的系统与方法

    公开(公告)号:US20170046724A1

    公开(公告)日:2017-02-16

    申请号:US15335011

    申请日:2016-10-26

    IPC分类号: G06Q30/02 G06F7/02

    摘要: A musician discovery system is provided. The musician discovery system includes a first interface for displaying a plurality of musicians organized according to a musical characteristic. The system includes a second interface for presenting multimedia information about a first musician from the plurality of musicians displayed on the first interface. The system includes means for comparing a second plurality of musicians with the first musician using the multimedia information presented on the second interface about the first musician. Furthermore, the system includes a third interface for recommending a second musician from the second plurality of musicians based on the comparing means.

    摘要翻译: 提供音乐人发现系统。 音乐家发现系统包括用于显示根据音乐特征组织的多个音乐家的第一界面。 该系统包括用于从显示在第一界面上的多个音乐家呈现关于第一音乐人的多媒体信息的第二界面。 该系统包括用于使用在关于第一音乐家的第二界面上呈现的多媒体信息来比较第二多个音乐家与第一音乐家的装置。 此外,系统包括第三界面,用于基于比较装置来推荐来自第二多个音乐家的第二音乐家。

    CORRECTING VOICE RECOGNITION USING SELECTIVE RE-SPEAK
    8.
    发明申请
    CORRECTING VOICE RECOGNITION USING SELECTIVE RE-SPEAK 审中-公开
    使用选择性重播来纠正语音识别

    公开(公告)号:US20160322049A1

    公开(公告)日:2016-11-03

    申请号:US15140891

    申请日:2016-04-28

    申请人: Google Inc.

    摘要: Implementations of the present disclosure include actions of providing first text for display on a computing device of a user, the first text being provided from a first speech recognition engine based on first speech received from the computing device, and being displayed as a search query, receiving a speech correction indication from the computing device, the speech correction indication indicating a portion of the first text that is to be corrected, receiving second speech from the computing device, receiving second text from a second speech recognition engine based on the second speech, the second speech recognition engine being different from the first speech recognition engine, replacing the portion of the first text with the second text to provide a combined text, and providing the combined text for display on the computing device as a revised search query.

    摘要翻译: 本公开的实现包括提供用于在用户的计算设备上显示的第一文本的动作,第一文本基于从计算设备接收的第一语音从第一语音识别引擎提供,并且被显示为搜索查询, 从所述计算设备接收语音校正指示,所述语音校正指示指示要被校正的第一文本的一部分,从所述计算设备接收第二语音,基于所述第二语音接收来自第二语音识别引擎的第二文本, 所述第二语音识别引擎与所述第一语音识别引擎不同,用所述第二文本替换所述第一文本的所述部分以提供组合文本,以及提供所述组合文本以在所述计算设备上显示为修改的搜索查询。

    DATA COMMUNICATION WITH ACOUSTIC SIGNAL COMMUNICATION
    9.
    发明申请
    DATA COMMUNICATION WITH ACOUSTIC SIGNAL COMMUNICATION 审中-公开
    数字通信与声音信号通信

    公开(公告)号:US20160182172A1

    公开(公告)日:2016-06-23

    申请号:US14977502

    申请日:2015-12-21

    申请人: Daniel SEEMILLER

    发明人: Daniel SEEMILLER

    摘要: A composite signal having frequencies within a sonic first frequency bandwidth may be received from a communication media on a receiver. The composite signal may include an audio base signal and at least one code signal. The code signal may be encoded with a code, may have a duration shorter than a duration of the base signal, and may have a second frequency bandwidth within the first frequency bandwidth. The composite signal may be output on a speaker, the speaker converting the composite signal into sound. While outputting the composite signal, a signal processing device may detect the output sound corresponding to the code signal. The code may be determined from the detected output sound corresponding to the code signal. Data associated with the code may be retrieved from a data storage device. The retrieved data may be displayed on a display device.

    摘要翻译: 可以从接收机上的通信介质接收具有声波第一频率带宽内的频率的复合信号。 复合信号可以包括音频基础信号和至少一个代码信号。 代码信号可以用代码编码,可以具有比基本信号的持续时间短的持续时间,并且可以在第一频率带宽内具有第二频率带宽。 复合信号可以在扬声器上输出,扬声器将复合信号转换成声音。 在输出复合信号的同时,信号处理装置可以检测对应于代码信号的输出声音。 可以根据与代码信号相对应的检测到的输出声音来确定代码。 可以从数据存储设备检索与代码相关联的数据。 所检索的数据可以显示在显示装置上。

    Visually-Represented Results To Search Queries In Rich Media Content
    10.
    发明申请
    Visually-Represented Results To Search Queries In Rich Media Content 审中-公开
    在Rich Media内容中搜索查询的视觉表示结果

    公开(公告)号:US20140324907A1

    公开(公告)日:2014-10-30

    申请号:US14266738

    申请日:2014-04-30

    申请人: AOL Inc.

    发明人: Rakesh Agrawal

    IPC分类号: G06F17/30

    摘要: When executed, a computer program product generates a graphical user interface that renders results that are responsive to a search query of a rich media file. The graphical user interface includes a chronological representation of the rich media file, one or more occurrence markers along the chronological representation corresponding to actual occurrences of a desired term at an indicated chronological location in the rich media file, and an execution icon configured to launch a rich media application that renders a relevant portion that is responsive to the search query.

    摘要翻译: 当执行时,计算机程序产品生成图形用户界面,其呈现响应于富媒体文件的搜索查询的结果。 图形用户界面包括富媒体文件的时间顺序表示,沿着时间表示的一个或多个出现标记,其对应于在富媒体文件中指示的时间顺序位置处的期望术语的实际出现,以及被配置为发送 富媒体应用程序,呈现响应搜索查询的相关部分。