Patent search: cpc:"G06F16/638", page 1

1.

Invention application
AUDIO REQUEST INTERACTION SYSTEM (pending, published)

Publication No.: US20190107997A1

Publication date: 2019-04-11

Application No.: US16216626

Filing date: 2018-12-11

Applicant: NCR Corporation

Inventors: Michael Cain Finley, Michael Dudgeon, John Wade, James Lee Fortuna

IPC classification: G06F3/16, H04R1/08, G06F16/63, G06Q30/06, H04L29/08, H04H60/63, H04H60/37, H04H20/88, H04H20/71, G10L25/48

CPC classification: G06F3/167, G06F16/63, G06F16/634, G06F16/638, G06F16/639, G06F16/683, G06F16/685, G06Q30/0603, G06Q30/0633, G10L25/48, H04H20/71, H04H20/88, H04H60/372, H04H60/375, H04H60/58, H04H60/63, H04H60/88, H04L67/10, H04R1/08

Abstract: A person can use a portable electronic device to electronically purchase or otherwise request a product, service, or other deliverable related to the audio programming they are listening to when they initiate the request. The request is fulfilled by a service that analyzes the audio content to identify the deliverable the person desires.

2.

Invention application
INTERFACE INTELLIGENT INTERACTION CONTROL METHOD, APPARATUS AND SYSTEM, AND STORAGE MEDIUM (pending, published)

Publication No.: US20190066682A1

Publication date: 2019-02-28

Application No.: US16114693

Filing date: 2018-08-28

Applicant: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJNG) CO., LTD.

Inventors: Gaofei CHENG, Xiangtao JIANG, Ben XU, Linxin OU, Qin XIONG

IPC classification: G10L15/22, G06F17/30, G10L15/02

CPC classification: G10L15/22, G06F16/638, G10L15/02, G10L15/26, G10L2015/221, G10L2015/225, G10L2015/228

Abstract: The present disclosure provides an interface intelligent interaction control method, apparatus and system, and a storage medium, wherein the method comprises: receiving user-input speech information and obtaining a speech recognition result; determining scenario elements associated with the speech recognition result; generating an entry corresponding to each scenario element and sending the speech recognition result and the entries to a cloud server; receiving the entry that the cloud server selected from the received entries as best matching the speech recognition result; and performing an interface operation corresponding to the best-matched entry. The solution of the present disclosure can be applied to improve the flexibility and accuracy of speech control.
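
As a rough illustration of the flow this abstract describes, the sketch below generates one entry per scenario element, asks a stand-in "cloud server" for the best match, and runs the matching interface operation. The abstract does not say how the server scores entries; the string-similarity scoring here (Python's `difflib.SequenceMatcher`) is an assumption, and the names `cloud_best_match` and `handle_speech` are hypothetical.

```python
from difflib import SequenceMatcher


def cloud_best_match(recognized, entries):
    """Stand-in for the cloud server: return the entry most similar to the text.

    Assumption: plain character-level similarity; the patent does not specify
    the matching method.
    """
    return max(
        entries,
        key=lambda e: SequenceMatcher(None, recognized.lower(), e.lower()).ratio(),
    )


def handle_speech(recognized, scenario_elements):
    """Run the interface operation whose entry best matches the recognized text.

    scenario_elements maps an entry label to a zero-argument callable that
    performs the corresponding interface operation.
    """
    entries = list(scenario_elements)                # one entry per scenario element
    best = cloud_best_match(recognized, entries)     # "sent" to the server stand-in
    return scenario_elements[best]()                 # perform the matched operation


# Hypothetical on-screen elements for a music player interface.
elements = {
    "play": lambda: "playing",
    "pause": lambda: "paused",
    "next": lambda: "skipped",
}
result = handle_speech("next song", elements)
```

Restricting the candidates to entries built from what is currently on screen is what lets a loose utterance such as "next song" resolve to a concrete control.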

3.

Invention application
AUDIO FILE PROCESSING TO REDUCE LATENCIES IN PLAY START TIMES FOR CLOUD SERVED AUDIO FILES (pending, published)

Publication No.: US20190026069A1

Publication date: 2019-01-24

Application No.: US16140116

Filing date: 2018-09-24

Applicant: Google LLC

Inventor: Neel B. Parekh

IPC classification: G06F3/16, G06F17/30, H04N21/845

CPC classification: G06F3/165, G06F16/632, G06F16/638, G11B27/034, G11B27/105, G11B2020/10768, H04N21/4331, H04N21/47202, H04N21/8106, H04N21/8456

Abstract: Methods, systems, and computer programs are presented for managing audio files of a user to reduce latencies in play start times on local devices. The audio files are stored on cloud storage managed by a server. One method includes processing a plurality of audio files associated with a user, where the processing is configured to create audio snippet files from each of the plurality of audio files. The audio snippet files represent a beginning part of each of the plurality of audio files. The method also includes transmitting the audio snippet files to a client device and detecting a request from the client to begin playing a first audio file from the plurality of audio files of the user. The first audio file is stored on the cloud storage managed by the server.
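
The snippet idea in this abstract can be sketched as follows: the server cuts the beginning of every file into a snippet, the client caches the snippets, and playback starts from the local snippet. Fetching the remainder from the server after the snippet is an assumption here (the abstract stops at detecting the play request), and the names `make_snippets`, `fetch_rest`, and `play`, as well as the byte-based snippet size, are hypothetical.

```python
# Simulated cloud storage: file name -> full audio bytes (toy data).
CLOUD_LIBRARY = {"a.mp3": b"0123456789", "b.mp3": b"abcdefghij"}


def make_snippets(library, size=4):
    """Server side: cut the beginning of every audio file into a snippet.

    Assumption: snippets are sized in bytes; a real system would more likely
    size them by playback duration.
    """
    return {name: data[:size] for name, data in library.items()}


def fetch_rest(name, offset, chunk=3):
    """Simulated server stream of the file from `offset` onward, in chunks."""
    data = CLOUD_LIBRARY[name]
    for start in range(offset, len(data), chunk):
        yield data[start:start + chunk]


def play(name, local_snippets):
    """Client side: yield the cached snippet at once, then the streamed rest."""
    head = local_snippets[name]
    yield head                                   # playback can begin with no network wait
    yield from fetch_rest(name, len(head))       # remainder arrives while the head plays


snippets = make_snippets(CLOUD_LIBRARY)          # transmitted to the client ahead of time
played = b"".join(play("a.mp3", snippets))
```

Because the first bytes are already on the device, the network round trip overlaps with playback of the snippet instead of delaying the start.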

4.

Invention application
JOINT HETEROGENEOUS LANGUAGE-VISION EMBEDDINGS FOR VIDEO TAGGING AND SEARCH (pending, published)

Publication No.: US20170357720A1

Publication date: 2017-12-14

Application No.: US15620232

Filing date: 2017-06-12

Applicant: Disney Enterprises, Inc.

Inventors: Atousa TORABI, Leonid SIGAL

IPC classification: G06F17/30, G06K9/00, G06N3/08

CPC classification: G06F16/638, G06K9/00718, G06N3/0445, G06N3/0454, G06N3/08, G06N3/084

Abstract: Systems, methods and articles of manufacture for modeling a joint language-visual space. A textual query to be evaluated relative to a video library is received from a requesting entity. The video library contains a plurality of instances of video content. One or more instances of video content from the video library that correspond to the textual query are determined by analyzing the textual query using a data model that includes a soft-attention neural network module that is jointly trained with a language Long Short-Term Memory (LSTM) neural network module and a video LSTM neural network module. At least an indication of the one or more instances of video content is returned to the requesting entity.

5.

Invention application
SPEECH RECOGNITION SYSTEMS AND METHODS USING RELATIVE AND ABSOLUTE SLOT DATA (pending, published)

Publication No.: US20170316783A1

Publication date: 2017-11-02

Application No.: US15141596

Filing date: 2016-04-28

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: RON M. HECHT, ARIEL TELPAZ, YAEL SHMUELI FRIEDLAND, ELI TZIRKEL-HANCOCK

IPC classification: G10L17/22, G10L15/18, G06F17/30, H04M1/60

CPC classification: G10L17/22, G06F16/637, G06F16/638, G10L15/1815, G10L15/22, G10L2015/226, H04M1/6041, H04M1/6075, H04M2250/74

Abstract: Methods and systems are provided for managing speech of a speech system. In one embodiment, a method includes: receiving, by a processor, relative information comprising graph data from at least one relative data datasource; processing, by a processor, the graph data of the relative information to determine at least one of an association and a relationship associated with an element defined in the speech system; and storing, by a processor, the at least one of association and relationship as relative slot data for use by at least one of a speech recognition method and a dialog management method.

6.

Invention application
INTERACTION PROVIDING METHOD FOR DELETING QUERY (pending, published)

Publication No.: US20170255640A1

Publication date: 2017-09-07

Application No.: US15433611

Filing date: 2017-02-15

Applicant: NAVER Corporation

Inventors: Hyeeun Noh, Hyo Jung Kim

IPC classification: G06F17/30, G06F3/16, G06F3/0488

CPC classification: G06F16/162, G06F3/0481, G06F3/0488, G06F3/04883, G06F3/167, G06F16/338, G06F16/638, G06Q50/01

Abstract: Disclosed is an interaction providing method, implemented by a computer, for deleting a query input into a user terminal. The interaction providing method includes: receiving the query from the user terminal; reading the query in response to the receiving of the query input; providing the query and a query result concerning the query to the user terminal in response to the reading of the query such that the query and the query result are output; receiving an input of a swipe command over the query output to the user terminal; and deleting the query and the query result in response to the receiving of the input of the swipe command over the query.

7.

Invention application
Systems and Methods for Discovering Artists (pending, published)

Publication No.: US20170046724A1

Publication date: 2017-02-16

Application No.: US15335011

Filing date: 2016-10-26

Applicant: Viacom International, Inc.

Inventors: Peter KAY, Mark MEZRICH, Daniel SHEARER, Ryan SHAFER

IPC classification: G06Q30/02, G06F7/02

CPC classification: G06Q30/0201, G06F7/026, G06F16/635, G06F16/638, G06F16/68, G06F16/683, G06F16/686, G06Q30/0241, G06Q30/0282

Abstract: A musician discovery system is provided. The musician discovery system includes a first interface for displaying a plurality of musicians organized according to a musical characteristic. The system includes a second interface for presenting multimedia information about a first musician from the plurality of musicians displayed on the first interface. The system includes means for comparing a second plurality of musicians with the first musician using the multimedia information presented on the second interface about the first musician. Furthermore, the system includes a third interface for recommending a second musician from the second plurality of musicians based on the comparing means.

8.

Invention application
CORRECTING VOICE RECOGNITION USING SELECTIVE RE-SPEAK (pending, published)

Publication No.: US20160322049A1

Publication date: 2016-11-03

Application No.: US15140891

Filing date: 2016-04-28

Applicant: Google Inc.

Inventors: Dhruv Bakshi, Zaheed Sabur, Tilke Mary Judd, Nicholas G. Fey

IPC classification: G10L15/22, G06F17/30, G06F17/27, G10L15/26, G10L15/32

CPC classification: G10L15/22, G06F16/3329, G06F16/632, G06F16/638, G06F16/685, G06F17/273, G10L15/26, G10L15/32, G10L2015/223

Abstract: Implementations of the present disclosure include actions of: providing first text for display on a computing device of a user, the first text being provided from a first speech recognition engine based on first speech received from the computing device, and being displayed as a search query; receiving a speech correction indication from the computing device, the speech correction indication indicating a portion of the first text that is to be corrected; receiving second speech from the computing device; receiving second text from a second speech recognition engine based on the second speech, the second speech recognition engine being different from the first speech recognition engine; replacing the portion of the first text with the second text to provide a combined text; and providing the combined text for display on the computing device as a revised search query.

9.

Invention application
DATA COMMUNICATION WITH ACOUSTIC SIGNAL COMMUNICATION (pending, published)

Publication No.: US20160182172A1

Publication date: 2016-06-23

Application No.: US14977502

Filing date: 2015-12-21

Applicant: Daniel SEEMILLER

Inventor: Daniel SEEMILLER

IPC classification: H04H20/93, G10L25/54, G06F17/30, H04H60/64, H04H60/58, G10L19/018, H04H60/73

CPC classification: H04H20/93, G06F16/632, G06F16/638, G10L19/018, G10L25/54, H04H60/13, H04H60/33, H04H60/58, H04H60/64, H04H60/73, H04H2201/13, H04H2201/37

Abstract: A composite signal having frequencies within a sonic first frequency bandwidth may be received from a communication medium on a receiver. The composite signal may include an audio base signal and at least one code signal. The code signal may be encoded with a code, may have a duration shorter than a duration of the base signal, and may have a second frequency bandwidth within the first frequency bandwidth. The composite signal may be output on a speaker, the speaker converting the composite signal into sound. While the composite signal is being output, a signal processing device may detect the output sound corresponding to the code signal. The code may be determined from the detected output sound corresponding to the code signal. Data associated with the code may be retrieved from a data storage device. The retrieved data may be displayed on a display device.

10.

Invention application
Visually-Represented Results To Search Queries In Rich Media Content (pending, published)

Publication No.: US20140324907A1

Publication date: 2014-10-30

Application No.: US14266738

Filing date: 2014-04-30

Applicant: AOL Inc.

Inventor: Rakesh Agrawal

IPC classification: G06F17/30

CPC classification: G06F16/438, G06F16/40, G06F16/489, G06F16/638, G06F16/685, G06F16/738, G06F16/7844, Y10S707/913, Y10S707/914

Abstract: When executed, a computer program product generates a graphical user interface that renders results that are responsive to a search query of a rich media file. The graphical user interface includes a chronological representation of the rich media file, one or more occurrence markers along the chronological representation corresponding to actual occurrences of a desired term at an indicated chronological location in the rich media file, and an execution icon configured to launch a rich media application that renders a relevant portion that is responsive to the search query.
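
The occurrence markers in this abstract can be illustrated with a small sketch that maps each hit for a search term to a position along a timeline. It assumes a time-stamped word list is already available for the media file (the abstract does not say where the timings come from); `occurrence_markers` and the sample data are hypothetical.

```python
def occurrence_markers(timed_words, term, duration):
    """Return marker positions (0.0 to 1.0 along the timeline) for each hit.

    timed_words: list of (word, seconds_from_start) pairs, assumed to come
    from some existing index of the rich media file.
    """
    wanted = term.lower()
    return [
        round(seconds / duration, 3)
        for word, seconds in timed_words
        if word.lower() == wanted
    ]


# Toy index of a 60-second clip.
index = [("budget", 12.0), ("vote", 30.0), ("Budget", 45.0)]
markers = occurrence_markers(index, "budget", duration=60.0)
```

A user interface could draw one marker at each returned fraction of the timeline's width and start playback at the corresponding time when a marker is selected.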

