-
公开(公告)号:US08429212B1
公开(公告)日:2013-04-23
申请号:US13342532
申请日:2012-01-03
申请人: Samy Bengio , Gal Chechik , Sergey Ioffe , Jay Yagnik
发明人: Samy Bengio , Gal Chechik , Sergey Ioffe , Jay Yagnik
IPC分类号: G06F17/00
CPC分类号: G06K9/66 , G06F17/30244 , G06F17/3053 , G06K9/6267 , Y10S707/913 , Y10S707/915
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training scoring models. One method includes storing data identifying a plurality of positive and a plurality of negative training images for a query. The method further includes selecting a first image from either the positive group of images or the negative group of images, and applying a scoring model to the first image. The method further includes selecting a plurality of candidate images from the other group of images, applying the scoring model to each of the candidate images, and then selecting a second image from the candidate images according to scores for the images. The method further includes determining that the scores for the first image and the second image fail to satisfy a criterion, updating the scoring model, and storing the updated scoring model.
-
公开(公告)号:US08131786B1
公开(公告)日:2012-03-06
申请号:US12624001
申请日:2009-11-23
申请人: Samy Bengio , Gal Chechik , Sergey Ioffe , Jay Yagnik
发明人: Samy Bengio , Gal Chechik , Sergey Ioffe , Jay Yagnik
IPC分类号: G06F17/00
CPC分类号: G06K9/66 , G06F17/30244 , G06F17/3053 , G06K9/6267 , Y10S707/913 , Y10S707/915
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training scoring models. One method includes storing data identifying a plurality of positive and a plurality of negative training images for a query. The method further includes selecting a first image from either the positive group of images or the negative group of images, and applying a scoring model to the first image. The method further includes selecting a plurality of candidate images from the other group of images, applying the scoring model to each of the candidate images, and then selecting a second image from the candidate images according to scores for the images. The method further includes determining that the scores for the first image and the second image fail to satisfy a criterion, updating the scoring model, and storing the updated scoring model.
摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于训练评分模型。 一种方法包括存储识别用于查询的多个正训练图像和多个负训练图像的数据。 该方法还包括从图像的正组或负图像组中选择第一图像,以及将评分模型应用于第一图像。 该方法还包括从另一组图像中选择多个候选图像,将评分模型应用于每个候选图像,然后根据图像的分数从候选图像中选择第二图像。 该方法还包括确定第一图像和第二图像的分数不能满足标准,更新评分模型,并存储更新的评分模型。
-
公开(公告)号:US08589457B1
公开(公告)日:2013-11-19
申请号:US13616108
申请日:2012-09-14
申请人: Samy Bengio , Gal Chechik , Sergey Ioffe , Jay Yagnik
发明人: Samy Bengio , Gal Chechik , Sergey Ioffe , Jay Yagnik
IPC分类号: G06F17/00
CPC分类号: G06K9/66 , G06F17/30244 , G06F17/3053 , G06K9/6267 , Y10S707/913 , Y10S707/915
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training scoring models. One method includes storing data identifying a plurality of positive and a plurality of negative training images for a query. The method further includes selecting a first image from either the positive group of images or the negative group of images, and applying a scoring model to the first image. The method further includes selecting a plurality of candidate images from the other group of images, applying the scoring model to each of the candidate images, and then selecting a second image from the candidate images according to scores for the images. The method further includes determining that the scores for the first image and the second image fail to satisfy a criterion, updating the scoring model, and storing the updated scoring model.
-
公开(公告)号:US20110047163A1
公开(公告)日:2011-02-24
申请号:US12546436
申请日:2009-08-24
申请人: Gal Chechik , Samy Bengio
发明人: Gal Chechik , Samy Bengio
IPC分类号: G06F17/30
CPC分类号: G06F16/7867 , G06F16/70 , G06F16/738 , G06F16/743 , G06F16/78 , G06F16/783
摘要: A system, computer readable storage medium, and computer-implemented method presents video search results responsive to a user keyword query. The video hosting system uses a machine learning process to learn a feature-keyword model associating features of media content from a labeled training dataset with keywords descriptive of their content. The system uses the learned model to provide video search results relevant to a keyword query based on features found in the videos. Furthermore, the system determines and presents one or more thumbnail images representative of the video using the learned model.
摘要翻译: 系统,计算机可读存储介质和计算机实现的方法响应于用户关键词查询呈现视频搜索结果。 视频托管系统使用机器学习过程来学习将标记训练数据集中的媒体内容的特征与描述其内容的关键字相关联的特征关键字模型。 该系统使用学习模型,根据视频中的功能提供与关键字查询相关的视频搜索结果。 此外,系统使用所学习的模型来确定并呈现代表视频的一个或多个缩略图。
-
5.
公开(公告)号:US08463719B2
公开(公告)日:2013-06-11
申请号:US12722437
申请日:2010-03-11
申请人: Richard F. Lyon , Martin Rehn , Thomas Walters , Samy Bengio , Gal Chechik
发明人: Richard F. Lyon , Martin Rehn , Thomas Walters , Samy Bengio , Gal Chechik
IPC分类号: G06F15/18
CPC分类号: G10L25/48 , G06F17/30743
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, are provided for using audio features to classify audio for information retrieval. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query.
摘要翻译: 提供方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用音频特征来分类用于信息检索的音频。 通常,本说明书中描述的主题的一个方面可以包括生成听觉图像的集合的动作的方法,每个听觉图像根据听觉模型从各个音频文件生成; 从集合中的每个听觉图像中提取稀疏特征以生成表示相应音频文件的稀疏特征向量; 以及响应于包括使用所述稀疏特征向量的一个或多个单词的查询和将稀疏特征向量与所述查询中的单词相关联的匹配函数进行排序。
-
6.
公开(公告)号:US20100257129A1
公开(公告)日:2010-10-07
申请号:US12722437
申请日:2010-03-11
申请人: Robert F. Lyon , Martin Rehn , Thomas Walters , Samy Bengio , Gal Chechik
发明人: Robert F. Lyon , Martin Rehn , Thomas Walters , Samy Bengio , Gal Chechik
CPC分类号: G10L25/48 , G06F17/30743
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, are provided for using audio features to classify audio for information retrieval. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query.
摘要翻译: 提供方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用音频特征来分类用于信息检索的音频。 通常,本说明书中描述的主题的一个方面可以包括生成听觉图像的集合的动作的方法,每个听觉图像根据听觉模型从各个音频文件生成; 从集合中的每个听觉图像中提取稀疏特征以生成表示相应音频文件的稀疏特征向量; 以及响应于包括使用所述稀疏特征向量的一个或多个单词的查询和将稀疏特征向量与查询中的单词相关联的匹配函数进行排序。
-
公开(公告)号:US08429168B1
公开(公告)日:2013-04-23
申请号:US12638704
申请日:2009-12-15
申请人: Gal Chechik , Samy Bengio , Varun Sharma
发明人: Gal Chechik , Samy Bengio , Varun Sharma
IPC分类号: G06F17/30
CPC分类号: G06K9/66 , G06F17/30247 , G06F17/30265 , G06K9/6212 , G06K9/6276
摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying similar images. In some implementations, a method is provided that includes receiving a collection of images and data associated with each image in the collection of images; generating a sparse feature representation for each image in the collection of images; and training an image similarity function using image triplets sampled from the collection of images and corresponding sparse feature representations.
-
公开(公告)号:US08995716B1
公开(公告)日:2015-03-31
申请号:US13547550
申请日:2012-07-12
申请人: Asaf Zomet , Gal Chechik
发明人: Asaf Zomet , Gal Chechik
IPC分类号: G06K9/00
CPC分类号: G06K9/00664 , G06F17/30247 , G06F17/3087 , G06K2209/27
摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for retrieving images based on a seasonal time period. In one aspect, a method includes receiving a query including image data defining an image depicting a subject. A location at which the image was captured by an image capturing device is determined. One or more additional images of the subject that were captured by an image capturing device at the identified location are identified. At least a portion of the additional images are provided in response to receiving the query.
摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于基于季节时间段检索图像。 在一个方面,一种方法包括接收包括定义描绘对象的图像的图像数据的查询。 确定由图像捕获装置拍摄图像的位置。 识别由识别的位置处的图像捕获装置捕获的被摄体的一个或多个附加图像。 响应于接收查询而提供附加图像的至少一部分。
-
公开(公告)号:US08949125B1
公开(公告)日:2015-02-03
申请号:US12816563
申请日:2010-06-16
申请人: Gal Chechik
发明人: Gal Chechik
CPC分类号: G10L15/08 , G10L13/02 , G10L13/06 , G10L2015/085
摘要: Systems and methods are provided to select a most typical pronunciation of a location name on a map from a plurality of user pronunciations. A server generates a reference speech model based on user pronunciations, compares the user pronunciations with the speech model and selects a pronunciation based on comparison. Alternatively, the server compares the distance between one the user pronunciations and every other user pronunciations and selects a pronunciation based on comparison. The server then annotates the map with the selected pronunciation and provides the audio output of the location name to a user device upon a user's request.
摘要翻译: 提供系统和方法以从多个用户发音中选择地图上位置名称的最典型的发音。 服务器基于用户发音生成参考语音模型,将用户发音与语音模型进行比较,并根据比较选择发音。 或者,服务器比较用户发音之间的距离和每个其他用户发音之间的距离,并且基于比较选择发音。 然后,服务器用所选择的发音注释地图,并且在用户请求时将位置名称的音频输出提供给用户设备。
-
公开(公告)号:US09665773B2
公开(公告)日:2017-05-30
申请号:US13532468
申请日:2012-06-25
申请人: Asaf Zomet , Ehud Rivlin , Gal Chechik
发明人: Asaf Zomet , Ehud Rivlin , Gal Chechik
IPC分类号: G06K9/00
CPC分类号: G06K9/00677
摘要: A system, computer-implemented method and non-transitory computer-readable medium for automatically searching for images from events is provided. One or more personal identity tags are provided, wherein the personal identity tags relate to identification information for one or more individuals. Next, at least one event group is identified, wherein the event group is a collection of images associated with an event, the collection of images including one or more images tagged with one or more of the provided personal identity tags. A collection of the images for each of the identified event groups is then received.
-
-
-
-
-
-
-
-
-