-
公开(公告)号:US10592769B2
公开(公告)日:2020-03-17
申请号:US15240838
申请日:2016-08-18
发明人: Linjun Yang , Xian-Sheng Hua , Yang Cai
摘要: Techniques describe submitting a video clip as a query by a user. A process retrieves images and information associated with the images in response to the query. The process decomposes the video clip into a sequence of frames to extract the features in a frame and to quantize the extracted features into descriptive words. The process further tracks the extracted features as points in the frame, a first set of points to correspond to a second set of points in consecutive frames to construct a sequence of points. Then the process identifies the points that satisfy criteria of being stable points and being centrally located in the frame to represent the video clip as a bag of descriptive words for searching for images and information related to the video clip.
-
公开(公告)号:US20150332124A1
公开(公告)日:2015-11-19
申请号:US14810181
申请日:2015-07-27
发明人: Linjun Yang , Lifeng Shang , Xian-Sheng Hua , Fei Wang
CPC分类号: G06K9/6215 , G06F16/783 , G06K9/00751 , G06K9/00758 , G06K9/4609 , G06K9/4652
摘要: A similarity of a first video to a second video may be identified automatically. Images are received from the videos, and divided into sub-images. The sub-images are evaluated based on a feature common to each of the sub-images. Binary representations of the images may be created based on the evaluation of the sub-images. A similarity of the first video to the second video may be determined based on a number of occurrences of a binary representation in the first video and the second video.
摘要翻译: 可以自动识别第一视频与第二视频的相似度。 从视频接收图像,并分成子图像。 基于每个子图像共有的特征来评估子图像。 可以基于子图像的评估来创建图像的二进制表示。 可以基于第一视频和第二视频中的二进制表示的出现次数来确定第一视频与第二视频的相似度。
-
公开(公告)号:US20160358036A1
公开(公告)日:2016-12-08
申请号:US15240838
申请日:2016-08-18
发明人: Linjun Yang , Xian-Sheng Hua , Yang Cai
摘要: Techniques describe submitting a video clip as a query by a user. A process retrieves images and information associated with the images in response to the query. The process decomposes the video clip into a sequence of frames to extract the features in a frame and to quantize the extracted features into descriptive words. The process further tracks the extracted features as points in the frame, a first set of points to correspond to a second set of points in consecutive frames to construct a sequence of points. Then the process identifies the points that satisfy criteria of being stable points and being centrally located in the frame to represent the video clip as a bag of descriptive words for searching for images and information related to the video clip.
摘要翻译: 技术描述提交视频剪辑作为用户的查询。 响应于查询,进程检索与图像相关联的图像和信息。 该过程将视频剪辑分解成帧序列以提取帧中的特征并将提取的特征量化为描述性词。 该过程进一步跟踪提取的特征作为帧中的点,第一组点对应于连续帧中的第二组点以构成点序列。 然后,该过程识别满足稳定点的标准并且位于帧中心的点以将视频剪辑表示为用于搜索与视频剪辑相关的图像和信息的描述词的一袋。
-
公开(公告)号:US11372914B2
公开(公告)日:2022-06-28
申请号:US15936117
申请日:2018-03-26
发明人: Yokesh Kumar , Kuang-Huei Lee , Houdong Hu , Li Huang , Arun Sacheti , Meenaz Merchant , Linjun Yang , Tianjun Xiao , Saurajit Mukherjee
IPC分类号: G06F16/583 , G06F16/58 , G06F16/51 , G06F16/538 , G06N5/02 , G06F16/9535 , G06N20/00
摘要: The description relates to diversified hybrid image annotation for annotating images. One implementation includes generating first image annotations for a query image using a retrieval-based image annotation technique. Second image annotations can be generated for the query image using a model-based image annotation technique. The first and second image annotations can be integrated to generate a diversified hybrid image annotation result for the query image.
-
公开(公告)号:US20200019628A1
公开(公告)日:2020-01-16
申请号:US16036224
申请日:2018-07-16
发明人: Xi Chen , Houdong Hu , Li Huang , Jiapei Huang , Arun Sacheti , Linjun Yang , Rui Xia , Kuang-Huei Lee , Meenaz Merchant , Sean Chang Culatana
摘要: Representative embodiments disclose mechanisms to perform visual intent classification or visual intent detection or both on an image. Visual intent classification utilizes a trained machine learning model that classifies subjects in the image according to a classification taxonomy. The visual intent classification can be used as a pre-triggering mechanism to initiate further action in order to substantially save processing time. Example further actions include user scenarios, query formulation, user experience enhancement, and so forth. Visual intent detection utilizes a trained machine learning model to identify subjects in an image, place a bounding box around the image, and classify the subject according to the taxonomy. The trained machine learning model utilizes multiple feature detectors, multi-layer predictions, multilabel classifiers, and bounding box regression.
-
公开(公告)号:US20190236487A1
公开(公告)日:2019-08-01
申请号:US15883686
申请日:2018-01-30
发明人: Jiapei Huang , Houdong Hu , Li Huang , Xi Chen , Linjun Yang
IPC分类号: G06N99/00
CPC分类号: G06N20/00 , G06F3/04842
摘要: A technique for hyperparameter tuning can be performed via a hyperparameter tuning tool. In the technique, computer-readable values for each of one or more machine learning hyperparameters can be received. Multiple computer-readable hyperparameter value sets can be defined using different combinations of the values. In response to a request to start, an overall hyperparameter tuning operation can be performed via the tool, with the overall operation including a tuning job for each of the hyperparameter sets. A computer-readable comparison of the results of the parameter tuning operations can be generated for the hyperparameter sets, with the comparison indicating effectiveness of the hyperparameter sets, as compared to each other, in the tuning jobs.
-
公开(公告)号:US20160247070A1
公开(公告)日:2016-08-25
申请号:US15145563
申请日:2016-05-03
发明人: Shipeng Li , Yang Yang , Bin Benjamin Zhu , Rui Guo , Linjun Yang
CPC分类号: G06N5/022 , G06F2221/2133 , G06N3/12 , G06N3/126 , G06N5/04 , H04L63/1416
摘要: Technologies for a human computation framework suitable for answering common sense questions that are difficult for computers to answer but easy for humans to answer. The technologies support solving general common sense problems without a priori knowledge of the problems; support for determining whether an answer is from a bot or human so as to screen out spurious answers from bots; support for distilling answers collected from human users to ensure high quality solutions to the questions asked; and support for preventing malicious elements in or out of the system from attacking other system elements or contaminating the solutions produced by the system, and preventing users from being compensated without contributing answers.
-
公开(公告)号:US11074289B2
公开(公告)日:2021-07-27
申请号:US15885568
申请日:2018-01-31
发明人: Houdong Hu , Yan Wang , Linjun Yang , Li Huang , Xi Chen , Jiapei Huang , Ye Wu , Arun K. Sacheti , Meenaz Merchant
IPC分类号: G06F16/53 , G06F16/532 , G06T7/00 , G06K9/62 , G06K9/46 , G06N3/08 , G06F16/51 , G06F16/56 , G06F16/583 , G06F16/2457
摘要: Systems and methods can be implemented to conduct searches based on images used as queries in a variety of applications. In various embodiments, a set of visual words representing a query image are generated from features extracted from the query image and are compared with visual words of index images. A set of candidate images is generated from the index images resulting from matching one or more visual words in the comparison. A multi-level ranking is conducted to sort the candidate images of the set of candidate images, and results of the multi-level ranking are returned to a user device that provided the query image. Additional systems and methods are disclosed.
-
公开(公告)号:US10664515B2
公开(公告)日:2020-05-26
申请号:US14975340
申请日:2015-12-18
发明人: Arun Sacheti , Ming Ye , Linjun Yang , Karim Hasham , Pavel Komlev
IPC分类号: G06F7/00 , G06F16/583 , G06T1/00 , G06F3/0481 , G06F16/248 , G06F16/9535 , G06F16/14
摘要: Systems, computing devices, and methods for performing an image search are presented. A search query including an image is received from a user. A segment associated with the image is identified. A user intent associated with the image and the segment is identified. Search results associated with the identified segment and user intent are generated, and presented to the user.
-
公开(公告)号:US20190243910A1
公开(公告)日:2019-08-08
申请号:US15888960
申请日:2018-02-05
发明人: Yan Wang , Houdong Hu , Li Huang , Arun K. Sacheti , Linjun Yang
IPC分类号: G06F17/30
CPC分类号: G06F16/5838 , G06F16/24578 , G06F16/51 , G06F16/56 , G06F16/583 , G06F16/5866 , G06F16/587
摘要: Systems and methods can be implemented to conduct a visual search as a service in a variety of applications. In various embodiments, a system is configured to provide searching capabilities of content provided by a first entity in response to a search request by a second entity. An image provided by the second entity can be used by the system as a query image to search the content of the first entity. In an embodiment, the first entity can be a commercial entity providing such a system with image related content regarding its products and services such that any number of individual consumers can search for particular products and services of the commercial entity via their communication enabled devices. In addition, such systems can be arranged for other embodiments to provide customized searches of a single source by many individual devices. Additional systems and methods are disclosed.
-
-
-
-
-
-
-
-
-