-
Publication No.: US11922489B2
Publication date: 2024-03-05
Application No.: US16272902
Filing date: 2019-02-11
Applicant: A9.com, Inc.
Inventor: Rupa Chaturvedi , Xing Zhang , Frank Partalis , Yu Lou , Colin Jon Taylor , Simon Fox
IPC: G06T19/00 , G06F3/04847 , G06N3/084 , G06Q30/0251 , G06Q30/0601 , G06T7/50 , A61B90/00
CPC classification number: G06Q30/0643 , G06F3/04847 , G06N3/084 , G06Q30/0257 , G06Q30/0633 , G06T7/50 , G06T19/006 , A61B2090/365 , G06T2207/20081 , G06T2207/20084
Abstract: A camera is used to capture image data of representations of a physical environment. Planes and surfaces are determined from a representation. The planes and the surfaces are analyzed using relationships therebetween to obtain shapes and depth information for available spaces within the physical environment. Locations of the camera with respect to the physical environment are determined. The shapes and the depth information are analyzed using a trained neural network to determine items fitting the available spaces. A live camera view is overlaid with a selection from the items to provide an augmented reality (AR) view of the physical environment from an individual location of the locations. The AR view is enabled so that a user can port to a different location than the individual location by an input received to the AR view while the selection from the items remains anchored to the individual location.
-
Publication No.: US20200334882A1
Publication date: 2020-10-22
Application No.: US16903932
Filing date: 2020-06-17
Applicant: A9.com, Inc.
Inventor: Jesse Chang , Jared Corso , Xing Zhang , Arnab Sanat Kumar Dhua , Yu Lou , Jason Freund
Abstract: Approaches in accordance with various embodiments provide for the presentation of augmented reality (AR) content with respect to optically challenging surfaces. Such surfaces can be difficult to locate using conventional optical-based approaches that rely on visible features. Embodiments can utilize the fact that horizontal surfaces can be located relatively easily, and can determine intersections or boundaries of those horizontal surfaces that likely indicate the presence of another surface, such as a vertical wall. This boundary can be determined automatically, through user input, or using a combination of such approaches. Once such an intersection is located, a virtual plane can be determined whose relative location to a device displaying AR content can be tracked and used as a reference for displaying AR content.
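The abstract above derives a trackable virtual plane from the boundary of a detected horizontal surface. A minimal sketch of that geometry follows, assuming a world frame with the Y axis pointing up and two boundary points already located on the floor. The function names and the point-plus-normal representation are illustrative assumptions, not taken from the patent.

```python
import math

UP = (0.0, 1.0, 0.0)  # assumption: world frame with +Y as "up"


def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def vertical_plane(p1, p2):
    """Return (unit normal n, offset d) of the vertical plane n.x = d
    that contains the floor boundary segment p1 -> p2."""
    edge = sub(p2, p1)
    n = cross(UP, edge)
    length = math.sqrt(dot(n, n))
    if length == 0.0:
        raise ValueError("boundary points must differ horizontally")
    n = tuple(c / length for c in n)
    return n, dot(n, p1)


def signed_distance(plane, point):
    """Signed distance from a point (e.g. the device) to the plane."""
    n, d = plane
    return dot(n, point) - d


# Hypothetical boundary: floor meets wall along the line z = 2.
wall = vertical_plane((0.0, 0.0, 2.0), (1.0, 0.0, 2.0))
device = (0.0, 1.5, 0.0)
distance_to_wall = signed_distance(wall, device)
```

Recomputing `signed_distance` as the device pose updates gives the relative location that the abstract describes as the reference for placing AR content.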
-
Publication No.: US10755485B2
Publication date: 2020-08-25
Application No.: US16259387
Filing date: 2019-01-28
Applicant: A9.com, Inc.
Inventor: David Creighton Mott , Arnab Sanat Kumar Dhua , Colin Jon Taylor , Yu Lou , Chun-Kai Wang , Sudeshna Pantham , Himanshu Arora , Xi Zhang
Abstract: Systems and methods for displaying 3D containers in a computer generated environment are described. A computing device may provide a user with a catalog of objects which may be purchased. In order to view what an object may look like prior to purchasing the object, a computing device may show a 3D container that has the same dimensions as the object. As discussed herein, the 3D container may be located and oriented based on a two-dimensional marker. Moreover, some 3D containers may contain a representation of an object, which may be a 2D image of the object.
-
Publication No.: US20200258144A1
Publication date: 2020-08-13
Application No.: US16272902
Filing date: 2019-02-11
Applicant: A9.com, Inc.
Inventor: Rupa Chaturvedi , Xing Zhang , Frank Partalis , Yu Lou , Colin Jon Taylor , Simon Fox
Abstract: A camera is used to capture image data of representations of a physical environment. Planes and surfaces are determined from a representation. The planes and the surfaces are analyzed using relationships therebetween to obtain shapes and depth information for available spaces within the physical environment. Locations of the camera with respect to the physical environment are determined. The shapes and the depth information are analyzed using a trained neural network to determine items fitting the available spaces. A live camera view is overlaid with a selection from the items to provide an augmented reality (AR) view of the physical environment from an individual location of the locations. The AR view is enabled so that a user can port to a different location than the individual location by an input received to the AR view while the selection from the items remains anchored to the individual location.
-
Publication No.: US10558857B2
Publication date: 2020-02-11
Application No.: US15911850
Filing date: 2018-03-05
Applicant: A9.com, Inc.
Inventor: Peiqi Tang , Andrea Zehr , Rupa Chaturvedi , Yu Lou , Colin Jon Taylor , Mark Scott Waldo , Shaun Michael Post
IPC: G06T19/00 , G06K9/00 , G06K9/62 , G06T13/80 , G06K9/46 , G06F16/532 , G06F16/583
Abstract: Various embodiments of the present disclosure provide systems and methods for visual search and augmented reality, in which an onscreen body of visual markers overlaid on the interface signals the current state of an image recognition process. Specifically, the body of visual markers may take on a plurality of behaviors, in which a particular behavior is indicative of a particular state. Thus, the user can tell what the current state of the scanning process is by the behavior of the body of visual markers. The behavior of the body of visual markers may also indicate to the user recommended actions that can be taken to improve the scanning condition or otherwise facilitate the process. In various embodiments, as the scanning process goes from one state to another state, the onscreen body of visual markers may move or seamlessly transition from one behavior to another behavior, accordingly.
-
Publication No.: US20170330009A1
Publication date: 2017-11-16
Application No.: US15668501
Filing date: 2017-08-03
Applicant: A9.com, Inc.
Inventor: Chun-Kai Wang , Yu Lou
CPC classification number: G06K7/1443 , G06K7/10722 , G06K7/1417
Abstract: Various algorithms are presented that enable an image of a data matrix to be analyzed and decoded for use in obtaining information about an object or item associated with the data matrix. The algorithms can account for variations in position and/or alignment of the data matrix. In one approach, the image is analyzed to determine a connected region of pixels. The connected region of pixels can be analyzed to determine a pair of pixels, included in the connected region, that are separated by the greatest distance, wherein a first pixel and a second pixel of the pair are associated with image coordinates. Using the image coordinates of the pair of pixels, a potential area of the image that includes the visual code can be determined, and the potential area can be analyzed to verify the presence of a potential data matrix.
-
Publication No.: US09736361B2
Publication date: 2017-08-15
Application No.: US15094518
Filing date: 2016-04-08
Applicant: A9.com, Inc.
Inventor: Adam Wiggen Kraft , Kathy Wing Lam Ma , Xiaofan Lin , Arnab Sanat Kumar Dhua , Yu Lou
IPC: H04N5/225 , H04N5/232 , G06F3/0484 , G06F17/27 , G06K9/18 , G06T15/08 , H04N7/18 , G06F3/0482 , G06F17/24 , G06Q30/06 , G06T15/00 , H04N1/00 , G06T7/70 , G06T7/194
CPC classification number: H04N5/23222 , G06F3/0482 , G06F3/04842 , G06F17/24 , G06F17/2715 , G06K9/18 , G06Q30/0603 , G06Q30/0625 , G06T7/194 , G06T7/70 , G06T15/00 , G06T15/08 , G06T2210/22 , G06T2215/16 , H04N1/00 , H04N7/183
Abstract: Various approaches provide for detecting and recognizing text to enable a user to perform various functions or tasks. For example, a user could point a camera at an object with text, in order to capture an image of that object. The camera can be integrated with a portable computing device that is capable of taking the image and processing the image (or providing the image for processing) to recognize, identify, and/or isolate the text in order to send the image of the object as well as recognized text to an application, function, or system, such as an electronic marketplace.
-
Publication No.: US20170024595A1
Publication date: 2017-01-26
Application No.: US15282829
Filing date: 2016-09-30
Applicant: A9.com, Inc.
Inventor: Chun-Kai Wang , Yu Lou
IPC: G06K7/14
CPC classification number: G06K7/1443 , G06K7/10722 , G06K7/1417
Abstract: Various algorithms are presented that enable an image of a data matrix to be analyzed and decoded for use in obtaining information about an object or item associated with the data matrix. The algorithms can account for variations in position and/or alignment of the data matrix. In one approach, the image is analyzed to determine a connected region of pixels. The connected region of pixels can be analyzed to determine a pair of pixels, included in the connected region, that are separated by the greatest distance, wherein a first pixel and a second pixel of the pair are associated with image coordinates. Using the image coordinates of the pair of pixels, a potential area of the image that includes the visual code can be determined, and the potential area can be analyzed to verify the presence of a potential data matrix.
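The localization steps in the abstract above (find a connected region, take its two most distant pixels, derive a candidate area) can be sketched as follows. This is only an illustration under stated assumptions: the image is a small binary grid, connectivity is 4-neighbour, the farthest pair is found by brute force, and the candidate area is the axis-aligned box spanned by the pair. None of these choices is confirmed by the patent, and all function names are hypothetical.

```python
from collections import deque
from itertools import combinations


def connected_region(grid, start):
    """Flood-fill the 4-connected foreground region containing `start`.
    Assumes `start` is itself a foreground (truthy) pixel."""
    rows, cols = len(grid), len(grid[0])
    seen = {start}
    queue = deque([start])
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] and (nr, nc) not in seen:
                seen.add((nr, nc))
                queue.append((nr, nc))
    return seen


def farthest_pair(region):
    """Brute-force O(n^2) search for the two most distant pixels."""
    def squared_distance(pair):
        (r1, c1), (r2, c2) = pair
        return (r1 - r2) ** 2 + (c1 - c2) ** 2
    return max(combinations(sorted(region), 2), key=squared_distance)


def potential_area(p, q):
    """Axis-aligned box (top, left, bottom, right) spanned by the pair."""
    return (min(p[0], q[0]), min(p[1], q[1]),
            max(p[0], q[0]), max(p[1], q[1]))


# Hypothetical L-shaped border, like the solid edges of a data matrix.
grid = [
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
]
region = connected_region(grid, (0, 0))
pair = farthest_pair(region)
area = potential_area(*pair)
```

For an L-shaped border the two most distant pixels sit at the ends of the two arms, so the box they span covers the whole symbol; that box is what a later verification step would examine.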
-
Publication No.: US09436883B2
Publication date: 2016-09-06
Application No.: US14816943
Filing date: 2015-08-03
Applicant: A9.com, Inc.
Inventor: Xiaofan Lin , Adam Wiggen Kraft , Yu Lou , Douglas Ryan Gray , Colin Jon Taylor
CPC classification number: G06K9/18 , G06K9/00456 , G06K9/00523 , G06K9/228 , G06K2209/01 , G06T7/11
Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be divided into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on one or more parameter metrics measured in the respective image region.
-
Publication No.: US09256795B1
Publication date: 2016-02-09
Application No.: US13842433
Filing date: 2013-03-15
Applicant: A9.com, Inc.
Inventor: Douglas Ryan Gray , Xiaofan Lin , Arnab Sanat Kumar Dhua , Yu Lou
IPC: G06K9/20
CPC classification number: G06K9/325 , G06F3/14 , G06F17/24 , G06K9/00671 , G06K9/2054 , G06K9/2072 , G06K9/6215 , G06K9/6857 , G06K9/723 , G06K2209/01
Abstract: Various embodiments enable the identification of semi-structured text entities in an image. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is analyzed to isolate a character string indicative of a text entity. Each character of the string is then assigned to a character class to produce a character class string, and the text entity of the string is identified based in part on a pattern of the character class string.
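The character-class idea in the abstract above can be illustrated with a short sketch. The class alphabet (`D` for digit, `A` for letter, `S` for whitespace, punctuation kept as-is) and the two entity patterns are assumptions chosen for the example; the patent does not specify them here.

```python
import re


def char_class(ch):
    """Map one character to a coarse class symbol (assumed alphabet)."""
    if ch.isdigit():
        return "D"
    if ch.isalpha():
        return "A"
    if ch.isspace():
        return "S"
    return ch  # punctuation is kept verbatim


def class_string(text):
    """Produce the character class string for an OCR output string."""
    return "".join(char_class(ch) for ch in text)


# Hypothetical patterns written over class strings, not raw text.
ENTITY_PATTERNS = {
    "phone": re.compile(r"^\(?DDD\)?[-.S]?DDD[-.S]DDDD$"),
    "email": re.compile(r"^[AD._]+@[AD.-]+\.A{2,}$"),
}


def identify(text):
    """Return the first entity type whose pattern matches, else None."""
    classes = class_string(text)
    for name, pattern in ENTITY_PATTERNS.items():
        if pattern.match(classes):
            return name
    return None


phone_classes = class_string("(555) 123-4567")
phone_type = identify("(555) 123-4567")
email_type = identify("jane.doe@example.com")
```

Because the patterns operate on classes rather than literal characters, a digit misread by the OCR engine as another digit leaves the class string, and therefore the detected entity type, unchanged.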