-
公开(公告)号:US11227326B2
公开(公告)日:2022-01-18
申请号:US16673477
申请日:2019-11-04
Applicant: A9.com, Inc.
Inventor: Xiaofan Lin , Arnab Sanat Kumar Dhua , Douglas Ryan Gray , Atul Kumar , Yu Lou
Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.
-
公开(公告)号:US10540378B1
公开(公告)日:2020-01-21
申请号:US15195445
申请日:2016-06-28
Applicant: A9.com, Inc.
Inventor: Edward Hsiao , Douglas Ryan Gray , Nityananda Jayadevaprakash , Xiaofan Lin , Mark Jay Nitzberg , Shruti Sheorey
Abstract: Approaches provide for analyzing image data to determine and/or recognize text in the image data. The recognized text can be used to generate a search query that can be automatically submitted to a search engine without having to type the search query to identify a product (or related products) associated with the image. For example, a camera of a computing device can be used to capture a live camera view (or single images) an item. An application executing on the computing device (or at least in communication with the computing device) can analyze the image data of the live camera view to determine a set of keywords (e.g., identified text) based on visual features extracted from the image data. The keywords can be used to query an index of product titles, common search queries, among other indexed data to return a ranked list of search suggestions based on a relevance function. The relevance function can consider the ordering of the keywords to rank search suggestions more highly that contain the keywords having the same word order. Further, the relevance function can consider the confidence of the visual recognition of each keyword, the confidence of each search suggestion, customer impact, as well as other factors to determine the ranking of the search suggestions. The search suggestions can be further refined to ensure search results that the user will be more likely to view and/or purchase.
-
公开(公告)号:US09646335B2
公开(公告)日:2017-05-09
申请号:US14863325
申请日:2015-09-23
Applicant: A9.com, Inc.
Inventor: Xiaofan Lin , Arnab Sanat Kumar Dhua , Douglas Ryan Gray , Atul Kumar , Yu Lou
CPC classification number: H04N5/23293 , G06F3/005 , G06K9/00201 , G06K9/00456 , G06K9/00671 , G06K9/344 , G06Q30/0269 , G06Q30/0623 , G06Q30/0641 , G06T11/60
Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.
-
公开(公告)号:US09098888B1
公开(公告)日:2015-08-04
申请号:US14105028
申请日:2013-12-12
Applicant: A9.com, Inc.
Inventor: Xiaofan Lin , Adam Wiggen Kraft , Yu Lou , Douglas Ryan Gray , Colin Jon Taylor
IPC: G06T7/00
CPC classification number: G06K9/18 , G06K9/00456 , G06K9/00523 , G06K9/228 , G06K2209/01 , G06T7/11
Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.
Abstract translation: 各种实施例提供了通过在文本检测中应用合适的文本检测参数来识别图像中的文本的方法和系统。 可以基于来自一个或多个文本识别子任务的参数度量反馈(例如文本检测,文本识别,预处理,字符集映射,模式匹配和验证)来确定合适的文本检测参数。 在一些实施例中,可以通过对图像执行字形检测来将图像定义为一个或多个图像区域。 可以基于相应图像区域中测量的一个或多个参数度量来调整应用于一个或多个图像区域中的每一个的文本检测参数。
-
公开(公告)号:US10769200B1
公开(公告)日:2020-09-08
申请号:US14789443
申请日:2015-07-01
Applicant: A9.com, Inc.
Inventor: Xiaofan Lin
IPC: G06K9/00 , G06F16/248 , G06F16/2455 , G06F16/583 , G06F16/242
Abstract: A user can capture an image of a text object of interest and have that image submitted for processing. The image can be pre-processed to improve quality and then submitted to an optical character recognition process to identify the words, characters, or strings in the image. At least some of these results can be submitted as a query to a search engine to obtain potential matches. In order to improve the accuracy of the results, information such as the titles for the results can be compared against each recognized word, character, or string from the image, including the ordering of those elements. An updated relevancy score can then be generated based on the full, ordered set. The recognized text is also analyzed to attempt to recognize model numbers or other identifiers that can be weighted more heavily as being indicative of accurate matches. Matches are selected from the re-ranked results.
-
公开(公告)号:US09870633B2
公开(公告)日:2018-01-16
申请号:US15387219
申请日:2016-12-21
Applicant: A9.com, Inc.
Inventor: Adam Wiggen Kraft , Arnab Sanat Kumar Dhua , Douglas Ryan Gray , Xiaofan Lin , Yu Lou , Sunil Ramesh , Colin Jon Taylor , David Creighton Mott
CPC classification number: G06T11/60 , G06K9/00577 , G06T19/006
Abstract: Various embodiments enable a computing device to perform tasks such as highlighting words in an augmented reality view that are important to a user. For example, word lists can be generated and the user, by pointing a camera of a computing device at a volume of text, can cause words from the word list within the volume of text to be highlighted in a live field of view of the camera displayed thereon. Accordingly, users can quickly identify textual information that is meaningful to them in an Augmented Reality view to aid the user in sifting through real-world text.
-
公开(公告)号:US09582735B2
公开(公告)日:2017-02-28
申请号:US15063050
申请日:2016-03-07
Applicant: A9.com, Inc.
Inventor: Simant Dube , Sunil Ramesh , Xiaofan Lin , Arnab Sanat Kumar Dhua , Colin Jon Taylor , Jaishanker K. Pillai
CPC classification number: G06K9/6215 , G06F17/30247 , G06K9/00523 , G06K9/4676 , G06K9/52 , G06K9/6206 , G06K9/6211 , G06K9/6218 , G06K9/6232 , G06K9/6256 , G06K9/6267 , G06K9/6269 , G06K9/6276 , G06K9/6277 , G06K9/6284 , G06K9/66 , G06K2209/19 , H04N19/426 , H04N19/90
Abstract: Various embodiments may increase scalability of image representations stored in a database for use in image matching and retrieval. For example, a system providing image matching can obtain images of a number of inventory items, extract features from each image using a feature extraction algorithm, and transform the same into their feature descriptor representations. These feature descriptor representations can be subsequently stored and used to compare against query images submitted by users. Though the size of each feature descriptor representation isn't particularly large, the total number of these descriptors requires a substantial amount of storage space. Accordingly, feature descriptor representations are compressed to minimize storage and, in one example, machine learning can be used to compensate for information lost as a result of the compression.
Abstract translation: 各种实施例可以增加存储在用于图像匹配和检索的数据库中的图像表示的可扩展性。 例如,提供图像匹配的系统可以获得多个库存物品的图像,使用特征提取算法从每个图像中提取特征,并将其转换成它们的特征描述符表示。 这些特征描述符表示可随后存储并用于与用户提交的查询图像进行比较。 虽然每个特征描述符表示的大小不是特别大,但是这些描述符的总数需要大量的存储空间。 因此,压缩特征描述符表示以最小化存储,并且在一个示例中,可以使用机器学习来补偿由于压缩而丢失的信息。
-
公开(公告)号:US09350913B2
公开(公告)日:2016-05-24
申请号:US14874272
申请日:2015-10-02
Applicant: A9.com, Inc.
Inventor: Adam Wiggen Kraft , Kathy Wing Lam Ma , Xiaofan Lin , Arnab Sanat Kumar Dhua , Yu Lou
CPC classification number: H04N5/23222 , G06F3/0482 , G06F3/04842 , G06F17/24 , G06F17/2715 , G06K9/18 , G06Q30/0603 , G06Q30/0625 , G06T7/194 , G06T7/70 , G06T15/00 , G06T15/08 , G06T2210/22 , G06T2215/16 , H04N1/00 , H04N7/183
Abstract: Various approaches provide for detecting and recognizing text to enable a user to perform various functions or tasks. For example, a user could point a camera at an object with text, in order to capture an image of that object. The camera can be integrated with a portable computing device that is capable of taking the image and processing the image (or providing the image for processing) to recognize, identify, and/or isolate the text in order to send the image of the object as well as recognized text to an application, function, or system, such as an electronic marketplace.
-
公开(公告)号:US09292739B1
公开(公告)日:2016-03-22
申请号:US14105084
申请日:2013-12-12
Applicant: A9.com, Inc.
Inventor: Douglas Ryan Gray , Colin Jay Taylor , Xiaofan Lin , Adam Wiggen Kraft , Yu Lou , Arnab Sanat Kumar Dhua
CPC classification number: G06K9/033 , G06K9/228 , G06K9/6292 , G06K2009/2045 , G06K2209/01
Abstract: Various embodiments enable text aggregation from multiple image frames of text. Accordingly, in order to stitch newly scanned areas of a document together, text in a respective image is recognized and analyzed using an algorithm to identify pairs of corresponding words in other images. Upon identifying a minimum number of matching pairs between two respective images, a mapping between the same can be determined based at least in part on a geometric correspondence between respective identified pairs. Based on this mapping, the recognized text of the two images can be merged by adding words of one image to the other using the matching word pairs as alignment data points.
Abstract translation: 各种实施例使得能够从文本的多个图像帧进行文本聚合。 因此,为了将文档的新扫描区域一起缝合,使用用于识别其他图像中的对应词对的算法来识别和分析各个图像中的文本。 在识别两个相应图像之间的匹配对的最小数量时,可以至少部分地基于相应的识别对之间的几何对应来确定相同之间的映射。 基于该映射,可以通过使用匹配词对作为对齐数据点将一个图像的单词添加到另一个图像来合并两个图像的识别文本。
-
公开(公告)号:US09280560B1
公开(公告)日:2016-03-08
申请号:US14133252
申请日:2013-12-18
Applicant: A9.com, Inc.
Inventor: Simant Dube , Sunil Ramesh , Xiaofan Lin , Arnab Sanat Kumar Dhua , Colin Jon Taylor , Jaishanker K. Pillai
CPC classification number: G06K9/6215 , G06F17/30247 , G06K9/00523 , G06K9/4676 , G06K9/52 , G06K9/6206 , G06K9/6211 , G06K9/6218 , G06K9/6232 , G06K9/6256 , G06K9/6267 , G06K9/6269 , G06K9/6276 , G06K9/6277 , G06K9/6284 , G06K9/66 , G06K2209/19 , H04N19/426 , H04N19/90
Abstract: Various embodiments may increase scalability of image representations stored in a database for use in image matching and retrieval. For example, a system providing image matching can obtain images of a number of inventory items, extract features from each image using a feature extraction algorithm, and transform the same into their feature descriptor representations. These feature descriptor representations can be subsequently stored and used to compare against query images submitted by users. Though the size of each feature descriptor representation isn't particularly large, the total number of these descriptors requires a substantial amount of storage space. Accordingly, feature descriptor representations are compressed to minimize storage and, in one example, machine learning can be used to compensate for information lost as a result of the compression.
Abstract translation: 各种实施例可以增加存储在用于图像匹配和检索的数据库中的图像表示的可扩展性。 例如,提供图像匹配的系统可以获得多个库存物品的图像,使用特征提取算法从每个图像中提取特征,并将其转换成它们的特征描述符表示。 这些特征描述符表示可随后存储并用于与用户提交的查询图像进行比较。 虽然每个特征描述符表示的大小不是特别大,但是这些描述符的总数需要大量的存储空间。 因此,压缩特征描述符表示以最小化存储,并且在一个示例中,可以使用机器学习来补偿由于压缩而丢失的信息。
-
-
-
-
-
-
-
-
-