Media processing techniques for enhancing content

    Publication number: US12106333B2

    Publication date: 2024-10-01

    Application number: US18200806

    Filing date: 2023-05-23

    Applicant: A9.com, Inc.

    CPC classification number: G06Q30/0277 G06F40/14

    Abstract: A computing device can obtain data describing at least one document, the at least one document referencing at least one media object, wherein a portion of the at least one media object includes one or more characters. The computing device can obtain data describing the one or more characters in the at least one media object in the at least one document. The computing device can generate an updated copy of the at least one document that includes the data describing the one or more characters in the at least one media object. The computing device can present, on a display screen of the computing device and through an interface, the updated copy of the at least one document, wherein the one or more characters in the at least one media object are able to be selected or searched.
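
    The "updated copy" step in this abstract can be pictured with a minimal sketch. It assumes the document is HTML, the media objects are `<img>` elements, and the recognized characters are already available as a mapping from image source to text. The function name `add_text_layer`, the `ocr-text` class, and the sample page are hypothetical and do not come from the patent.

```python
import html
import re

# Matches an <img> tag and captures its src value (simplified; assumes double quotes).
IMG_TAG = re.compile(r'<img\b[^>]*\bsrc="([^"]+)"[^>]*>')

def add_text_layer(document, recognized):
    """Return a copy of `document` with recognized text placed after each image.

    `recognized` maps an image src to the characters found in that image
    (assumed to come from some earlier recognition step).
    """
    def annotate(match):
        text = recognized.get(match.group(1))
        if text is None:
            return match.group(0)  # no text known for this image; leave it unchanged
        # Ordinary markup text can be selected and searched by the browser.
        return f'{match.group(0)}<span class="ocr-text">{html.escape(text)}</span>'
    return IMG_TAG.sub(annotate, document)

page = '<p>Sale</p><img src="banner.png">'
updated = add_text_layer(page, {"banner.png": "50% off & free shipping"})
print(updated)
```

    The original document is left untouched and a new string is returned, which mirrors the abstract's wording of generating an updated copy rather than editing in place.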

    Media processing techniques for enhancing content

    Publication number: US11699174B2

    Publication date: 2023-07-11

    Application number: US17170010

    Filing date: 2021-02-08

    Applicant: A9.com, Inc.

    CPC classification number: G06Q30/0277 G06F40/14

    Abstract: A computing device can obtain data describing at least one document, the at least one document referencing at least one media object, wherein a portion of the at least one media object includes one or more characters. The computing device can obtain data describing the one or more characters in the at least one media object in the at least one document. The computing device can generate an updated copy of the at least one document that includes the data describing the one or more characters in the at least one media object. The computing device can present, on a display screen of the computing device and through an interface, the updated copy of the at least one document, wherein the one or more characters in the at least one media object are able to be selected or searched.

    Image-based character recognition

    Publication number: US09390340B2

    Publication date: 2016-07-12

    Application number: US14721696

    Filing date: 2015-05-26

    Applicant: A9.com, Inc.

    Abstract: Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text to an application executing on the device for performing a function (e.g., calling a number, opening an internet browser, etc.) associated with the recognized text. In at least one embodiment, processing the image includes substantially simultaneously or concurrently processing the image with at least two recognition engines, such as at least two optical character recognition (OCR) engines, running in a multithreaded mode. In at least one embodiment, the recognition engines can be tuned so that their respective processing speeds are roughly the same. Utilizing multiple recognition engines enables processing latency to be close to that of using only one recognition engine.
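
    A minimal sketch of running several recognition engines on the same image in a multithreaded mode follows. The two engine functions are hypothetical stand-ins that return fixed values, and keeping the highest-confidence result is an assumption made for illustration; the abstract does not say how the engines' outputs are combined.

```python
from concurrent.futures import ThreadPoolExecutor

def engine_a(image):
    # Hypothetical stand-in for a real OCR engine; returns (text, confidence).
    return ("555-0100", 0.80)

def engine_b(image):
    # Second hypothetical engine with a different confidence for the same text.
    return ("555-0100", 0.92)

def recognize(image, engines):
    """Run every engine on the same image concurrently and keep the most confident result."""
    with ThreadPoolExecutor(max_workers=len(engines)) as pool:
        futures = [pool.submit(engine, image) for engine in engines]
        results = [future.result() for future in futures]
    # Assumption: pick the result with the highest confidence.
    return max(results, key=lambda result: result[1])

best_text, best_conf = recognize(b"raw image bytes", [engine_a, engine_b])
print(best_text, best_conf)
```

    Because the engines run side by side, the wall-clock time is roughly that of the slowest engine, which is why the abstract stresses tuning them to similar speeds.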

    Video content alignment
    Type: Granted invention patent
    Status: In force

    Publication number: US09275682B1

    Publication date: 2016-03-01

    Application number: US14498818

    Filing date: 2014-09-26

    Applicant: A9.com, Inc.

    Abstract: Various embodiments identify differences between frame sequences of a video. For example, to determine a difference between two versions of a video, a fingerprint of each frame of the two versions is generated. From the fingerprints, a run-length encoded representation of each version is generated. The fingerprints which appear only once (i.e., unique fingerprints) in the entire video are identified from each version and compared to identify matching unique fingerprints across versions. The matching unique fingerprints are sorted and filtered to determine split points, which are used to align the two versions of the video. Accordingly, each version is segmented into smaller frame sequences using the split points. Once segmented, the individual frames of each segment are aligned across versions using a dynamic programming algorithm. After aligning the segments at a frame level, the segments are reassembled to generate a global alignment output.
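
    The pipeline in this abstract can be sketched as follows, assuming the per-frame fingerprints are already computed and comparable for equality. Two choices here are assumptions rather than details from the patent: the filter keeps only split points that do not cross an earlier one, and the dynamic programming step is a longest-common-subsequence alignment. All function names are hypothetical.

```python
from collections import Counter
from itertools import groupby

def run_length_encode(fps):
    """Collapse consecutive identical fingerprints into (fingerprint, run_length) pairs."""
    return [(fp, len(list(group))) for fp, group in groupby(fps)]

def unique_positions(fps):
    """Map each fingerprint that occurs exactly once in the video to its frame index."""
    runs = run_length_encode(fps)
    totals = Counter()
    for fp, length in runs:
        totals[fp] += length
    positions, pos = {}, 0
    for fp, length in runs:
        if totals[fp] == 1:
            positions[fp] = pos
        pos += length
    return positions

def split_points(a, b):
    """Match unique fingerprints across versions, sort them, and drop crossing matches."""
    ua, ub = unique_positions(a), unique_positions(b)
    pairs = sorted((ua[fp], ub[fp]) for fp in ua.keys() & ub.keys())
    kept, last_j = [], -1
    for i, j in pairs:
        if j > last_j:  # assumed filter: keep only order-preserving matches
            kept.append((i, j))
            last_j = j
    return kept

def align_segment(a, b, off_a, off_b):
    """Align two short segments with an LCS dynamic programme; returns matched index pairs."""
    n, m = len(a), len(b)
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = dp[i + 1][j + 1] + 1
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    pairs, i, j = [], 0, 0
    while i < n and j < m:
        if a[i] == b[j]:
            pairs.append((off_a + i, off_b + j))
            i, j = i + 1, j + 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return pairs

def align(a, b):
    """Segment both versions at the split points, align each segment, and reassemble."""
    cuts = split_points(a, b) + [(len(a), len(b))]
    out, prev_i, prev_j = [], 0, 0
    for i, j in cuts:
        out += align_segment(a[prev_i:i], b[prev_j:j], prev_i, prev_j)
        prev_i, prev_j = i, j
    return out

version_a = ["A", "A", "B", "C", "D", "D", "E"]
version_b = ["A", "A", "C", "X", "D", "D", "E"]
alignment = align(version_a, version_b)
print(alignment)
```

    Splitting at unique fingerprints keeps each dynamic programming table small, since the quadratic cost applies per segment rather than to the whole video.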

    Augmented reality recommendations
    Type: Granted invention patent
    Status: In force

    Publication number: US09183583B2

    Publication date: 2015-11-10

    Application number: US13893235

    Filing date: 2013-05-13

    Applicant: A9.com, Inc.

    Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.

    Augmented reality recommendations

    Publication number: US11227326B2

    Publication date: 2022-01-18

    Application number: US16673477

    Filing date: 2019-11-04

    Applicant: A9.com, Inc.

    Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.

    Visual search suggestions
    Type: Granted invention patent

    Publication number: US10540378B1

    Publication date: 2020-01-21

    Application number: US15195445

    Filing date: 2016-06-28

    Applicant: A9.com, Inc.

    Abstract: Approaches provide for analyzing image data to determine and/or recognize text in the image data. The recognized text can be used to generate a search query that can be automatically submitted to a search engine without having to type the search query to identify a product (or related products) associated with the image. For example, a camera of a computing device can be used to capture a live camera view (or single images) of an item. An application executing on the computing device (or at least in communication with the computing device) can analyze the image data of the live camera view to determine a set of keywords (e.g., identified text) based on visual features extracted from the image data. The keywords can be used to query an index of product titles and common search queries, among other indexed data, to return a ranked list of search suggestions based on a relevance function. The relevance function can consider the ordering of the keywords, ranking more highly those search suggestions that contain the keywords in the same word order. Further, the relevance function can consider the confidence of the visual recognition of each keyword, the confidence of each search suggestion, customer impact, as well as other factors to determine the ranking of the search suggestions. The search suggestions can be further refined to ensure search results that the user will be more likely to view and/or purchase.
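
    One way to picture a relevance function that weighs both recognition confidence and keyword order is the sketch below. The scoring formula, the 0.5 order bonus, and the sample data are assumptions made for illustration; the abstract names the factors but not how they are combined. Keywords are assumed to be lowercase and listed in the order they were recognized.

```python
def relevance(keywords, suggestion, order_bonus=0.5):
    """Score a suggestion against recognized keywords.

    keywords: list of (word, confidence) pairs in recognized order.
    suggestion: candidate search suggestion as a string.
    """
    tokens = suggestion.lower().split()
    # Position in the suggestion and confidence of every keyword that appears in it.
    hits = [(tokens.index(word), conf) for word, conf in keywords if word in tokens]
    if not hits:
        return 0.0
    confidence_score = sum(conf for _, conf in hits)
    # Count adjacent keyword pairs that keep their recognized order in the suggestion.
    in_order = sum(1 for (p, _), (q, _) in zip(hits, hits[1:]) if p < q)
    return confidence_score + order_bonus * in_order

def rank(keywords, suggestions):
    """Return suggestions sorted from most to least relevant."""
    return sorted(suggestions, key=lambda s: relevance(keywords, s), reverse=True)

keywords = [("organic", 0.9), ("green", 0.8), ("tea", 0.95)]
suggestions = ["tea green organic", "organic green tea", "black tea"]
ranked = rank(keywords, suggestions)
print(ranked)
```

    Both three-word suggestions contain every keyword, so only the order bonus separates them, which is the behaviour the abstract describes for suggestions with the same word order.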

    NEURAL NETWORK-BASED IMAGE MANIPULATION
    Type: Invention patent application

    Publication number: US20190138851A1

    Publication date: 2019-05-09

    Application number: US16222318

    Filing date: 2018-12-17

    Applicant: A9.com, Inc.

    Abstract: An image creation and editing tool can use the data produced from training a neural network to add stylized representations of an object to an image. An object classification will correspond to an object representation, and pixel values for the object representation can be added to, or blended with, the pixel values of an image in order to add a visualization of a type of object to the image. Such an approach can be used to add stylized representations of objects to existing images or create new images based on those representations. The visualizations can be used to create patterns and textures as well, as may be used to paint or fill various regions of an image. Such patterns can enable regions to be filled where image data has been deleted, such as to remove an undesired object, in a way that appears natural for the contents of the image.
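
    The "added to, or blended with" step can be sketched as a simple weighted blend of pixel values. The sketch assumes grayscale images stored as nested lists of integers and treats the object representation as an already available patch; how a trained network would produce that patch is outside this example. The function name `blend` and the `alpha` parameter are hypothetical.

```python
def blend(image, patch, top, left, alpha=0.5):
    """Blend `patch` into a copy of `image` at (top, left).

    alpha is the weight of the patch: 1.0 replaces the image pixels,
    0.0 leaves them unchanged.
    """
    out = [row[:] for row in image]  # copy so the input image is not modified
    for r, patch_row in enumerate(patch):
        for c, value in enumerate(patch_row):
            y, x = top + r, left + c
            # Skip patch pixels that fall outside the image bounds.
            if 0 <= y < len(out) and 0 <= x < len(out[y]):
                out[y][x] = round((1 - alpha) * out[y][x] + alpha * value)
    return out

image = [[100, 100, 100] for _ in range(3)]
patch = [[200, 200]]
blended = blend(image, patch, top=1, left=1, alpha=0.5)
print(blended)
```

    Setting `alpha=1.0` corresponds to adding the representation outright, while intermediate values give the blended appearance the abstract mentions.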

    Video content alignment
    Type: Granted invention patent

    Publication number: US09984728B2

    Publication date: 2018-05-29

    Application number: US14997351

    Filing date: 2016-01-15

    Applicant: A9.com, Inc.

    Abstract: Various embodiments identify differences between frame sequences of a video. For example, to determine a difference between two versions of a video, a fingerprint of each frame of the two versions is generated. From the fingerprints, a run-length encoded representation of each version is generated. The fingerprints which appear only once (i.e., unique fingerprints) in the entire video are identified from each version and compared to identify matching unique fingerprints across versions. The matching unique fingerprints are sorted and filtered to determine split points, which are used to align the two versions of the video. Accordingly, each version is segmented into smaller frame sequences using the split points. Once segmented, the individual frames of each segment are aligned across versions using a dynamic programming algorithm. After aligning the segments at a frame level, the segments are reassembled to generate a global alignment output.
