Detection of cast members in video content
    12.
    发明授权
    Detection of cast members in video content 有权
    在视频内容中检测演员

    公开(公告)号:US09449216B1

    公开(公告)日:2016-09-20

    申请号:US13860347

    申请日:2013-04-10

    Applicant: A9.com, Inc.

    Abstract: Disclosed are various embodiments for detection of cast members in video content such as movies, television shows, and other programs. Data indicating cast members who appear in a video program is obtained. Each cast member is associated with a reference image depicting a face of the cast member. A frame is obtained from the video program, and a face is detected in the frame. The detected face in the frame is recognized as being a particular cast member based at least in part on the reference image depicting the cast member. An association between the cast member and the frame is generated in response to the detected face in the frame being recognized as the cast member.

    Abstract translation: 公开了用于在诸如电影,电视节目和其他节目的视频内容中检测演员的各种实施例。 获取表示出现在视频节目中的演员的数据。 每个铸造构件与描绘铸件的表面的参考图像相关联。 从视频节目获得帧,并且在帧中检测到一个脸部。 至少部分地基于描绘铸件的参考图像将框架中检测到的面部识别为特定铸件。 响应于被识别为铸件的框架中检测到的面而产生铸造构件和框架之间的关联。

    Collaborative text detection and recognition
    13.
    发明授权
    Collaborative text detection and recognition 有权
    协同文本检测和识别

    公开(公告)号:US09098888B1

    公开(公告)日:2015-08-04

    申请号:US14105028

    申请日:2013-12-12

    Applicant: A9.com, Inc.

    Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.

    Abstract translation: 各种实施例提供了通过在文本检测中应用合适的文本检测参数来识别图像中的文本的方法和系统。 可以基于来自一个或多个文本识别子任务的参数度量反馈(例如文本检测,文本识别,预处理,字符集映射,模式匹配和验证)来确定合适的文本检测参数。 在一些实施例中,可以通过对图像执行字形检测来将图像定义为一个或多个图像区域。 可以基于相应图像区域中测量的一个或多个参数度量来调整应用于一个或多个图像区域中的每一个的文本检测参数。

    Crowdsourced audio normalization for presenting media content

    公开(公告)号:US10466955B1

    公开(公告)日:2019-11-05

    申请号:US14313773

    申请日:2014-06-24

    Applicant: A9.com, Inc.

    Abstract: Various embodiments provide methods and systems for providing a recommended volume level in presentation of media content. In some embodiments, volume adjustment events made by a user and/or similar users while watching media content can be detected and automatically recorded. The media content may include a plurality of segments. A normalized volume level for at least one segment of the media content can be determined by aggregating the recorded volume adjustment events corresponding to the at least one segment of the media content. When the media content is played back on a user device, at least some embodiments cause the at least one segment of the media content to be played back at a recommended volume level determined based at least in part upon one of the normalized audio level of the corresponding segment, the audio system of the user device, or historical data and personal profile of the user.

    Attribute similarity-based search
    16.
    发明授权

    公开(公告)号:US10043109B1

    公开(公告)日:2018-08-07

    申请号:US15413083

    申请日:2017-01-23

    Applicant: A9.com, Inc.

    Abstract: A set of training images is obtained by analyzing text associated with various images to identify images likely demonstrating a visual attribute. Localization can be used to extract patches corresponding to these attributes, which can then have features or feature vectors determined to train, for example, a convolutional neural network. A query image can be received and analyzed using the trained network to determine a set of items whose images demonstrate visual similarity to the query image at least with respect to the attribute of interest. The similarity can be output from the network or determined using distances in attribute space. Content for at least a determined number of highest ranked, or most similar, items can then be provided in response to the query image.

    Augmented reality Camera Lucida
    17.
    发明授权

    公开(公告)号:US09965895B1

    公开(公告)日:2018-05-08

    申请号:US14221104

    申请日:2014-03-20

    Applicant: A9.com, Inc.

    Abstract: Approaches are described for enabling a user to create an accurate perspective rendering of a source (e.g., a scene, object, subject, point of interest, etc.) on a drawing surface. For example, various approaches enable superimposition of the source being viewed upon a drawing surface upon which a user is drawing. In this way, the user can view both the source and drawing surface simultaneously. This allows the user to duplicate key points of the source on the drawing surface by viewing a display of a device, thus aiding in the accurate rendering of perspective.

Patent Agency Ranking