-
公开(公告)号:US10534965B2
公开(公告)日:2020-01-14
申请号:US15926745
申请日:2018-03-20
Applicant: Amazon Technologies, Inc.
Inventor: Nitin Singhal , Vivek Bhadauria , Ranju Das , Gaurav D. Ghare , Roman Goldenberg , Stephen Gould , Kuang Han , Jonathan Andrew Hedley , Gowtham Jeyabalan , Vasant Manohar , Andrea Olgiati , Stefano Stefani , Joseph Patrick Tighe , Praveen Kumar Udayakumar , Renjun Zheng
Abstract: Techniques for analyzing stored video upon a request are described. For example, a method of receiving a first application programming interface (API) request to analyze a stored video, the API request to include a location of the stored video and at least one analysis action to perform on the stored video; accessing the location of the stored video to retrieve the stored video; segmenting the accessed video into chunks; processing each chunk with a chunk processor to perform the at least one analysis action, each chunk processor to utilize at least one machine learning model in performing the at least one analysis action; joining the results of the processing of each chunk to generate a final result; storing the final result; and providing the final result to a requestor in response to a second API request is described.
-
公开(公告)号:US09349202B1
公开(公告)日:2016-05-24
申请号:US13632864
申请日:2012-10-01
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: Vasant Manohar , Viswanath Sankaranarayanan
CPC classification number: G06T11/60 , G06K9/00463 , G06K9/481 , G06K9/6828
Abstract: A method of generating a reflowable content file from a physical text source is described. An image of the physical text source is segmented into a plurality of glyphs and a character and font is determined for each of the glyphs. The font for each of the plurality of glyphs is determined based on two or more of the glyphs.
Abstract translation: 描述从物理文本源生成可回流内容文件的方法。 物理文本源的图像被分割成多个字形,并且为每个字形确定字符和字体。 基于两个或更多个字形来确定多个字形中的每一个的字体。
-
公开(公告)号:US10761893B1
公开(公告)日:2020-09-01
申请号:US16199014
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Vivek Bhadauria , Praveenkumar Udayakumar , Jonathan Andrew Hedley , Vasant Manohar , Andrea Olgiati , Rakesh Madhavan Nambiar , Gowtham Jeyabalan , Shubham Chandra Gupta , Palak Mehta
Abstract: Techniques are described for automatically scaling (or “auto scaling”) compute resources—for example, virtual machine (VM) instances, containers, or standalone servers—used to support execution of service-oriented software applications and other types of applications that may process heterogeneous workloads. The resource requirements for a software application can be approximated by measuring “worker pool” utilization of instances of each service, where a worker pool represents a number of requests that the service can process concurrently. A scaling service can thus be configured to scale the compute instances provisioned for a service in proportion to worker pool utilization, that is, compute instances can be added as the fleet's worker pools become more “busy,” while compute instances can be removed when worker pools become inactive.
-
公开(公告)号:US09191554B1
公开(公告)日:2015-11-17
申请号:US13677096
申请日:2012-11-14
Applicant: Amazon Technologies, Inc.
Inventor: Vasant Manohar , Sridhar Godavarthy , Viswanath Sankaranarayanan
CPC classification number: H04N1/00198 , G06F17/21 , H04N5/14
Abstract: Some implementations include using a trained classifier to identify page-turn events in a video. The video may be divided into multiple segments based on the page-turn events, with each segment of the multiple segments corresponding to a pair of adjacent pages in a book. Exemplar frames that provide non-redundant data compared to other frames may be chosen from each segment. The exemplar frames may be cropped to include content portions of pages. The exemplar frames may be aligned such that a pixel is located in a same position in each frame. Optical character recognition (OCR) may be performed on exemplar frames and the OCR for exemplar frames in each segment may be combined. The exemplar frames in each segment may be combined to create a composite image for each pair of adjacent pages in the book, and OCR may be performed on the composite image.
Abstract translation: 一些实现包括使用经过训练的分类器来识别视频中的翻页事件。 视频可以基于翻页事件被划分成多个片段,多个片段的每个片段对应于书中的一对相邻页面。 可以从每个段选择与其他帧相比提供非冗余数据的示例帧。 可以裁剪示例帧以包括页面的内容部分。 示例性帧可以对准,使得像素位于每个帧中的相同位置。 可以在示例性帧上执行光学字符识别(OCR),并且可以组合每个段中的示例帧的OCR。 每个段中的示例帧可以被组合以为书中的每对相邻页创建合成图像,并且可以在合成图像上执行OCR。
-
5.
公开(公告)号:US20220139063A1
公开(公告)日:2022-05-05
申请号:US17525413
申请日:2021-11-12
Applicant: Amazon Technologies, Inc.
Inventor: Kunwar Yashraj Singh , Keith Young Johnson , Vivek Bhadauria , Sean R. Flynn , Binglei Du , Dylan C. Thomas , Vasant Manohar , Jonathan Hedley , Wei Xia
Abstract: Objects detected in data may be filtered from an object recognition index. Data for object detection may be received. An object detection technique may be applied to the data to detect an object. If the object does not satisfy indexing criteria for the object recognition index, then the detected object may be excluded from the object recognition index.
-
公开(公告)号:US11126854B1
公开(公告)日:2021-09-21
申请号:US15612651
申请日:2017-06-02
Applicant: Amazon Technologies, Inc.
Inventor: Andrea Olgiati , Nitin Singhal , Yuri Natanzon , Vasant Manohar , Davide Modolo
Abstract: Technologies are disclosed for efficiently identifying objects in videos using deep neural networks and motion information. Using the disclosed technologies, the amount of time required to identify objects in videos can be greatly reduced. Motion information for a video, such as motion vectors, are extracted during the encoding or decoding of the video. The motion information is used to determine whether there is sufficient motion between frames of the video to warrant performing object detection on the frames. If there is insufficient movement from one frame to a subsequent frame, the subsequent frame will not be processed to identify objects contained therein. In this way, object detection will not be performed on video frames that have changed minimally as compared to a previous frame, thereby reducing the amount of time and the number of processing operations required to identify the objects in the video.
-
公开(公告)号:US10242277B1
公开(公告)日:2019-03-26
申请号:US14794351
申请日:2015-07-08
Applicant: Amazon Technologies, Inc.
Inventor: Vasant Manohar , Janarthanan Lakshmipathy
Abstract: Devices, systems and methods are disclosed for validating an electronic publication and determining a source of identified errors in a rendering of the electronic publication. The rendering may be captured as a rendered image and rendered data may be extracted from the rendering. The rendered data may be compared to actual input data to the renderer used to generate the rendered image. If errors are visible in the rendering, a source of the errors may be identified based on the comparison between the extracted rendered data to the actual input data. If errors are not visible in the rendering, the rendering may be validated.
-
公开(公告)号:US10095677B1
公开(公告)日:2018-10-09
申请号:US14316704
申请日:2014-06-26
Applicant: Amazon Technologies, Inc.
Inventor: Vasant Manohar , Eric Allen Menninga , Ashley Alonzo Ricardo Karl Mitchell , Joseph King , Mugunthan Govindaraju
Abstract: Disclosed are techniques and systems to detect a layout of a source document. A process may include receiving content from a first page and a second page of the source document, designating sections in each page along a first direction of the page, and assigning similar sections to a group. For the group, the process may proceed by dividing sections for each page into discrete portions associated with 2D coordinate areas, and identifying sets of 2D coordinate areas for the discrete portions that contain content. The number of times each portion contains some content may be compared to a threshold to determine a layout of the group of sections.
-
公开(公告)号:US11610143B1
公开(公告)日:2023-03-21
申请号:US16915744
申请日:2020-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Vivek Bhadauria , Vasant Manohar , Anand Dhandhania
Abstract: A network-based service may provide a machine learning model for different clients. The network-based service may implement an interface that allows a client to identify a test data set for validating versions of the machine learning model specifically for the client. When a new version of the machine learning model is created, a validation test using the test data set identified by the client may be used. Results of the validation test may be used to make a decision regard whether to migrate workloads for the client to the new version of the machine learning model.
-
10.
公开(公告)号:US11176403B1
公开(公告)日:2021-11-16
申请号:US16183365
申请日:2018-11-07
Applicant: Amazon Technologies, Inc.
Inventor: Kunwar Yashraj Singh , Keith Young Johnson , Vivek Bhadauria , Sean R. Flynn , Binglei Du , Dylan C. Thomas , Vasant Manohar , Jonathan Hedley , Wei Xia
Abstract: Objects detected in data may be filtered from an object recognition index. Data for object detection may be received. An object detection technique may be applied to the data to detect an object. If the object does not satisfy indexing criteria for the object recognition index, then the detected object may be excluded from the object recognition index.
-
-
-
-
-
-
-
-
-