-
Publication number: US11886489B2
Publication date: 2024-01-30
Application number: US18189776
Filing date: 2023-03-24
Applicant: Google LLC
Inventor: David Petrou , Matthew Bridges , Shailesh Nalawadi , Hartwig Adam , Matthew R. Casey , Hartmut Neven , Andrew Harp
IPC: G06F16/583 , G06F16/535 , G06F16/50 , G06F16/9535 , G06V10/10 , G06V10/56 , G06V10/96 , G06V20/20 , G06V20/62 , G06V30/142 , G06F18/2413 , H04N23/00 , G06V30/19 , G06F16/9538 , G06F3/048
CPC classification number: G06F16/535 , G06F3/048 , G06F16/50 , G06F16/5838 , G06F16/5846 , G06F16/9535 , G06F16/9538 , G06F18/24133 , G06V10/10 , G06V10/56 , G06V10/96 , G06V20/20 , G06V20/63 , G06V30/142 , G06V30/19173 , H04N23/00
Abstract: A system and method of identifying objects is provided. In one aspect, the system and method includes a hand-held device with a display, camera and processor. As the camera captures images and displays them on the display, the processor compares the information retrieved in connection with one image with information retrieved in connection with subsequent images. The processor uses the result of such comparison to determine the object that is likely to be of greatest interest to the user. The display simultaneously displays the images as they are captured, the location of the object in an image, and information retrieved for the object.
-
Publication number: US11074504B2
Publication date: 2021-07-27
Application number: US16611604
Filing date: 2018-11-14
Applicant: Google LLC
Inventor: Liang-Chieh Chen , Alexander Hermans , Georgios Papandreou , Gerhard Florian Schroff , Peng Wang , Hartwig Adam
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for instance segmentation. In one aspect, a system generates: (i) data identifying one or more regions of the image, wherein an object is depicted in each region, (ii) for each region, a predicted type of object that is depicted in the region, and (iii) feature channels comprising a plurality of semantic channels and one or more direction channels. The system generates a region descriptor for each of the one or more regions, and provides the region descriptor for each of the one or more regions to a segmentation neural network that processes a region descriptor for a region to generate a predicted segmentation of the predicted type of object depicted in the region.
-
Publication number: US20210081796A1
Publication date: 2021-03-18
Application number: US17107745
Filing date: 2020-11-30
Applicant: Google LLC
Inventor: Barret Zoph , Jonathon Shlens , Yukun Zhu , Maxwell Donald Collins , Liang-Chieh Chen , Gerhard Florian Schroff , Hartwig Adam , Georgios Papandreou
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.
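The search described in this abstract keeps one backbone fixed and varies only the dense prediction cell. The sketch below is an illustration under stated assumptions, not the claimed method: the abstract does not name a search strategy, so plain random sampling is swapped in here, and the cell encoding (`Cell` as a tuple of integers), the `evaluate` callback, and the function name are all hypothetical.

```python
import random
from typing import Callable, List, Sequence, Tuple

# Hypothetical encoding: a cell is a tuple of integers (e.g. dilation rates).
Cell = Tuple[int, ...]


def search_dense_prediction_cells(
    candidates: Sequence[Cell],
    evaluate: Callable[[Cell], float],
    num_trials: int,
    top_k: int = 1,
    seed: int = 0,
) -> List[Tuple[float, Cell]]:
    """Sample cells, score each with the shared backbone, and keep the best."""
    rng = random.Random(seed)
    sampled = rng.sample(list(candidates), k=min(num_trials, len(candidates)))
    # `evaluate` stands in for training and validating backbone + cell
    # on the dense image prediction task.
    scored = [(evaluate(cell), cell) for cell in sampled]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Toy scoring function used only to make the sketch runnable.
cells: List[Cell] = [(1,), (1, 6), (6, 12, 18)]
best = search_dense_prediction_cells(cells, lambda c: float(sum(c)), num_trials=3, top_k=2)
print(best)
```

Because every candidate shares the same backbone, `evaluate` only has to differ in the cell it attaches, which is what makes comparing candidates on the same training data meaningful.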
-
Publication number: US10853726B2
Publication date: 2020-12-01
Application number: US16425900
Filing date: 2019-05-29
Applicant: Google LLC
Inventor: Barret Zoph , Jonathon Shlens , Yukun Zhu , Maxwell Donald Emmet Collins , Liang-Chieh Chen , Gerhard Florian Schroff , Hartwig Adam , Georgios Papandreou
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.
-
Publication number: US20200175375A1
Publication date: 2020-06-04
Application number: US16611604
Filing date: 2018-11-14
Applicant: Google LLC
Inventor: Liang-Chieh Chen , Alexander Hermans , Georgios Papandreou , Gerhard Florian Schroff , Peng Wang , Hartwig Adam
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for instance segmentation. In one aspect, a system generates: (i) data identifying one or more regions of the image, wherein an object is depicted in each region, (ii) for each region, a predicted type of object that is depicted in the region, and (iii) feature channels comprising a plurality of semantic channels and one or more direction channels. The system generates a region descriptor for each of the one or more regions, and provides the region descriptor for each of the one or more regions to a segmentation neural network that processes a region descriptor for a region to generate a predicted segmentation of the predicted type of object depicted in the region.
-
Publication number: US20200151211A1
Publication date: 2020-05-14
Application number: US16744998
Filing date: 2020-01-16
Applicant: Google LLC
Inventor: David Petrou , Matthew J. Bridges , Shailesh Nalawadi , Hartwig Adam , Matthew R. Casey , Hartmut Neven , Andrew Harp
IPC: G06F16/583 , G06F3/048 , G06K9/62 , G06K9/46 , G06K9/32 , G06K9/22 , G06K9/00 , H04N5/225 , G06K9/78 , G06F16/9535 , G06F16/50
Abstract: A system and method of identifying objects is provided. In one aspect, the system and method includes a hand-held device with a display, camera and processor. As the camera captures images and displays them on the display, the processor compares the information retrieved in connection with one image with information retrieved in connection with subsequent images. The processor uses the result of such comparison to determine the object that is likely to be of greatest interest to the user. The display simultaneously displays the images as they are captured, the location of the object in an image, and information retrieved for the object.
-
Publication number: US10515114B2
Publication date: 2019-12-24
Application number: US16030316
Filing date: 2018-07-09
Applicant: Google LLC
Inventor: David Petrou , Andrew Rabinovich , Hartwig Adam
IPC: G06K9/00 , G06F16/58 , G06F16/532 , G06F16/583 , G06F16/9535 , G06F16/2457 , A42B1/00
Abstract: A facial recognition search system identifies one or more likely names (or other personal identifiers) corresponding to the facial image(s) in a query as follows. After receiving the visual query with one or more facial images, the system identifies images that potentially match the respective facial image in accordance with visual similarity criteria. Then one or more persons associated with the potential images are identified. For each identified person, person-specific data comprising metrics of social connectivity to the requester are retrieved from a plurality of applications such as communications applications, social networking applications, calendar applications, and collaborative applications. An ordered list of persons is then generated by ranking the identified persons in accordance with at least metrics of visual similarity between the respective facial image and the potential image matches and with the social connection metrics. Finally, at least one person identifier from the list is sent to the requester.
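The ranking step in this abstract combines visual similarity with social connection metrics. As a rough sketch only: the abstract does not state how the two are combined, so the linear weighting below, the weights themselves, and the `Candidate` fields are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Candidate:
    person_id: str            # hypothetical identifier sent back to the requester
    visual_similarity: float  # assumed to lie in [0, 1]
    social_connection: float  # assumed to lie in [0, 1]


def rank_candidates(
    candidates: Sequence[Candidate],
    visual_weight: float = 0.7,
    social_weight: float = 0.3,
) -> List[Candidate]:
    """Order candidates by a weighted sum of the two metrics (assumed form)."""
    def score(c: Candidate) -> float:
        return visual_weight * c.visual_similarity + social_weight * c.social_connection

    return sorted(candidates, key=score, reverse=True)


people = [
    Candidate("A", visual_similarity=0.9, social_connection=0.1),
    Candidate("B", visual_similarity=0.8, social_connection=0.9),
    Candidate("C", visual_similarity=0.5, social_connection=0.2),
]
print([c.person_id for c in rank_candidates(people)])
```

With these weights a close social connection can lift a slightly weaker visual match above a stronger one, which is the effect the abstract attributes to including social connection metrics in the ranking.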
-
Publication number: US20190146993A1
Publication date: 2019-05-16
Application number: US16243660
Filing date: 2019-01-09
Applicant: Google LLC
Inventor: David Petrou , Matthew J. Bridges , Shailesh Nalawadi , Hartwig Adam , Matthew R. Casey , Hartmut Neven , Andrew Harp
IPC: G06F16/583 , G06K9/46 , G06F16/9535 , H04N5/225 , G06K9/78 , G06K9/62 , G06K9/32 , G06K9/22 , G06K9/00 , G06F3/048 , G06F16/50
CPC classification number: G06F16/5838 , G06F3/048 , G06F16/50 , G06F16/5846 , G06F16/9535 , G06K9/00671 , G06K9/00993 , G06K9/228 , G06K9/3258 , G06K9/4652 , G06K9/6271 , G06K9/78 , H04N5/225
Abstract: A system and method of identifying objects is provided. In one aspect, the system and method includes a hand-held device with a display, camera and processor. As the camera captures images and displays them on the display, the processor compares the information retrieved in connection with one image with information retrieved in connection with subsequent images. The processor uses the result of such comparison to determine the object that is likely to be of greatest interest to the user. The display simultaneously displays the images as they are captured, the location of the object in an image, and information retrieved for the object.
-
Publication number: US20240371189A1
Publication date: 2024-11-07
Application number: US18775932
Filing date: 2024-07-17
Applicant: Google LLC
Inventor: Matthew J. Bridges , Alessandro Fin , Hartwig Adam , Jeffrey M. Gilbert
IPC: G06V30/413 , G06F3/04842 , G06F18/20 , G06F18/24 , G06V10/98 , G06V20/20 , G06V30/19 , H04N5/445 , H04N5/765 , H04N23/62 , H04N23/68
Abstract: A computing system includes one or more memory devices to store instructions; and one or more processors to execute the instructions to perform operations. The operations include: receiving a plurality of images captured by a camera; selecting a portion of the plurality of images having a quality rating above a threshold level; processing, by a coarse classifier, a first image among the portion of the plurality of images to determine whether the first image depicts at least one object from one or more particular classes of objects; in response to determining the first image depicts the at least one object from the one or more particular classes of objects, performing an object recognition process to recognize the at least one object; and presenting content related to the at least one object recognized via the object recognition process.
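The operations listed in this abstract form a gated pipeline: filter by quality, run a cheap coarse classifier, and only then run full object recognition. The sketch below mirrors that order under stated assumptions; the callbacks `quality_of`, `coarse_classifier`, and `recognizer`, the frame dictionaries, and the function name are hypothetical stand-ins, and "first image" is taken to mean the first frame that passes the quality filter.

```python
from typing import Any, Callable, Optional, Sequence


def recognize_from_burst(
    frames: Sequence[Any],
    quality_of: Callable[[Any], float],
    coarse_classifier: Callable[[Any], bool],
    recognizer: Callable[[Any], str],
    quality_threshold: float,
) -> Optional[str]:
    """Return content for the first good-quality frame, or None if gated out."""
    # Step 1: keep only frames whose quality rating is above the threshold.
    selected = [f for f in frames if quality_of(f) > quality_threshold]
    if not selected:
        return None
    first = selected[0]
    # Step 2: the coarse classifier decides whether recognition is worth running.
    if not coarse_classifier(first):
        return None
    # Step 3: full recognition runs only after both gates have passed.
    return recognizer(first)


# Hypothetical frames with precomputed quality and a coarse "has_text" flag.
burst = [
    {"id": 1, "quality": 0.2, "has_text": True},
    {"id": 2, "quality": 0.8, "has_text": True},
]
print(recognize_from_burst(
    burst,
    quality_of=lambda f: f["quality"],
    coarse_classifier=lambda f: f["has_text"],
    recognizer=lambda f: "content for frame %d" % f["id"],
    quality_threshold=0.5,
))
```

Ordering the checks from cheapest to most expensive means the recognizer is never invoked on blurry frames or on frames the coarse classifier rejects.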
-
Publication number: US20240202232A1
Publication date: 2024-06-20
Application number: US18592132
Filing date: 2024-02-29
Applicant: Google LLC
Inventor: David Karam , Li Zhang , Ariel Gilder , Yuzo Watanabe , Eric Penner , Farooq Ahmad , Hartwig Adam
IPC: G06F16/583 , G06F16/535 , G06F16/55 , G06N20/00
CPC classification number: G06F16/583 , G06F16/535 , G06F16/55 , G06N20/00
Abstract: The present disclosure is directed to processing imagery using one or more machine learning (ML) models. In particular, data describing imagery comprising a plurality of different and distinct frames can be received; and based at least in part on one or more ML models and the data describing the imagery, and for each frame of the plurality of different and distinct frames, one or more scores can be determined for the frame. Each score of the score(s) can indicate a determined measure of suitability of the frame with respect to one or more of various different and distinct uses for which the ML model(s) are configured to determine suitability of imagery.
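The per-frame, per-use scoring in this abstract can be pictured as a table with one row per frame and one column per use. In the minimal sketch below, plain Python callables stand in for the ML models, and the use names, frame fields, and the `best_frame_per_use` helper (selection is not mentioned in the abstract) are assumptions added for illustration.

```python
from typing import Any, Callable, Dict, List, Mapping, Sequence


def score_frames(
    frames: Sequence[Any],
    scorers: Mapping[str, Callable[[Any], float]],
) -> List[Dict[str, float]]:
    """Return one {use: suitability score} mapping per frame."""
    return [{use: scorer(frame) for use, scorer in scorers.items()} for frame in frames]


def best_frame_per_use(scores: Sequence[Mapping[str, float]]) -> Dict[str, int]:
    """Hypothetical helper: index of the highest-scoring frame for each use."""
    uses = scores[0].keys() if scores else []
    return {use: max(range(len(scores)), key=lambda i: scores[i][use]) for use in uses}


# Toy frames and scorers; real scorers would be ML model outputs.
frames = [{"sharpness": 0.2, "faces": 1}, {"sharpness": 0.9, "faces": 0}]
scorers = {
    "thumbnail": lambda f: f["sharpness"],
    "portrait": lambda f: f["sharpness"] * min(f["faces"], 1),
}
scores = score_frames(frames, scorers)
print(scores)
print(best_frame_per_use(scores))
```

Keeping a separate score per use lets the same frame rank high for one purpose and low for another, which matches the abstract's "different and distinct uses".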