-
公开(公告)号:US08811656B2
公开(公告)日:2014-08-19
申请号:US14019670
申请日:2013-09-06
Applicant: Google Inc.
Inventor: Shlomo Urbach , Tal Yadid , Yuval Netzer , Andrea Frome , Noam Ben-Haim
IPC: G06K9/00
CPC classification number: G06K9/723 , G06F17/241 , G06F17/2705 , G06F17/30241 , G06K9/00664 , G06K2209/01
Abstract: Establishments are identified in geo-tagged images. According to one aspect, text regions are located in a geo-tagged image and text strings in the text regions are recognized using Optical Character Recognition (OCR) techniques. Text phrases are extracted from information associated with establishments known to be near the geographic location specified in the geo-tag of the image. The text strings recognized in the image are compared with the phrases for the establishments for approximate matches, and an establishment is selected as the establishment in the image based on the approximate matches. According to another aspect, text strings recognized in a collection of geo-tagged images are compared with phrases for establishments in the geographic area identified by the geo-tags to generate scores for image-establishment pairs. Establishments in each of the large collection of images as well as representative images showing each establishment are identified using the scores.
Abstract translation: 在地理标签图像中标识企业。 根据一个方面,文本区域位于地理标记的图像中,并且使用光学字符识别(OCR)技术识别文本区域中的文本串。 从与已知在图像的地理标签中指定的地理位置附近的企业相关联的信息中提取文本短语。 将图像中识别的文本字符串与用于近似匹配的场所的短语进行比较,并且基于近似匹配来选择企业作为图像中的建立。 根据另一方面,将在地理标签图像的集合中识别的文本字符串与由地理标签识别的地理区域中的企业的短语进行比较,以生成图像建立对的分数。 使用分数来确定每个大图像集合中的各个场所以及显示每个机构的代表性图像。
-
2.
公开(公告)号:US09057618B1
公开(公告)日:2015-06-16
申请号:US14034746
申请日:2013-09-24
Applicant: Google Inc.
Inventor: Abhijit S. Ogale , Stephane Lafon , Andrea Frome
CPC classification number: G01C21/34 , G01C21/26 , G01C21/32 , G01C21/3476 , G01C21/3602 , G01S5/16 , G01S17/08 , G01S19/51
Abstract: Systems and methods provide approximations of latitude and longitude coordinates of objects, for example a business, in street level images. The images may be collected by a camera. An image of a business is collected along with GPS coordinates and direction of the camera. Depth maps of the images may be generated, for example, based on laser depth detection or displacement of the business between two images caused by a change in the position of the camera. After identifying a business in one or more images, the distance from the camera to a point or area relative to the business in the one or more images may be determined based on the depth maps. Using this distance and the direction of the camera which collected the one or more images and GPS coordinates of the camera, the approximate GPS coordinates of the business may be determined.
Abstract translation: 系统和方法提供了街道级图像中对象(例如商家)的纬度和经度坐标的近似。 图像可以由相机收集。 收集企业的图像以及相机的GPS坐标和方向。 图像的深度图可以例如基于激光深度检测或由相机的位置的改变引起的两个图像之间的业务的移位而产生。 在识别一个或多个图像中的业务之后,可以基于深度图确定从一个或多个图像中的摄像机到相对于业务的点或区域的距离。 使用该距离和收集摄像机的一个或多个图像和GPS坐标的摄像机的方向,可以确定业务的近似GPS坐标。
-
公开(公告)号:US20140286573A1
公开(公告)日:2014-09-25
申请号:US14296781
申请日:2014-06-05
Applicant: Google Inc.
Inventor: Bo Wu , Alessandro Bissacco , Raymond W. Smith , Kong Man Cheung , Andrea Frome , Shlomo Urbach
IPC: G06K9/18
CPC classification number: G06K9/3258 , G06K9/00 , G06K9/18 , G06K2209/01 , G06Q50/10
Abstract: A system and method is provided for automatically recognizing building numbers in street level images. In one aspect, a processor selects a street level image that is likely to be near an address of interest. The processor identifies those portions of the image that are visually similar to street numbers, and then extracts the numeric values of the characters displayed in such portions. If an extracted value corresponds with the building number of the address of interest such as being substantially equal to the address of interest, the extracted value and the image portion are displayed to a human operator. The human operator confirms, by looking at the image portion, whether the image portion appears to be a building number that matches the extracted value. If so, the processor stores a value that associates that building number with the street level image.
Abstract translation: 提供了一种用于自动识别街道图像中的建筑物编号的系统和方法。 在一个方面,处理器选择可能靠近感兴趣的地址的街道级图像。 处理器识别图像中与街道号码视觉相似的那些部分,然后提取在这些部分中显示的字符的数值。 如果提取的值对应于感兴趣的地址的建筑物号码,例如基本上等于感兴趣的地址,则提取的值和图像部分被显示给人类操作者。 人类操作者通过观察图像部分来确认图像部分是否看起来是与提取的值相匹配的建筑物号码。 如果是这样,处理器存储将建筑物号码与街道图像相关联的值。
-
公开(公告)号:US20140003650A1
公开(公告)日:2014-01-02
申请号:US14019670
申请日:2013-09-06
Applicant: Google Inc.
Inventor: Shlomo Urbach , Tal Yadid , Yuval Netzer , Andrea Frome , Noam Ben-Haim
CPC classification number: G06K9/723 , G06F17/241 , G06F17/2705 , G06F17/30241 , G06K9/00664 , G06K2209/01
Abstract: Establishments are identified in geo-tagged images. According to one aspect, text regions are located in a geo-tagged image and text strings in the text regions are recognized using Optical Character Recognition (OCR) techniques. Text phrases are extracted from information associated with establishments known to be near the geographic location specified in the geo-tag of the image. The text strings recognized in the image are compared with the phrases for the establishments for approximate matches, and an establishment is selected as the establishment in the image based on the approximate matches. According to another aspect, text strings recognized in a collection of geo-tagged images are compared with phrases for establishments in the geographic area identified by the geo-tags to generate scores for image-establishment pairs. Establishments in each of the large collection of images as well as representative images showing each establishment are identified using the scores.
Abstract translation: 在地理标签图像中标识企业。 根据一个方面,文本区域位于地理标记的图像中,并且使用光学字符识别(OCR)技术识别文本区域中的文本串。 从与已知在图像的地理标签中指定的地理位置附近的企业相关联的信息中提取文本短语。 将图像中识别的文本字符串与用于近似匹配的场所的短语进行比较,并且基于近似匹配来选择企业作为图像中的建立。 根据另一方面,将在地理标签图像的集合中识别的文本字符串与由地理标签识别的地理区域中的企业的短语进行比较,以生成图像建立对的分数。 使用分数来确定每个大图像集合中的各个场所以及显示每个机构的代表性图像。
-
公开(公告)号:US09436886B2
公开(公告)日:2016-09-06
申请号:US14660141
申请日:2015-03-17
Applicant: Google Inc.
Inventor: Bo Wu , Alessandro Bissacco , Raymond W. Smith , Kong Man Cheung , Andrea Frome , Shlomo Urbach
CPC classification number: G06K9/3258 , G06K9/00 , G06K9/18 , G06K2209/01 , G06Q50/10
Abstract: A system and method is provided for automatically recognizing building numbers in street level images. In one aspect, a processor selects a street level image that is likely to be near an address of interest. The processor identifies those portions of the image that are visually similar to street numbers, and then extracts the numeric values of the characters displayed in such portions. If an extracted value corresponds with the building number of the address of interest such as being substantially equal to the address of interest, the extracted value and the image portion are displayed to a human operator. The human operator confirms, by looking at the image portion, whether the image portion appears to be a building number that matches the extracted value. If so, the processor stores a value that associates that building number with the street level image.
-
公开(公告)号:US20160188991A1
公开(公告)日:2016-06-30
申请号:US14660141
申请日:2015-03-17
Applicant: Google Inc.
Inventor: Bo Wu , Alessandro Bissacco , Raymond W. Smith , Kong Man Cheung , Andrea Frome , Shlomo Urbach
CPC classification number: G06K9/3258 , G06K9/00 , G06K9/18 , G06K2209/01 , G06Q50/10
Abstract: A system and method is provided for automatically recognizing building numbers in street level images. In one aspect, a processor selects a street level image that is likely to be near an address of interest. The processor identifies those portions of the image that are visually similar to street numbers, and then extracts the numeric values of the characters displayed in such portions. If an extracted value corresponds with the building number of the address of interest such as being substantially equal to the address of interest, the extracted value and the image portion are displayed to a human operator. The human operator confirms, by looking at the image portion, whether the image portion appears to be a building number that matches the extracted value. If so, the processor stores a value that associates that building number with the street level image.
Abstract translation: 提供了一种用于自动识别街道图像中的建筑物编号的系统和方法。 在一个方面,处理器选择可能靠近感兴趣的地址的街道级图像。 处理器识别图像中与街道号码视觉相似的那些部分,然后提取在这些部分中显示的字符的数值。 如果提取的值对应于感兴趣的地址的建筑物号码,例如基本上等于感兴趣的地址,则提取的值和图像部分被显示给人类操作者。 人类操作者通过观察图像部分来确认图像部分是否看起来是与提取的值相匹配的建筑物号码。 如果是这样,处理器存储将建筑物号码与街道图像相关联的值。
-
7.
公开(公告)号:US20150153188A1
公开(公告)日:2015-06-04
申请号:US14034746
申请日:2013-09-24
Applicant: Google Inc.
Inventor: Abhijit S. Ogale , Stephane Lafon , Andrea Frome
IPC: G01C21/34
CPC classification number: G01C21/34 , G01C21/26 , G01C21/32 , G01C21/3476 , G01C21/3602 , G01S5/16 , G01S17/08 , G01S19/51
Abstract: Systems and methods provide approximations of latitude and longitude coordinates of objects, for example a business, in street level images. The images may be collected by a camera. An image of a business is collected along with GPS coordinates and direction of the camera. Depth maps of the images may be generated, for example, based on laser depth detection or displacement of the business between two images caused by a change in the position of the camera. After identifying a business in one or more images, the distance from the camera to a point or area relative to the business in the one or more images may be determined based on the depth maps. Using this distance and the direction of the camera which collected the one or more images and GPS coordinates of the camera, the approximate GPS coordinates of the business may be determined.
Abstract translation: 系统和方法提供了街道级图像中对象(例如商家)的纬度和经度坐标的近似。 图像可以由相机收集。 收集企业的图像以及相机的GPS坐标和方向。 图像的深度图可以例如基于激光深度检测或由相机的位置的改变引起的两个图像之间的业务的移位而产生。 在识别一个或多个图像中的业务之后,可以基于深度图确定从一个或多个图像中的摄像机到相对于业务的点或区域的距离。 使用该距离和收集摄像机的一个或多个图像和GPS坐标的摄像机的方向,可以确定业务的近似GPS坐标。
-
公开(公告)号:US09020265B2
公开(公告)日:2015-04-28
申请号:US14296781
申请日:2014-06-05
Applicant: Google Inc.
Inventor: Bo Wu , Alessandro Bissacco , Raymond W. Smith , Kong Man Cheung , Andrea Frome , Shlomo Urbach
CPC classification number: G06K9/3258 , G06K9/00 , G06K9/18 , G06K2209/01 , G06Q50/10
Abstract: A system and method is provided for automatically recognizing building numbers in street level images. In one aspect, a processor selects a street level image that is likely to be near an address of interest. The processor identifies those portions of the image that are visually similar to street numbers, and then extracts the numeric values of the characters displayed in such portions. If an extracted value corresponds with the building number of the address of interest such as being substantially equal to the address of interest, the extracted value and the image portion are displayed to a human operator. The human operator confirms, by looking at the image portion, whether the image portion appears to be a building number that matches the extracted value. If so, the processor stores a value that associates that building number with the street level image.
Abstract translation: 提供了一种用于自动识别街道图像中的建筑物编号的系统和方法。 在一个方面,处理器选择可能靠近感兴趣的地址的街道级图像。 处理器识别图像中与街道号码视觉相似的那些部分,然后提取在这些部分中显示的字符的数值。 如果提取的值对应于感兴趣的地址的建筑物号码,例如基本上等于感兴趣的地址,则提取的值和图像部分被显示给人类操作者。 人类操作者通过观察图像部分来确认图像部分是否看起来是与提取的值相匹配的建筑物号码。 如果是这样,处理器存储将建筑物号码与街道图像相关联的值。
-
公开(公告)号:US08942415B1
公开(公告)日:2015-01-27
申请号:US14062383
申请日:2013-10-24
Applicant: Google Inc.
Inventor: Andrea Frome
CPC classification number: G06K9/00268 , G06K9/00228 , G06K9/00288 , G06Q30/02 , G06T7/00 , G09F9/30 , H04N5/2723
Abstract: A system and method is provided wherein, in one aspect, a processor determines whether multiple street level images have captured a nearly-identical face. If so, the images are processed to determine whether the face appears to be part of an advertisement. Once it is determined that the face is displayed on an advertisement, the boundaries of the advertisement may be determined and the location of the advertisement is stored for future use, e.g., potentially replacing the advertisement in the image with a different advertisement.
Abstract translation: 提供了一种系统和方法,其中在一个方面,处理器确定多个街道级图像是否捕获了几乎相同的面部。 如果是这样,则处理图像以确定脸部是否看起来是广告的一部分。 一旦确定在广告上显示面部,则可以确定广告的边界,并且将广告的位置存储以备将来使用,例如潜在地用不同的广告替换图像中的广告。
-
-
-
-
-
-
-
-