-
公开(公告)号:US20250022273A1
公开(公告)日:2025-01-16
申请号:US18221766
申请日:2023-07-13
Applicant: HITACHI, Ltd.
Inventor: Rahul VISHWAKARMA , Ravigopal VENNELAKANTI , Nam HUYN , Masato TAMURA , Malarvizhi SANKARANARAYANASAMY
Abstract: Systems and methods for summarizing an infrastructure surface inspection video, which can involve processing frames of the infrastructure surface inspection video through a spatial in-frame process configured to conduct feature and text extraction on the frames on detected observations, the extracted features and text used to generate spatial metadata; processing the frames of the infrastructure surface inspection video through a temporal cross-frame process configured to detect changes across sequences of the frames including the spatial metadata, the temporal cross-frame process configured to generate temporal metadata encapsulating the detected changes; and processing the frames of the infrastructure surface inspection video through an activity decision process configured to intake the spatial metadata and temporal metadata to detect and track structural features or defects of interest as activity in the infrastructure surface inspection video and generate summary metadata from the spatial metadata, temporal metadata, and detected segments of interest.
-
公开(公告)号:US20240428510A1
公开(公告)日:2024-12-26
申请号:US18214157
申请日:2023-06-26
Applicant: HITACHI, Ltd.
Inventor: Masato TAMURA , Ravigopal VENNELAKANTI , Rahul VISHWAKARMA , Nam HUYN , Malarvizhi SANKARANARAYANASAMY
IPC: G06T17/00 , G06T7/00 , G06V10/764 , G06V10/774 , G06V20/70
Abstract: Generating a 3D attention model from use of a trained classifier configured to generate an attention map from 2D image frames and a 3D reconstruction process configured to generate a 3D reconstructed representation from the 2D image frames, which can involve, for an input of the 2D image frames creating, through a 3D reconstruction process, the 3D reconstructed representation using the 2D image frames after data collection of an inspection process, the 3D reconstructed representation associated with a mapping to the 2D image frames; executing the trained classifier on the 2D image frames of the video to generate attention maps of the 2D image frames; projecting the attention maps of the 2D image frames to the 3D reconstructed representation based on the mapping to the 2D image frames; and storing the 3D attention model involving the associated 3D attention maps and the 3D reconstructed representation.
-
公开(公告)号:US20240144794A1
公开(公告)日:2024-05-02
申请号:US17976687
申请日:2022-10-28
Applicant: Hitachi, Ltd.
Inventor: Masato TAMURA , Ravigopal VENNELAKANTI
CPC classification number: G08B13/19608 , G06T7/246 , G06V10/25 , G06V10/761 , G06V20/53 , G06V20/64 , G06V40/10 , G06T2207/30196 , G06T2207/30232
Abstract: A method for tracking and monitoring subjects and a plurality of objects. The method may include obtaining an image, wherein the image contains the subjects and the plurality of objects; extracting the subjects and the plurality of objects in the image through first feature extraction; detecting object interactions between the subjects and the plurality of objects; and tracking, through second feature extraction, subject-object pairs having detected object interactions.
-
公开(公告)号:US20230005268A1
公开(公告)日:2023-01-05
申请号:US17784472
申请日:2020-10-10
Applicant: Hitachi, Ltd.
Inventor: Masato TAMURA , Tomoaki YOSHINAGA , Atsushi HIROIKE , Hiromu NAKAMAE , Yuta YANASHIMA
Abstract: An object of the invention is to configure an object search device capable of expressing information on shapes and irregularities as features only by images, in a search for an object that is characteristic in shape or irregularity, and performing an accurate search.
The object search device includes: an image feature extraction unit that is configured with a first neural network, and is configured to input an image to extract an image feature; a three-dimensional data feature extraction unit that is configured with a second neural network, and is configured to input three-dimensional data to extract a three-dimensional data feature; a learning unit that is configured to extract an image feature and a three-dimensional data feature from an image and three-dimensional data of an object obtained from a same individual, respectively, and update an image feature extraction parameter so as to reduce a difference between the image feature and the three-dimensional data feature; and a search unit that is configured to extract image features of a query image and a gallery image of the object by the image feature extraction unit using the updated image feature extraction parameter, and calculate a similarity between the image features of both images to search for the object.-
公开(公告)号:US20210287503A1
公开(公告)日:2021-09-16
申请号:US17143638
申请日:2021-01-07
Applicant: HITACHI, LTD.
Inventor: Toshiaki TARUI , Tomokazu MURAKAMI , Shun FUKUDA , Masato TAMURA , Keiichi HIROKI
IPC: G08B13/196 , G06K9/00
Abstract: An object of the present invention is to provide a video analysis system and a video analysis method in which ownership between a person and baggage can be comprehensively determined. In a system that analyzes videos photographed by plural cameras, a detection/tracking process is performed for first and second objects using videos of plural cameras, and a relationship degree between the first and second objects is determined on the basis of the types of the first and second objects and a distance between the objects to be stored in a database.
-
-
-
-