CONTENT-AWARE AI-ASSISTED VIDEO SUMMARIZATION

    Publication No.: US20250022273A1

    Publication date: 2025-01-16

    Application No.: US18221766

    Filing date: 2023-07-13

    Applicant: HITACHI, Ltd.

    Abstract: Systems and methods for summarizing an infrastructure surface inspection video, which can involve processing frames of the infrastructure surface inspection video through a spatial in-frame process configured to conduct feature and text extraction on the frames on detected observations, the extracted features and text used to generate spatial metadata; processing the frames of the infrastructure surface inspection video through a temporal cross-frame process configured to detect changes across sequences of the frames including the spatial metadata, the temporal cross-frame process configured to generate temporal metadata encapsulating the detected changes; and processing the frames of the infrastructure surface inspection video through an activity decision process configured to intake the spatial metadata and temporal metadata to detect and track structural features or defects of interest as activity in the infrastructure surface inspection video and generate summary metadata from the spatial metadata, temporal metadata, and detected segments of interest.
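The three stages named in this abstract (spatial in-frame, temporal cross-frame, activity decision) can be illustrated with a toy sketch. This is not the applicant's implementation: it assumes each frame has already been reduced to a list of detected labels, and every function name, field name, and label below is hypothetical.

```python
def spatial_metadata(frames):
    """Spatial in-frame stage: one metadata record per frame.

    `frames` is a list of label lists, a stand-in for real detector/OCR output.
    """
    return [{"frame": i, "labels": sorted(set(labels))}
            for i, labels in enumerate(frames)]


def temporal_metadata(spatial):
    """Temporal cross-frame stage: record frames where the label set changes."""
    changes = []
    previous = set()
    for entry in spatial:
        current = set(entry["labels"])
        if current != previous:
            changes.append({
                "frame": entry["frame"],
                "appeared": sorted(current - previous),
                "disappeared": sorted(previous - current),
            })
        previous = current
    return changes


def summarize(spatial, temporal, labels_of_interest):
    """Activity decision stage: contiguous frame ranges containing a label of interest."""
    segments = []
    start = None
    for entry in spatial:
        active = bool(labels_of_interest & set(entry["labels"]))
        if active and start is None:
            start = entry["frame"]
        elif not active and start is not None:
            segments.append((start, entry["frame"] - 1))
            start = None
    if start is not None:
        segments.append((start, spatial[-1]["frame"]))
    return {"segments": segments,
            "change_points": [c["frame"] for c in temporal]}


# Hypothetical per-frame detections for a five-frame clip.
frames = [[], ["crack"], ["crack"], [], ["joint"]]
spatial = spatial_metadata(frames)
temporal = temporal_metadata(spatial)
summary = summarize(spatial, temporal, {"crack"})
```

Representing detections as plain label lists keeps the sketch self-contained; a real system would carry bounding boxes, extracted text, and confidence scores in the spatial metadata.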

    VISUAL INSPECTION METHOD

    Publication No.: US20240428510A1

    Publication date: 2024-12-26

    Application No.: US18214157

    Filing date: 2023-06-26

    Applicant: HITACHI, Ltd.

    Abstract: Generating a 3D attention model from use of a trained classifier configured to generate an attention map from 2D image frames and a 3D reconstruction process configured to generate a 3D reconstructed representation from the 2D image frames, which can involve, for an input of the 2D image frames, creating, through a 3D reconstruction process, the 3D reconstructed representation using the 2D image frames after data collection of an inspection process, the 3D reconstructed representation associated with a mapping to the 2D image frames; executing the trained classifier on the 2D image frames of the video to generate attention maps of the 2D image frames; projecting the attention maps of the 2D image frames to the 3D reconstructed representation based on the mapping to the 2D image frames; and storing the 3D attention model involving the associated 3D attention maps and the 3D reconstructed representation.
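The projection step in this abstract can be pictured with a small sketch. It assumes the mapping takes the form of a table from each 3D point to the 2D pixels that observed it, and that attention values from multiple frames are combined by averaging. Both choices are assumptions made for illustration; the abstract does not specify them, and the function and variable names are hypothetical.

```python
def project_attention(attention_maps, point_to_pixels):
    """Project per-frame 2D attention maps onto 3D points.

    attention_maps:  list of 2D grids (lists of rows), one per frame.
    point_to_pixels: dict mapping a 3D point id to a list of
                     (frame_index, row, col) observations.
    Returns a dict mapping each observed 3D point id to its mean attention.
    """
    attention_3d = {}
    for point_id, observations in point_to_pixels.items():
        values = [attention_maps[f][r][c] for f, r, c in observations]
        if values:  # skip points that no frame observed
            attention_3d[point_id] = sum(values) / len(values)
    return attention_3d


# Two tiny 1x2 attention maps and a hypothetical 3D-to-2D mapping.
attention_maps = [[[0.0, 1.0]], [[0.5, 0.5]]]
point_to_pixels = {"p0": [(0, 0, 1), (1, 0, 0)], "p1": []}
attention_3d = project_attention(attention_maps, point_to_pixels)
```

Averaging is only one way to fuse observations; taking the maximum would instead emphasize any single frame in which the classifier attended strongly to the point.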

    OBJECT SEARCH DEVICE AND OBJECT SEARCH METHOD

    Publication No.: US20230005268A1

    Publication date: 2023-01-05

    Application No.: US17784472

    Filing date: 2020-10-10

    Applicant: Hitachi, Ltd.

    Abstract: An object of the invention is to configure an object search device capable of expressing information on shapes and irregularities as features only by images, in a search for an object that is characteristic in shape or irregularity, and performing an accurate search.
    The object search device includes: an image feature extraction unit that is configured with a first neural network, and is configured to input an image to extract an image feature; a three-dimensional data feature extraction unit that is configured with a second neural network, and is configured to input three-dimensional data to extract a three-dimensional data feature; a learning unit that is configured to extract an image feature and a three-dimensional data feature from an image and three-dimensional data of an object obtained from a same individual, respectively, and update an image feature extraction parameter so as to reduce a difference between the image feature and the three-dimensional data feature; and a search unit that is configured to extract image features of a query image and a gallery image of the object by the image feature extraction unit using the updated image feature extraction parameter, and calculate a similarity between the image features of both images to search for the object.
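The learning and search steps described above can be sketched in a few lines. The abstract names neither the difference measure nor the similarity measure, so squared Euclidean distance and cosine similarity are used here purely as assumptions. Feature vectors are plain Python lists standing in for neural-network outputs, and all names are hypothetical.

```python
import math


def feature_gap(image_feature, shape_feature):
    """Squared Euclidean distance between an image feature and a 3D-data feature.

    This stands in for the difference the learning unit would reduce.
    """
    return sum((x - y) ** 2 for x, y in zip(image_feature, shape_feature))


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_gallery(query_feature, gallery_features):
    """Return (gallery_id, similarity) pairs, most similar first."""
    scored = [(gid, cosine_similarity(query_feature, feat))
              for gid, feat in gallery_features.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical features: the query matches gallery item "a" exactly.
ranking = rank_gallery([1.0, 0.0], {"a": [1.0, 0.0], "b": [0.0, 1.0]})
```

At search time only image features are compared, which matches the abstract's point that shape information is expressed through images alone once the image feature extractor has been trained.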

    VIDEO ANALYSIS SYSTEM AND VIDEO ANALYSIS METHOD

    Publication No.: US20210287503A1

    Publication date: 2021-09-16

    Application No.: US17143638

    Filing date: 2021-01-07

    Applicant: HITACHI, LTD.

    Abstract: An object of the present invention is to provide a video analysis system and a video analysis method in which ownership between a person and baggage can be comprehensively determined. In a system that analyzes videos photographed by plural cameras, a detection/tracking process is performed for first and second objects using videos of plural cameras, and a relationship degree between the first and second objects is determined on the basis of the types of the first and second objects and a distance between the objects to be stored in a database.
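A relationship degree driven by object types and inter-object distance could look like the sketch below. The scoring formula (a type-pair weight multiplied by an exponential distance decay), the weight table, and the `scale` parameter are all assumptions chosen for illustration; the abstract states only that types and distance are the inputs.

```python
import math

# Hypothetical weights for which type pairs can stand in an ownership relation.
TYPE_PAIR_WEIGHT = {("person", "baggage"): 1.0}


def relationship_degree(type_a, type_b, pos_a, pos_b, scale=2.0):
    """Score in [0, 1]: higher when the types are compatible and the objects are close.

    pos_a and pos_b are coordinates in a shared frame (for example, metres on a
    floor plan); `scale` controls how quickly the score decays with distance.
    """
    weight = TYPE_PAIR_WEIGHT.get(
        (type_a, type_b), TYPE_PAIR_WEIGHT.get((type_b, type_a), 0.0))
    distance = math.dist(pos_a, pos_b)
    return weight * math.exp(-distance / scale)


near = relationship_degree("person", "baggage", (0.0, 0.0), (0.5, 0.0))
far = relationship_degree("person", "baggage", (0.0, 0.0), (6.0, 0.0))
```

In a multi-camera setting, scores such as these would typically be accumulated over time per tracked pair before being stored, so that a brief crossing of paths does not register as ownership.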
