-
公开(公告)号:US20240135486A1
公开(公告)日:2024-04-25
申请号:US18048975
申请日:2022-10-23
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Sebastien Gilbert , Michele Merler , Dhiraj Joshi , Apurv Gupta , Shyama Prosad Chowdhury , CHIDANSH AMITKUMAR BHATT , Nirmit V. Desai
CPC classification number: G06T3/0006 , G06T3/4046
Abstract: Described are techniques for oblique image rectification. The techniques include receiving an original image depicting an oblique view of a circular object and pre-processing the original image into an edge image. The techniques further include generating, by a machine learning model based on the edge image, a heatmap including an ellipse formed by the oblique view of the circular object. The techniques further include computing ellipse parameters describing the ellipse of the heatmap. The techniques further include performing, using the ellipse parameters, an affine transformation on the original image to generate a rectified image, where the rectified image converts the ellipse to a circle.
-
公开(公告)号:US20230186121A1
公开(公告)日:2023-06-15
申请号:US17548182
申请日:2021-12-10
Applicant: International Business Machines Corporation
Inventor: Jenny S. Li , Nirmit V. Desai , Dhiraj Joshi , Raghu Ramaswamy , Satish Rajani
CPC classification number: G06N5/04 , G06K9/6259 , G06N20/00
Abstract: A method, computer program product, and system include a processor(s) that engages, based on a request for an inference, from a group of sensors of multiple modalities at a physical location, sensor(s) of a main modality to provide data to a pipeline to generate the inference. The pipeline includes one or more machine learning models which generate the inference for a downstream task. The processor(s) obtains raw data from the sensor(s) of the main modality and applies an outlier detector to the raw data. Based on determining that there is an outlier the processor(s) automatically engages sensor(s) of at least one different modality than the main modality from the group of sensors of multiple modalities and obtains new raw data from the sensor(s) of the at least one different modality. The processor(s) applies the one or more machine learning models to the new raw data to derive the inference.
-
公开(公告)号:US20230124038A1
公开(公告)日:2023-04-20
申请号:US17451169
申请日:2021-10-18
Applicant: International Business Machines Corporation
Inventor: Jenny S. Li , Nirmit V. Desai , Dhiraj Joshi , Raghu Ramaswamy , Satish Rajani
Abstract: Optimizing sensing capabilities of a roaming robotic device using Artificial Intelligence (AI) includes receiving data at a control system having a computer from a robotic device. The control system communicating a policy to the robotic device for choosing navigation actions for the robotic device. The received data is analyzed using the control system for determining when the received data meets a threshold for determining quality of the data. The analysis can include generating a model based on the received data where the model includes vector representation of inputs detected by a sensor array at the location. In response to the received data at the control system not meeting the threshold for determining quality, the robotic device communicating with the control system to collaborate in updating the policy to choose a next action.
-
公开(公告)号:US11521044B2
公开(公告)日:2022-12-06
申请号:US15982181
申请日:2018-05-17
Applicant: International Business Machines Corporation , The Board of Trustees of the University of Illinois
Inventor: Khoi-Nguyen C. Mac , Raymond Alexander Yeh , Dhiraj Joshi , Minh N. Do , Rogerio Feris , Jinjun Xiong
Abstract: Techniques regarding action detection based on motion in receptive fields of a neural network model are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a motion component that can extract a motion vector from a plurality of adaptive receptive fields in a deformable convolution layer of a neural network model. The computer executable components can also comprise an action detection component that can generate a spatio-temporal feature by concatenating the motion vector with a spatial feature extracted from the deformable convolution layer.
-
公开(公告)号:US20200162799A1
公开(公告)日:2020-05-21
申请号:US16752641
申请日:2020-01-25
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Michele Merler , Dhiraj Joshi , Quoc-Bao Nguyen , Stephen C. Hammer , John Joseph Kent , John R. Smith , Rogerio Feris
IPC: H04N21/8549 , G06N20/00 , H04N21/44 , G06K9/00 , H04N21/442 , H04N21/466 , H04N21/431 , H04N21/439 , G06N3/04 , G06N3/08
Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.
-
6.
公开(公告)号:US12186907B2
公开(公告)日:2025-01-07
申请号:US17452247
申请日:2021-10-26
Applicant: International Business Machines Corporation
Inventor: Jenny S. Li , Raghu Ramaswamy , Nirmit V Desai , Dhiraj Joshi , Satish Rajani , Nancy Anne Greco , Shiva G , Aakash Praliya , Wei-Han Lee , Luis Angel Bathen , Tova Roth , Sujoy Kumar Roy Chowdhury , Prakriti Pritmani , Kay Murphy , Shilpa Shenai , Arun Yashwant Ingale , Ajjay Ratnakar , Gwilym Benjamin Lee Newton
Abstract: Dynamically adjusting, using artificial intelligence (AI), sensors and models of an autonomous roaming robotic device, which includes receiving data regarding an asset at a computer of a roaming robotic device from sensors on the robotic device. The robotic device identifies an asset at a location using the sensors, and the robotic device has instructions, received from a control system, to inspect the location or items at the location. The data is analyzed using the computer of the robotic device, and the analysis includes using historical data for the asset. An AI model is loaded using the computer of the robotic device, based on the identification of the asset. A sensor is selected using the computer of the robotic device, for conducting an inspection of the asset based on the analysis of the data and the AI model.
-
公开(公告)号:US20240112444A1
公开(公告)日:2024-04-04
申请号:US17936519
申请日:2022-09-29
Applicant: International Business Machines Corporation
Inventor: Michele Merler , Dhiraj Joshi , Apurv Gupta , Sebastien Gilbert , Shyama Prosad Chowdhury , Chidansh Amitkumar Bhatt , Nirmit V. Desai
IPC: G06V10/764 , G06V10/22 , G06V10/74 , G06V10/84 , G06V30/19
CPC classification number: G06V10/764 , G06V10/23 , G06V10/761 , G06V10/85 , G06V30/19173 , G06V2201/07
Abstract: Automated analog gauge reading is provided. The method comprises a computer system receiving input of an image and detecting at least one analog gauge in the image. The computer system corrects the orientation of the analog gauge in the image and detects scene text and tick labels on the analog gauge. The computer system determines a position of a pointer on the analog gauge relative to the scene text and outputs a gauge reading value based on an arithmetic progression of tick labels and angle of the pointer with respect to minimum and maximum values on the analog gauge.
-
8.
公开(公告)号:US20230126457A1
公开(公告)日:2023-04-27
申请号:US17452247
申请日:2021-10-26
Applicant: International Business Machines Corporation
Inventor: Jenny S. Li , Raghu Ramaswamy , Nirmit V Desai , Dhiraj Joshi , Satish Rajani , Nancy Anne Greco , Shiva G , Aakash Praliya , Wei-Han Lee , Luis Angel Bathen , Tova Roth , Sujoy Kumar Roy Chowdhury , Prakriti Pritmani , Kay Murphy , Shilpa Shenai , Arun Yashwant Ingale , Ajjay Ratnakar , Gwilym Benjamin Lee Newton
Abstract: Dynamically adjusting, using artificial intelligence (AI), sensors and models of an autonomous roaming robotic device, which includes receiving data regarding an asset at a computer of a roaming robotic device from sensors on the robotic device. The robotic device identifies an asset at a location using the sensors, and the robotic device has instructions, received from a control system, to inspect the location or items at the location. The data is analyzed using the computer of the robotic device, and the analysis includes using historical data for the asset. An AI model is loaded using the computer of the robotic device, based on the identification of the asset. A sensor is selected using the computer of the robotic device, for conducting an inspection of the asset based on the analysis of the data and the AI model.
-
公开(公告)号:US20240104369A1
公开(公告)日:2024-03-28
申请号:US17935198
申请日:2022-09-26
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Dinesh C. Verma , Franck Vinh Le , Michele Merler , Dhiraj Joshi , SUPRIYO CHAKRABORTY , Seraphin Bernard Calo
CPC classification number: G06N3/08 , G06K9/6201 , G06K9/6232 , G06K9/6262
Abstract: A system may receive an existing base set of knowledge, train a neural network on the base set of knowledge, deploy the neural network on a new data set, generate, using the deployment, instances of new knowledge, and validate the instances of new knowledge.
-
公开(公告)号:US11830241B2
公开(公告)日:2023-11-28
申请号:US16752641
申请日:2020-01-25
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Michele Merler , Dhiraj Joshi , Quoc-Bao Nguyen , Stephen C. Hammer , John Joseph Kent , John R. Smith , Rogerio Feris
IPC: H04N21/233 , H04N21/234 , H04N21/25 , G06N20/10 , G06V10/82 , H04N21/8549 , G06N3/08 , G06N3/04 , H04N21/439 , H04N21/431 , H04N21/466 , H04N21/442 , H04N21/44 , G06N20/00 , G06V20/40 , G06V40/16 , G06V30/19 , G06V30/10
CPC classification number: G06V10/82 , G06N3/04 , G06N3/08 , G06N20/00 , G06V20/46 , G06V30/19173 , G06V40/16 , G06V40/172 , H04N21/439 , H04N21/4312 , H04N21/44 , H04N21/442 , H04N21/466 , H04N21/8549 , G06V30/10
Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.
-
-
-
-
-
-
-
-
-