-
公开(公告)号:US11893751B2
公开(公告)日:2024-02-06
申请号:US17405060
申请日:2021-08-18
Applicant: Tata Consultancy Services Limited
Inventor: Junaid Ahmed Ansari , Brojeshwar Bhowmick
IPC: G06K9/00 , G06T7/215 , G06T7/246 , G06N3/0442
CPC classification number: G06T7/215 , G06N3/0442 , G06T7/246 , G06T2207/10016 , G06T2207/20084
Abstract: This disclosure relates generally to system and method for forecasting location of target in monocular first person view. Conventional systems for location forecasting utilizes complex neural networks and hence are computationally intensive and requires high compute power. The disclosed system includes an efficient and light-weight RNN based network model for predicting motion of targets in first person monocular videos. The network model includes an auto-encoder in the encoding phase and a regularizing layer in the end helps us get better accuracy. The disclosed method relies entirely just on detection bounding boxes for prediction as well as training of the network model and is still capable of transferring zero-shot on a different dataset.
-
公开(公告)号:US11597080B2
公开(公告)日:2023-03-07
申请号:US17015238
申请日:2020-09-09
Applicant: Tata Consultancy Services Limited
Inventor: Chayan Sarkar , Snehasis Banerjee , Pradip Pramanick , Hrishav Bakul Barua , Soumyadip Maity , Dipanjan Das , Brojeshwar Bhowmick , Ashis Sau , Abhijan Bhattacharyya , Arpan Pal , Balamuralidhar Purushothaman , Ruddra Roy Chowdhury
Abstract: Conventional tele-presence robots have their own limitations with respect to task execution, information processing and management. Embodiments of the present disclosure provide a tele-presence robot (TPR) that communicates with a master device associated with a user via an edge device for task execution wherein control command from the master device is parsed for determining instructions set and task type for execution. Based on this determination, the TPR queries for information across storage devices until a response is obtained enough to execute task. The task upon execution is validated with the master device and user. Knowledge acquired, during querying, task execution and validation of the executed task, is dynamically partitioned by the TPR across storage devices namely, on-board memory of the tele-present robot, an edge device, a cloud and a web interface respectively depending upon the task type, operating environment of the tele-presence robot, and other performance affecting parameters.
-
公开(公告)号:US11256962B2
公开(公告)日:2022-02-22
申请号:US16815206
申请日:2020-03-11
Applicant: Tata Consultancy Services Limited
Inventor: Sandika Biswas , Sanjana Sinha , Kavya Gupta , Brojeshwar Bhowmick
Abstract: Estimating 3D human pose from monocular images is a challenging problem due to the variety and complexity of human poses and the inherent ambiguity in recovering depth from single view. Recent deep learning based methods show promising results by using supervised learning on 3D pose annotated datasets. However, the lack of large-scale 3D annotated training data makes the 3D pose estimation difficult in-the-wild. Embodiments of the present disclosure provide a method which can effectively predict 3D human poses from only 2D pose in a weakly-supervised manner by using both ground-truth 3D pose and ground-truth 2D pose based on re-projection error minimization as a constraint to predict the 3D joint locations. The method may further utilize additional geometric constraints on reconstructed body parts to regularize the pose in 3D along with minimizing re-projection error to improvise on estimating an accurate 3D pose.
-
14.
公开(公告)号:US11216692B2
公开(公告)日:2022-01-04
申请号:US16502760
申请日:2019-07-03
Applicant: Tata Consultancy Services Limited
Inventor: Kavya Gupta , Brojeshwar Bhowmick , Angshul Majumdar
IPC: G06K9/62
Abstract: This disclosure relates to systems and methods for solving generic inverse problems by providing a coupled representation architecture using transform learning. Convention solutions are complex, require long training and testing times, reconstruction quality also may not be suitable for all applications. Furthermore, they preclude application to real-time scenarios due to the mentioned inherent lacunae. The methods provided herein require involve very low computational complexity with a need for only three matrix-vector products, and requires very short training and testing times, which makes it applicable for real-time applications. Unlike the conventional learning architectures using inductive approaches, the CASC of the present disclosure can learn directly from the source domain and the number of features in a source domain may not be necessarily equal to the number of features in a target domain.
-
公开(公告)号:US20210291363A1
公开(公告)日:2021-09-23
申请号:US17015238
申请日:2020-09-09
Applicant: Tata Consultancy Services Limited
Inventor: Chayan Sarkar , Snehasis Banerjee , Pradip Pramanick , Hrishav Bakul Barua , Soumyadip Maity , Dipanjan Das , Brojeshwar Bhowmick , Ashis Sau , Abhijan Bhattacharyya , Arpan Pal , Balamuralidhar PURUSHOTHAMAN , Ruddra Roy Chowdhury
Abstract: Conventional tele-presence robots have their own limitations with respect to task execution, information processing and management. Embodiments of the present disclosure provide a tele-presence robot (TPR) that communicates with a master device associated with a user via an edge device for task execution wherein control command from the master device is parsed for determining instructions set and task type for execution. Based on this determination, the TPR queries for information across storage devices until a response is obtained enough to execute task. The task upon execution is validated with the master device and user. Knowledge acquired, during querying, task execution and validation of the executed task, is dynamically partitioned by the TPR across storage devices namely, on-board memory of the tele-present robot, an edge device, a cloud and a web interface respectively depending upon the task type, operating environment of the tele-presence robot, and other performance affecting parameters.
-
公开(公告)号:US20190008421A1
公开(公告)日:2019-01-10
申请号:US15908731
申请日:2018-02-28
Applicant: Tata Consultancy Services Limited
Inventor: Sanjana Sinha , Brojeshwar Bhowmick , Aniruddha Sinha , Abhijit Das
Abstract: Body joint tracking is applied in various industries and medical field. In body joint tracking, marker less devices plays an important role. However, the marker less devices are facing some challenges in providing optimal tracking due to occlusion, ambiguity, lighting conditions, dynamic objects etc. System and method of the present disclosure provides an optimized body joint tracking. Here, motion data pertaining to a first set of motion frames from a motion sensor are received. Further, the motion data are processed to obtain a plurality of 3 dimensional cylindrical models. Here, every cylindrical model among the plurality of 3 dimensional cylindrical model represents a body segment. The coefficients associated with the plurality of 3 dimensional cylindrical models are initialized to obtain a set of initialized cylindrical models. A set of dynamic coefficients associated with the initialized cylindrical models are utilized to track joint motion trajectories of a set of subsequent frames.
-
公开(公告)号:US09865061B2
公开(公告)日:2018-01-09
申请号:US14493959
申请日:2014-09-23
Applicant: Tata Consultancy Services Limited
Inventor: Brojeshwar Bhowmick , Apurbaa Mallik , Arindam Saha
CPC classification number: G06T7/593 , G06K9/209 , G06K2209/40 , G06T7/579 , G06T7/80 , G06T7/97 , G06T2200/08 , G06T2207/10021
Abstract: Disclosed is a method and system for constructing a 3D structure. The system of the present disclosure comprises an image capturing unit for capturing images of an object. The system comprises of a gyroscope, a magnetometer, and an accelerometer for determining extrinsic camera parameters, wherein the extrinsic camera parameters comprise a rotation and a translation of the images. Further the system determines an internal calibration matrix once. The system uses the extrinsic camera parameters and the internal calibration matrix for determining a fundamental matrix. The system extracts features of the images for establishing point correspondences between the images. Further, the point correspondences are filtered using the fundamental matrix for generating filtered point correspondences. The filtered point correspondences are triangulated for determining 3D points representing the 3D structure. Further, the 3D structure may be optimized for eliminating reprojection errors associated with the 3D structure.
-
公开(公告)号:US11941760B2
公开(公告)日:2024-03-26
申请号:US17807339
申请日:2022-06-16
Applicant: Tata Consultancy Services Limited
Inventor: Swapna Agarwal , Soumyadip Maity , Hrishav Bakul Barua , Brojeshwar Bhowmick
CPC classification number: G06T17/20 , G06T7/70 , G06T2207/10024
Abstract: Traditional machine learning (ML) based systems used for scene recognition and object recognition have the disadvantage that they require huge quantity of labeled data to generate data models for the purpose of aiding the scene and object recognition. The disclosure herein generally relates to image processing, and, more particularly, to method and system for generating 3D mesh generation using planar and non-planar data. The system extracts planar point cloud and non-planar point cloud from each RGBD image in a sequence of RGBD images fetched as input, and then generates a planar mesh and a non-planar mesh for planar and non-planar objects in the image. A mesh representation is generated by merging the planar mesh and the non-planar mesh. Further, an incremental merging of the mesh representation is performed on the sequence of RGBD images, based on an estimated camera pose information, to generate representation of the scene.
-
公开(公告)号:US11794347B2
公开(公告)日:2023-10-24
申请号:US17199182
申请日:2021-03-11
Applicant: Tata Consultancy Services Limited
Inventor: Abhijan Bhattacharyya , Ruddra dev Roychoudhury , Sanjana Sinha , Sandika Biswas , Ashis Sau , Madhurima Ganguly , Sayan Paul , Brojeshwar Bhowmick
CPC classification number: B25J9/1689 , B25J3/00 , B25J9/161 , B25J9/163 , B25J9/1666 , B25J9/1697 , B25J13/089
Abstract: This disclosure relates generally to navigation of a tele-robot in dynamic environment using in-situ intelligence. Tele-robotics is the area of robotics concerned with the control of robots (tele-robots) in a remote environment from a distance. In reality the remote environment where the tele robot navigates may be dynamic in nature with unpredictable movements, making the navigation extremely challenging. The disclosure proposes an in-situ intelligent navigation of a tele-robot in a dynamic environment. The disclosed in-situ intelligence enables the tele-robot to understand the dynamic environment by identification and estimation of future location of objects based on a generating/training a motion model. Further the disclosed techniques also enable communication between a master and the tele-robot (whenever necessary) based on an application layer communication semantic.
-
公开(公告)号:US11670047B2
公开(公告)日:2023-06-06
申请号:US16918743
申请日:2020-07-01
Applicant: Tata Consultancy Services Limited
Inventor: Dipanjan Das , Brojeshwar Bhowmick , Aniket Pokale , Krishnan Madhava Krishna , Aditya Aggarwal
CPC classification number: G06T17/05 , G06T7/174 , G06T7/55 , G06T7/75 , G06T7/77 , G06T15/04 , G06T2207/10021 , G06T2207/30241 , G06T2207/30244
Abstract: The embodiments herein provide a system and method for integrating objects in monocular simultaneous localization and mapping (SLAM). State of art object SLAM approach use two popular threads. In first, instance specific models are assumed to be known a priori. In second, a general model for an object such as ellipsoids and cuboids is used. However, these generic models just give the label of the object category and do not give much information about the object pose in the map. The method and system disclosed provide a SLAM framework on a real monocular sequence wherein joint optimization is performed on object localization and edges using category level shape priors and bundle adjustment. The method provides a better visualization incorporating object representations in the scene along with the 3D structure of the base SLAM system, which makes it useful for augmented reality (AR) applications.
-
-
-
-
-
-
-
-
-