-
公开(公告)号:US11100145B2
公开(公告)日:2021-08-24
申请号:US16656468
申请日:2019-10-17
Applicant: International Business Machines Corporation
Inventor: Xiaoxiao Guo , Hui Wu , Rogerio Feris
IPC: G06F16/00 , G06F16/332 , G06F16/383 , G06K9/62 , G06N3/04 , G06F16/535 , G06F16/53
Abstract: A method includes: receiving initial input from a client at least partially specifying one or more characteristics sought by the client; selecting a set of images from an image database for output to the client; and determining after each set of images whether an end condition has occurred. The method also includes, until the end condition has occurred: responsive to each set of images output to the client, receiving additional input from the client further specifying the one or more characteristics sought by the client; and responsive to each input received from the client, selecting another set of images for presentation to the client, said set of images being determined to at least partially satisfy the one or more characteristics specified by all input received from the client, said determination being based at least in part on side information for respective images for at least the set of images.
-
公开(公告)号:US11860928B2
公开(公告)日:2024-01-02
申请号:US17406893
申请日:2021-08-19
Applicant: International Business Machines Corporation
Inventor: Xiaoxiao Guo , Hui Wu , Rogerio Feris
IPC: G06F16/00 , G06F16/53 , G06F16/332 , G06F16/383 , G06N3/049 , G06F16/535 , G06F18/21 , G06V10/70 , G06V10/82 , G06V40/10
CPC classification number: G06F16/53 , G06F16/3328 , G06F16/3329 , G06F16/383 , G06F16/535 , G06F18/2185 , G06N3/049 , G06V10/768 , G06V10/82 , G06V40/103
Abstract: A method includes receiving input from a client at least partially specifying one or more characteristics, wherein the initial input includes a seed image and a natural language statement describing a desired change to the seed image; predicting one or more attributes of the seed image by operation of a neural network on the seed image; and parsing the natural language statement to identify desired changes to the one or more attributes of the seed image. The method also includes generating an interim target image by changing the one or more attributes of the seed image, according to the parsed natural language statement; selecting a set of images from an image database for output to the client, each of said set of images being determined to at least partially satisfy the one or more changed attributes of the seed image; and displaying the set of images to the client.
-
公开(公告)号:US12205306B2
公开(公告)日:2025-01-21
申请号:US16191759
申请日:2018-11-15
Applicant: International Business Machines Corporation
Inventor: Chung-Ching Lin , Rogerio Feris , Honghui Shi , Quanfu Fan , Lisa Brown , Mandis Beigi
Abstract: A system and a method for tracking a plurality of objects, including obtaining input data, estimating a number of skipping frames of the input data based on information from the input data, predicting results based on the estimating of the number of skipping frames, and correcting the predicted results.
-
公开(公告)号:US11521044B2
公开(公告)日:2022-12-06
申请号:US15982181
申请日:2018-05-17
Applicant: International Business Machines Corporation , The Board of Trustees of the University of Illinois
Inventor: Khoi-Nguyen C. Mac , Raymond Alexander Yeh , Dhiraj Joshi , Minh N. Do , Rogerio Feris , Jinjun Xiong
Abstract: Techniques regarding action detection based on motion in receptive fields of a neural network model are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a motion component that can extract a motion vector from a plurality of adaptive receptive fields in a deformable convolution layer of a neural network model. The computer executable components can also comprise an action detection component that can generate a spatio-temporal feature by concatenating the motion vector with a spatial feature extracted from the deformable convolution layer.
-
公开(公告)号:US20200162799A1
公开(公告)日:2020-05-21
申请号:US16752641
申请日:2020-01-25
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Michele Merler , Dhiraj Joshi , Quoc-Bao Nguyen , Stephen C. Hammer , John Joseph Kent , John R. Smith , Rogerio Feris
IPC: H04N21/8549 , G06N20/00 , H04N21/44 , G06K9/00 , H04N21/442 , H04N21/466 , H04N21/431 , H04N21/439 , G06N3/04 , G06N3/08
Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.
-
公开(公告)号:US11830241B2
公开(公告)日:2023-11-28
申请号:US16752641
申请日:2020-01-25
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Michele Merler , Dhiraj Joshi , Quoc-Bao Nguyen , Stephen C. Hammer , John Joseph Kent , John R. Smith , Rogerio Feris
IPC: H04N21/233 , H04N21/234 , H04N21/25 , G06N20/10 , G06V10/82 , H04N21/8549 , G06N3/08 , G06N3/04 , H04N21/439 , H04N21/431 , H04N21/466 , H04N21/442 , H04N21/44 , G06N20/00 , G06V20/40 , G06V40/16 , G06V30/19 , G06V30/10
CPC classification number: G06V10/82 , G06N3/04 , G06N3/08 , G06N20/00 , G06V20/46 , G06V30/19173 , G06V40/16 , G06V40/172 , H04N21/439 , H04N21/4312 , H04N21/44 , H04N21/442 , H04N21/466 , H04N21/8549 , G06V30/10
Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.
-
公开(公告)号:US20190354835A1
公开(公告)日:2019-11-21
申请号:US15982181
申请日:2018-05-17
Applicant: International Business Machines Corporation , The Board of Trustees of the University of Illinois
Inventor: Khoi-Nguyen C. Mac , Raymond Alexander Yeh , Dhiraj Joshi , Minh N. Do , Rogerio Feris , Jinjun Xiong
Abstract: Techniques regarding action detection based on motion in receptive fields of a neural network model are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a motion component that can extract a motion vector from a plurality of adaptive receptive fields in a deformable convolution layer of a neural network model. The computer executable components can also comprise an action detection component that can generate a spatio-temporal feature by concatenating the motion vector with a spatial feature extracted from the deformable convolution layer.
-
公开(公告)号:US20210382922A1
公开(公告)日:2021-12-09
申请号:US17406893
申请日:2021-08-19
Applicant: International Business Machines Corporation
Inventor: Xiaoxiao Guo , Hui Wu , Rogerio Feris
IPC: G06F16/332 , G06F16/383 , G06K9/62 , G06N3/04 , G06F16/535 , G06F16/53
Abstract: A method includes receiving input from a client at least partially specifying one or more characteristics, wherein the initial input includes a seed image and a natural language statement describing a desired change to the seed image; predicting one or more attributes of the seed image by operation of a neural network on the seed image; and parsing the natural language statement to identify desired changes to the one or more attributes of the seed image. The method also includes generating an interim target image by changing the one or more attributes of the seed image, according to the parsed natural language statement; selecting a set of images from an image database for output to the client, each of said set of images being determined to at least partially satisfy the one or more changed attributes of the seed image; and displaying the set of images to the client.
-
公开(公告)号:US20210073252A1
公开(公告)日:2021-03-11
申请号:US16656468
申请日:2019-10-17
Applicant: International Business Machines Corporation
Inventor: Xiaoxiao Guo , Hui Wu , Rogerio Feris
IPC: G06F16/332 , G06F16/383 , G06N3/04 , G06K9/62
Abstract: A method includes: receiving initial input from a client at least partially specifying one or more characteristics sought by the client; selecting a set of images from an image database for output to the client; and determining after each set of images whether an end condition has occurred. The method also includes, until the end condition has occurred: responsive to each set of images output to the client, receiving additional input from the client further specifying the one or more characteristics sought by the client; and responsive to each input received from the client, selecting another set of images for presentation to the client, said set of images being determined to at least partially satisfy the one or more characteristics specified by all input received from the client, said determination being based at least in part on side information for respective images for at least the set of images.
-
公开(公告)号:US20200160060A1
公开(公告)日:2020-05-21
申请号:US16191759
申请日:2018-11-15
Applicant: International Business Machines Corporation
Inventor: Chung-Ching Lin , Rogerio Feris , Honghui Shi , Quanfu Fan , Lisa Brown , Mandis Beigi
Abstract: A system and a method for tracking a plurality of objects, including obtaining input data, estimating a number of skipping frames of the input data based on information from the input data, predicting results based on the estimating of the number of skipping frames, and correcting the predicted results.
-
-
-
-
-
-
-
-
-