专利检索 ap:("Roku, Inc.") AND inv:"Jose Pio Pereira" 第 2 页

11.

发明授权
Method and apparatus for multi-dimensional content search and video identification 有权

公开(公告)号：US11288313B2

公开(公告)日：2022-03-29

申请号：US16442398

申请日：2019-06-14

申请人： Roku, Inc.

发明人： Jose Pio Pereira , Mihailo M. Stojancic , Shashank Merchant

IPC分类号： G06F16/00 , G06F16/783 , G06F16/71 , G06F16/901 , G06K9/00 , G06T7/11 , G06T7/12 , G06T7/162 , G06T7/10

摘要： A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures.

12.

发明授权
Method and apparatus for multi-dimensional content search and video identification 有权

公开(公告)号：US11281718B2

公开(公告)日：2022-03-22

申请号：US16442404

申请日：2019-06-14

申请人： Roku, Inc.

发明人： Jose Pio Pereira , Mihailo M. Stojancic , Shashank Merchant

IPC分类号： G06F16/00 , G06F16/783 , G06F16/71 , G06F16/901 , G06K9/00 , G06T7/11 , G06T7/12 , G06T7/162 , G06T7/10

摘要： A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures.

13.

发明授权
Media fingerprinting and identification system 有权

公开(公告)号：US11036783B2

公开(公告)日：2021-06-15

申请号：US16365577

申请日：2019-03-26

申请人： Roku, Inc.

发明人： Prashant Ramanathan , Jose Pio Pereira , Shashank Merchant , Mihailo M. Stojancic

IPC分类号： G06F16/00 , G06F16/48 , G06F16/41 , G06F16/28 , G06F16/783 , G06F16/951 , G06F16/22 , G06F16/901 , G06K9/00 , G06K9/46

摘要： The overall architecture and details of a scalable video fingerprinting and identification system that is robust with respect to many classes of video distortions is described. In this system, a fingerprint for a piece of multimedia content is composed of a number of compact signatures, along with traversal hash signatures and associated metadata. Numerical descriptors are generated for features found in a multimedia clip, signatures are generated from these descriptors, and a reference signature database is constructed from these signatures. Query signatures are also generated for a query multimedia clip. These query signatures are searched against the reference database using a fast similarity search procedure, to produce a candidate list of matching signatures. This candidate list is further analyzed to find the most likely reference matches. Signature correlation is performed between the likely reference matches and the query clip to improve detection accuracy.

14.

发明授权
Media content identification on mobile devices 有权

公开(公告)号：US11736762B2

公开(公告)日：2023-08-22

申请号：US17722523

申请日：2022-04-18

申请人： Roku, Inc.

发明人： Mihailo M. Stojancic , Jose Pio Pereira , Peter Wendt , Shashank Merchant , Sunil Suresh Kulkarni

IPC分类号： H04N21/439 , H04N21/8352 , G10H1/00 , H04N21/414 , H04N21/422 , H04N21/4223 , H04N21/433 , H04N21/84 , H04N21/845 , H04N1/32 , H04N21/43

CPC分类号： H04N21/4394 , G10H1/0008 , H04N1/32 , H04N21/41407 , H04N21/4223 , H04N21/42203 , H04N21/439 , H04N21/4332 , H04N21/43072 , H04N21/8352 , H04N21/84 , H04N21/8456 , G10H2210/041 , G10H2210/051 , G10H2240/141 , G10H2250/235

摘要： A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.

15.

发明授权
Media content identification on mobile devices 有权

公开(公告)号：US11729458B2

公开(公告)日：2023-08-15

申请号：US16441949

申请日：2019-06-14

申请人： Roku, Inc.

发明人： Mihailo M. Stojancic , Sunil Suresh Kulkarni , Shashank Merchant , Jose Pio Pereira , Oleksiy Bolgarov

IPC分类号： H04N21/439 , H04N21/84 , H04N21/845 , H04N21/8352 , G10H1/00 , H04N21/414 , H04N21/422 , H04N21/4223 , H04N21/433 , H04N1/32 , H04N21/43

CPC分类号： H04N21/4394 , G10H1/0008 , H04N1/32 , H04N21/41407 , H04N21/4223 , H04N21/42203 , H04N21/439 , H04N21/4332 , H04N21/43072 , H04N21/8352 , H04N21/84 , H04N21/8456 , G10H2210/041 , G10H2210/051 , G10H2240/141 , G10H2250/235

摘要： A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.

16.

发明授权
Robust audio identification with interference cancellation 有权

公开(公告)号：US11631404B2

公开(公告)日：2023-04-18

申请号：US17444946

申请日：2021-08-12

申请人： Roku, Inc.

发明人： Jose Pio Pereira , Sunil Suresh Kulkarni , Mihailo M. Stojancic , Shashank Merchant , Peter Wendt

IPC分类号： G10L15/16 , G10L15/20 , G06T7/246 , G06T7/215 , G06K9/62 , G06F16/00 , G06T7/254 , G06F16/45 , G06F16/48 , G06V10/50 , G06V20/40 , G10L15/02 , G10L15/06 , G10L15/10 , G10L15/14 , G10L21/0232 , G10L25/81 , G06F16/906 , G06F16/44 , G10L21/0216

摘要： Audio distortion compensation methods to improve accuracy and efficiency of audio content identification are described. The method is also applicable to speech recognition. Methods to detect the interference from speakers and sources, and distortion to audio from environment and devices, are discussed. Additional methods to detect distortion to the content after performing search and correlation are illustrated. The causes of actual distortion at each client are measured and registered and learnt to generate rules for determining likely distortion and interference sources. The learnt rules are applied at the client, and likely distortions that are detected are compensated or heavily distorted sections are ignored at audio level or signature and feature level based on compute resources available. Further methods to subtract the likely distortions in the query at both audio level and after processing at signature and feature level are described.

17.

发明申请
Media Content Identification on Mobile Devices 有权

公开(公告)号：US20220239977A1

公开(公告)日：2022-07-28

申请号：US17722523

申请日：2022-04-18

申请人： Roku, Inc.

发明人： Mihailo M. Stojancic , Jose Pio Pereira , Peter Wendt , Shashank Merchant , Sunil Suresh Kulkarni

IPC分类号： H04N21/439 , H04N1/32 , H04N21/8352 , G10H1/00 , H04N21/414 , H04N21/422 , H04N21/4223 , H04N21/433 , H04N21/84 , H04N21/845

摘要： A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.

18.

发明授权
Method to differentiate and classify fingerprints using fingerprint neighborhood analysis 有权

公开(公告)号：US11361017B1

公开(公告)日：2022-06-14

申请号：US16929858

申请日：2020-07-15

申请人： Roku, Inc.

发明人： Sunil Suresh Kulkarni , Pradipkumar Dineshbhai Gajjar , Jose Pio Pereira , Preshant Ramanathan , Mihailo M. Stojancic , Shashank Merchant

IPC分类号： G06F16/45 , G06F16/48 , G06F16/906 , G06F16/44

摘要： Techniques are described that exclude use of “stop-fingerprints” from media database formation and search query to an automatic content recognition (ACR) systems based on media content fingerprints updated by stop-fingerprint analysis. A classification process is presented which takes in fingerprints from reference media files as an input and produces a modified set of fingerprints as an output by applying a novel stop-fingerprint classification algorithm. Architecture for the distributed stop-fingerprint generation is presented. Various cases, as stop-fingerprints generation for the entire reference database, stop-fingerprints generation for the individual reference fingerprint files, and temporal fingerprint classification obtained through intermediate steps of the temporal fingerprint classification algorithm are presented. A hash-based signature classification algorithm is also described.

19.

发明授权
Media fingerprinting and identification system 有权

公开(公告)号：US11188587B2

公开(公告)日：2021-11-30

申请号：US16387456

申请日：2019-04-17

申请人： Roku, Inc.

发明人： Jose Pio Pereira , Sunil Suresh Kulkarni , Shashank Merchant , Prashant Ramanathan , Pradipkumar Dineshbhai Gajjar

IPC分类号： G06F16/00 , G06F16/48 , G06F16/41 , G06F16/28 , G06F16/783 , G06F16/951 , G06F16/22 , G06F16/901 , G06K9/00 , G06K9/46

摘要： The overall architecture and details of a scalable video fingerprinting and identification system that is robust with respect to many classes of video distortions is described. In this system, a fingerprint for a piece of multimedia content is composed of a number of compact signatures, along with traversal hash signatures and associated metadata. Numerical descriptors are generated for features found in a multimedia clip, signatures are generated from these descriptors, and a reference signature database is constructed from these signatures. Query signatures are also generated for a query multimedia clip. These query signatures are searched against the reference database using a fast similarity search procedure, to produce a candidate list of matching signatures. This candidate list is further analyzed to find the most likely reference matches. Signature correlation is performed between the likely reference matches and the query clip to improve detection accuracy.

20.

发明授权
Robust audio identification with interference cancellation 有权

公开(公告)号：US11132997B1

公开(公告)日：2021-09-28

申请号：US16140538

申请日：2018-09-25

申请人： Roku, Inc.

发明人： Jose Pio Pereira , Sunil Suresh Kulkarni , Mihailo M. Stojancic , Shashank Merchant , Peter Wendt

IPC分类号： G10L15/20 , G10L25/81 , G10L21/0232 , G10L15/06 , G10L15/10 , G10L15/14 , G10L15/02 , G10L21/0216

摘要： Audio distortion compensation methods to improve accuracy and efficiency of audio content identification are described. The method is also applicable to speech recognition. Methods to detect the interference from speakers and sources, and distortion to audio from environment and devices are discussed. Additional methods to detect distortion to the content after performing search and correlation are illustrated. The causes of actual distortion at each client are measured and registered and learnt to generate rules for determining likely distortion and interference sources. The learnt rules are applied at the client, and likely distortions that are detected are compensated or heavily distorted sections are ignored at audio level or signature and feature level based on compute resources available. Further methods to subtract the likely distortions in the query at both audio level and after processing at signature and feature level are described.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类