-
Publication No.: US11589120B2
Publication Date: 2023-02-21
Application No.: US16799232
Filing Date: 2020-02-24
Applicant: Synaptics Incorporated
Inventor: Utkarsh Gaur, Adil Ilyas Jagmag, Gaurav Arora
IPC: G06F3/00, G06F13/00, H04N5/445, H04N21/466, G06F3/16, H04N21/475
Abstract: A method and apparatus for deep content tagging. A media device receives one or more first frames of a content item, where the one or more first frames span a duration of a scene in the content item. The media device detects one or more objects or features in each of the first frames using a neural network model and identifies one or more first genres associated with the first frames based at least in part on the detected objects or features in each of the first frames. The media device further controls playback of the content item based at least in part on the identified first genres.
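Illustrative note: the abstract above does not say how detected objects are mapped to genres. The sketch below shows one possible reading, assuming a hypothetical lookup table (`LABEL_TO_GENRE`), a hypothetical function name (`genres_for_scene`), and a hypothetical per-scene voting threshold (`min_share`). None of these names or values come from the patent, and the neural network detector is replaced by precomputed label sets.

```python
from collections import Counter

# Hypothetical label-to-genre table; the abstract does not specify one.
LABEL_TO_GENRE = {
    "gun": "action",
    "explosion": "action",
    "ball": "sports",
    "stadium": "sports",
}


def genres_for_scene(frame_labels, min_share=0.5):
    """Return genres supported by at least `min_share` of the scene's frames.

    `frame_labels` is a list with one set of detected object labels per frame.
    In a real system these labels would come from a detector; here they are
    supplied directly (an assumption made to keep the sketch self-contained).
    """
    if not frame_labels:
        return []
    votes = Counter()
    for labels in frame_labels:
        # Count each genre at most once per frame.
        votes.update({LABEL_TO_GENRE[l] for l in labels if l in LABEL_TO_GENRE})
    needed = min_share * len(frame_labels)
    return sorted(genre for genre, n in votes.items() if n >= needed)


scene = [{"gun"}, {"explosion", "ball"}, {"car"}]
print(genres_for_scene(scene))
```

A playback controller could then consult the returned genre list, for example to skip or flag scenes, which is the kind of control the abstract refers to in general terms.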
-
Publication No.: US11785068B2
Publication Date: 2023-10-10
Application No.: US17139905
Filing Date: 2020-12-31
Applicant: SYNAPTICS INCORPORATED
Inventor: Vladan Petrovic, Utkarsh Gaur, Pontus Lidman
IPC: H04L65/70, H04N21/2662, G06N3/08, H04N21/258, H04L65/403, H04N21/24, H04L65/75
CPC classification number: H04L65/70, G06N3/08, H04L65/403, H04L65/75, H04N21/2402, H04N21/25825, H04N21/2662
Abstract: Systems and methods for streaming video content include downscaling video content using a downscaling model to generate downscaled video content and downloading the downscaled video content as a video stream and corresponding upscaling model to a client device. The system converts received video frames to a video memory format comprising channels having the same memory allocation size, each subsequent channel arranged in an adjacent memory location, for input to the downscaling model. The client device upscales the video stream using the received upscaling model for display by the client device in real-time. A training system trains the downscaling model to generate the downscaled video content, based on associated metadata identifying a type of video content. The downscaled video content and associated upscaling models are stored for access by an edge server, which downloads upscaling models to a client device to select an upscaling model.
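Illustrative note: the memory format described above (equal-sized channels stored back to back) resembles a planar layout. The following minimal sketch, assuming an interleaved 8-bit input such as RGBRGB..., rearranges the samples into contiguous planes. The function names `interleaved_to_planar` and `plane_offsets` are hypothetical, and the patent's actual pixel format and conversion are not specified in the abstract.

```python
def interleaved_to_planar(pixels: bytes, channels: int = 3) -> bytes:
    """Rearrange interleaved samples (RGBRGB...) into planes (RR..GG..BB..).

    Every plane has the same size, and each plane starts right after the
    previous one, so the result is a single contiguous buffer.
    """
    if len(pixels) % channels != 0:
        raise ValueError("buffer length must be a multiple of the channel count")
    # A strided slice pulls out every sample belonging to one channel.
    return b"".join(pixels[c::channels] for c in range(channels))


def plane_offsets(total_len: int, channels: int = 3) -> list:
    """Byte offset at which each equal-sized plane begins."""
    plane_size = total_len // channels
    return [c * plane_size for c in range(channels)]


frame = bytes([1, 2, 3, 4, 5, 6])  # two RGB pixels: (1, 2, 3) and (4, 5, 6)
planar = interleaved_to_planar(frame)
print(list(planar), plane_offsets(len(planar)))
```

With equal-sized adjacent planes, a consumer can locate any channel from the buffer length and channel count alone, which is one plausible reason such a layout suits model input.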
-
Publication No.: US11120569B2
Publication Date: 2021-09-14
Application No.: US16450832
Filing Date: 2019-06-24
Applicant: Synaptics Incorporated
Inventor: Boyan Ivanov Bonev, Utkarsh Gaur
Abstract: A method and apparatus for estimating a user's head pose relative to a sensing device. The sensing device detects a face of the user in an image. The sensing device further identifies a plurality of points in the image corresponding to respective features of the detected face. The plurality of points includes at least a first point corresponding to a location of a first facial feature. The sensing device determines a position of the face relative to the sensing device based at least in part on a distance between the first point in the image and one or more of the remaining points. For example, the sensing device may determine a pitch, yaw, distance, or location of the user's face relative to the sensing device.
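Illustrative note: the abstract states that position is derived from distances between facial points but gives no formula. One common way to turn such a pixel distance into a range estimate is the pinhole camera relation, sketched below. This is a swapped-in textbook technique, not the patented method; the function name `estimate_distance_mm`, the focal length parameter, and the nominal 63 mm eye separation are all assumptions.

```python
import math


def estimate_distance_mm(p1, p2, focal_length_px, real_separation_mm=63.0):
    """Estimate camera-to-face distance from two facial landmarks.

    Pinhole model: distance = focal_length * real_size / size_in_pixels.
    `real_separation_mm` is an assumed nominal distance between the two
    features (here, the eyes). The sketch ignores head rotation, which
    shortens the apparent pixel distance.
    """
    pixel_separation = math.dist(p1, p2)
    if pixel_separation == 0:
        raise ValueError("landmarks must be distinct points")
    return focal_length_px * real_separation_mm / pixel_separation


left_eye, right_eye = (100.0, 200.0), (163.0, 200.0)
print(estimate_distance_mm(left_eye, right_eye, focal_length_px=1000.0))
```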
-
Publication No.: US20200275158A1
Publication Date: 2020-08-27
Application No.: US16799232
Filing Date: 2020-02-24
Applicant: Synaptics Incorporated
Inventor: Utkarsh Gaur, Adil Ilyas Jagmag, Gaurav Arora
IPC: H04N21/466, H04N21/475, G06F3/16
Abstract: A method and apparatus for deep content tagging. A media device receives one or more first frames of a content item, where the one or more first frames span a duration of a scene in the content item. The media device detects one or more objects or features in each of the first frames using a neural network model and identifies one or more first genres associated with the first frames based at least in part on the detected objects or features in each of the first frames. The media device further controls playback of the content item based at least in part on the identified first genres.
-
Publication No.: US12126667B2
Publication Date: 2024-10-22
Application No.: US18447261
Filing Date: 2023-08-09
Applicant: Synaptics Incorporated
Inventor: Vladan Petrovic, Utkarsh Gaur, Pontus Lidman
IPC: H04L65/70, G06N3/08, H04L65/403, H04L65/75, H04N21/24, H04N21/258, H04N21/2662
CPC classification number: H04L65/70, G06N3/08, H04L65/403, H04L65/75, H04N21/2402, H04N21/25825, H04N21/2662
Abstract: Systems and methods for streaming video content include downscaling video content using a downscaling model to generate downscaled video content and downloading the downscaled video content as a video stream and corresponding upscaling model to a client device. The system converts received video frames to a video memory format comprising channels having the same memory allocation size, each subsequent channel arranged in an adjacent memory location, for input to the downscaling model. The client device upscales the video stream using the received upscaling model for display by the client device in real-time. A training system trains the downscaling model to generate the downscaled video content, based on associated metadata identifying a type of video content. The downscaled video content and associated upscaling models are stored for access by an edge server, which downloads upscaling models to a client device to select an upscaling model.
-
Publication No.: US11082460B2
Publication Date: 2021-08-03
Application No.: US16455668
Filing Date: 2019-06-27
Applicant: SYNAPTICS INCORPORATED
Inventor: Francesco Nesta, Boyan Bonev, Utkarsh Gaur
Abstract: Systems and methods for audio signal enhancement facilitated using video data are provided. In one example, a method includes receiving a multi-channel audio signal including audio inputs detected by a plurality of audio input devices. The method further includes receiving an image captured by a video input device. The method further includes determining a first signal based at least in part on the image. The first signal is indicative of a likelihood associated with a target audio source. The method further includes determining a second signal based at least in part on the multi-channel audio signal and the first signal. The second signal is indicative of a likelihood associated with an audio component attributed to the target audio source. The method further includes processing the multi-channel audio signal based at least in part on the second signal to generate an output audio signal.
-
Publication No.: US11079911B2
Publication Date: 2021-08-03
Application No.: US16553998
Filing Date: 2019-08-28
Applicant: Synaptics Incorporated
Inventor: Utkarsh Gaur, Gaurav Arora
IPC: G06F16/56, G06F3/0484, G06F9/451, G06F16/22, G06K9/00, G06F16/583, G06N3/08, G06F16/2457
Abstract: A method and apparatus for device personalization. A device is configured to receive first sensor data from one or more sensors, detect biometric information in the first sensor data, encode the biometric information as a first vector using one or more neural network models stored on the device, and configure a user interface of the device based at least in part on the first vector. For example, the profile information may include configurations, settings, preferences, or content to be displayed or rendered via the user interface. In some implementations, the first sensor data may comprise an image of a scene and the biometric information may comprise one or more facial features of a user in the scene.
-
Publication No.: US20200273485A1
Publication Date: 2020-08-27
Application No.: US16799263
Filing Date: 2020-02-24
Applicant: Synaptics Incorporated
Inventor: Adil Ilyas Jagmag, Utkarsh Gaur, Gaurav Arora
Abstract: A method and apparatus for user engagement detection. A media device captures sensor data via one or more sensors while concurrently playing back a first content item. The media device detects one or more reactions to the first content item by one or more users based at least in part on the sensor data and controls a media playback interface used to play back the first content item based at least in part on the detected reactions.