-
Publication number: US11089373B2
Publication date: 2021-08-10
Application number: US15859284
Application date: 2017-12-29
Applicant: SLING MEDIA PVT LTD
Inventor: Kiran Chittella, Bharani Gopinath, Rueju Namath, Jayaprakash Ramaraj, Arunoday Thammineni, Varunkumar Tripathi
IPC: H04N21/472, H04N21/2387, H04N5/783, H04N21/6587, H04N21/2187, H04N21/2343
Abstract: Seek and other trick play functions can be improved in placeshifting and similarly live-encoded video streams. Thumbnail images are derived from I-frames (or similar key frames) of the source video stream rather than from the live-encoded stream. The thumbnail images are tagged to indicate a presentation time stamp (PTS) or similar identification of the source video frame that was used to create the thumbnail image. The tagged thumbnails are provided to the media player, which renders the images to indicate different portions of the video stream as the viewer scans or performs other functions. When the viewer selects to skip to a different part of the video stream, the PTS or similar identifier associated with the presented thumbnail image is sent to the placeshifting encoder to identify the appropriate starting point to resume live encoding.
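Illustrative note: the seek flow described in this abstract can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the `Thumbnail` class, the helper names, the 90 kHz PTS units, and the seek message fields are all hypothetical and do not come from the patent text.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class Thumbnail:
    pts: int      # PTS of the source I-frame (90 kHz ticks assumed for illustration)
    image: bytes  # encoded thumbnail data (placeholder bytes here)


def build_index(iframes):
    """Build a PTS-ordered thumbnail list from (pts, image_bytes) pairs
    taken from key frames of the source stream."""
    return sorted((Thumbnail(pts, img) for pts, img in iframes),
                  key=lambda t: t.pts)


def thumbnail_for(index, scrub_pts):
    """Return the thumbnail of the last key frame at or before scrub_pts.
    Positions before the first key frame fall back to the first thumbnail."""
    keys = [t.pts for t in index]
    i = bisect_right(keys, scrub_pts) - 1
    return index[max(i, 0)]


def seek_request(thumb):
    """Message the player might send to the encoder; field names are hypothetical."""
    return {"command": "seek", "pts": thumb.pts}


# The viewer scrubs to PTS 100000; the nearest preceding key frame is at 90000.
index = build_index([(0, b"a"), (180000, b"b"), (90000, b"c")])
chosen = thumbnail_for(index, 100000)
request = seek_request(chosen)
```

Because the request carries the PTS tagged on the thumbnail rather than the scrub position, the encoder in this sketch would resume at a frame that actually exists in the source stream.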
-
Publication number: US11133025B2
Publication date: 2021-09-28
Application number: US16677324
Application date: 2019-11-07
Applicant: SLING MEDIA PVT LTD
Inventor: Yatish Jayant Naik Raikar, Varunkumar Tripathi, Kiran Chittella, Vinayak Kulkarni
Abstract: A method for speech emotion recognition for enriching speech to text communications between users in speech chat sessions including: implementing a speech emotion recognition model to enable converting observed emotions in speech samples to enrich text with visual emotion content by: generating a data set of speech samples with labels of a plurality of emotion classes; extracting a set of acoustic features from each of the emotion classes; generating a machine learning (ML) model based on the acoustic features and data set; training the ML model from acoustic features from speech samples during speech chat sessions; predicting emotion content based on a trained ML model in the observed speech; generating enriched text based on predicted emotion content of the trained ML model; and presenting the enriched text in speech to text communications between users in the chat session for visual notice of an observed emotion in the speech sample.
-
Publication number: US20160191959A1
Publication date: 2016-06-30
Application number: US14985698
Application date: 2015-12-31
Applicant: SLING MEDIA PVT LTD
Inventor: Kiran Chittella, Yatish J. Naik Raikar
IPC: H04N21/234, H04N21/84, G06F17/27, H04N21/242, H04N21/43, H04N21/858, H04N21/488, H04N21/44
Abstract: Timed text that is provided in a television broadcast or media stream can be enhanced to provide an improved user experience. A scrollable text window can be provided in a media player application, for example, that can allow the user to quickly “catchup” from a missed moment. The timed text may be enhanced to allow links to dictionaries, encyclopedias, online sources, thesauruses, translating services, and/or the like. Further implementations could use automated tools to automatically generate program summaries for watched or unwatched content.
-
Publication number: US11688416B2
Publication date: 2023-06-27
Application number: US17446385
Application date: 2021-08-30
Applicant: SLING MEDIA PVT LTD
Inventor: Yatish Jayant Naik Raikar, Varunkumar Tripathi, Kiran Chittella, Vinayak Kulkarni
CPC classification number: G10L25/63, G10L15/02, G10L15/22, G10L15/26, G10L2015/027
Abstract: Systems and methods enrich speech to text communications between users in speech chat sessions using a speech emotion recognition model to convert observed emotions in speech samples to enrich text with visual emotion content. The method may include generating a data set of speech samples with labels of a plurality of emotion classes, selecting a set of acoustic features from each of the emotion classes, generating a machine learning (ML) model based on the acoustic features and data set, applying the set of rules based on the selected set of acoustic features and data set, computing a number of rules that have been satisfied, and presenting the enriched text in speech-to-text communications between users in the chat session for visual notice of an observed emotion in the speech sample.
-
Publication number: US10796089B2
Publication date: 2020-10-06
Application number: US14985698
Application date: 2015-12-31
Applicant: SLING MEDIA PVT LTD
Inventor: Kiran Chittella, Yatish J. Naik Raikar
IPC: G06F40/247, H04N21/488, H04N21/43, H04N21/858, H04N21/8405, H04N21/431, H04N21/234, H04N21/23, H04N21/235, H04N21/242
Abstract: Timed text that is provided in a television broadcast or media stream can be enhanced to provide an improved user experience. A scrollable text window can be provided in a media player application, for example, that can allow the user to quickly “catchup” from a missed moment. The timed text may be enhanced to allow links to dictionaries, encyclopedias, online sources, thesauruses, translating services, and/or the like. Further implementations could use automated tools to automatically generate program summaries for watched or unwatched content.
-
Publication number: US09544643B2
Publication date: 2017-01-10
Application number: US14574538
Application date: 2014-12-18
Applicant: Sling Media PVT Ltd
Inventor: Kiran Chittella, Arunoday Thammineni, Varunkumar Tripathi
IPC: H04N7/173, H04N21/4402, H04N21/41, H04N21/433, H04N21/436, H04N21/845, H04H60/80, H04N21/2343, H04N21/234
CPC classification number: H04N21/440218, H04H60/80, H04N21/234, H04N21/234309, H04N21/4126, H04N21/433, H04N21/436, H04N21/440245, H04N21/8455, H04N21/8456
Abstract: Media content is downloaded from a remote source. A request is received from a client device for sideloading of the media content. Sideloading of the media content to the client device is begun when the downloading has begun and the request has been received.
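Illustrative note: the start condition in this abstract (sideloading begins once the download has begun and a client request has been received) can be sketched as a small state holder. This is a hedged sketch only: the `SideloadSession` class and its method names are hypothetical, and forwarding chunks as they arrive is an assumption about how "begun" would work in practice, not something the abstract spells out.

```python
from dataclasses import dataclass, field


@dataclass
class SideloadSession:
    download_started: bool = False
    request_received: bool = False
    downloaded: bytearray = field(default_factory=bytearray)
    sent: int = 0  # number of bytes already forwarded to the client

    def on_download_chunk(self, chunk: bytes) -> None:
        """Record data arriving from the remote source."""
        self.download_started = True
        self.downloaded.extend(chunk)

    def on_sideload_request(self) -> None:
        """Record that the client device asked for sideloading."""
        self.request_received = True

    def can_sideload(self) -> bool:
        # Both conditions from the abstract must hold.
        return self.download_started and self.request_received

    def next_chunk(self) -> bytes:
        """Return downloaded bytes not yet forwarded, or b"" if not ready."""
        if not self.can_sideload():
            return b""
        chunk = bytes(self.downloaded[self.sent:])
        self.sent = len(self.downloaded)
        return chunk


session = SideloadSession()
session.on_download_chunk(b"abc")
before = session.next_chunk()   # download begun, but no request yet
session.on_sideload_request()
first = session.next_chunk()    # both conditions now hold
session.on_download_chunk(b"de")
second = session.next_chunk()   # only the newly downloaded bytes
```

In this sketch the client starts receiving data before the download finishes, which is the behavior the abstract's wording suggests.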
-
Publication number: US20160182947A1
Publication date: 2016-06-23
Application number: US14574538
Application date: 2014-12-18
Applicant: Sling Media PVT Ltd
Inventor: Kiran Chittella, Arunoday Thammineni, Varunkumar Tripathi
IPC: H04N21/4402, H04N21/845, H04N21/436, H04N21/41, H04N21/433
CPC classification number: H04N21/440218, H04H60/80, H04N21/234, H04N21/234309, H04N21/4126, H04N21/433, H04N21/436, H04N21/440245, H04N21/8455, H04N21/8456
Abstract: Media content is downloaded from a remote source. A request is received from a client device for sideloading of the media content. Sideloading of the media content to the client device is begun when the downloading has begun and the request has been received.
-
Publication number: US20180255362A1
Publication date: 2018-09-06
Application number: US15859284
Application date: 2017-12-29
Applicant: SLING MEDIA PVT LTD
Inventor: Kiran Chittella, Bharani Gopinath, Rueju Namath, Jayaprakash Ramaraj, Arunoday Thammineni, Varunkumar Tripathi
IPC: H04N21/472, H04N21/2187, H04N21/2387
CPC classification number: H04N21/47217, H04N5/783, H04N21/2187, H04N21/234363, H04N21/2387, H04N21/47202, H04N21/6587, H04N2201/325
Abstract: Seek and other trick play functions can be improved in placeshifting and similarly live-encoded video streams. Thumbnail images are derived from I-frames (or similar key frames) of the source video stream rather than from the live-encoded stream. The thumbnail images are tagged to indicate a presentation time stamp (PTS) or similar identification of the source video frame that was used to create the thumbnail image. The tagged thumbnails are provided to the media player, which renders the images to indicate different portions of the video stream as the viewer scans or performs other functions. When the viewer selects to skip to a different part of the video stream, the PTS or similar identifier associated with the presented thumbnail image is sent to the placeshifting encoder to identify the appropriate starting point to resume live encoding.
-