-
公开(公告)号:US11238888B2
公开(公告)日:2022-02-01
申请号:US16732142
申请日:2019-12-31
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Shilpa Jois Rao , Murthy Parthasarathi , Kyle Tacke
Abstract: The disclosed computer-implemented method may include obtaining an audio sample from a content source, inputting the obtained audio sample into a trained machine learning model, obtaining the output of the trained machine learning model, wherein the output is a profile of an environment in which the input audio sample was recorded, obtaining an acoustic impulse response corresponding to the profile of the environment in which the input audio sample was recorded, obtaining a second audio sample, processing the obtained acoustic impulse response with the second audio sample, and inserting a result of processing the obtained acoustic impulse response and the second audio sample into an audio track. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US11924481B2
公开(公告)日:2024-03-05
申请号:US18186366
申请日:2023-03-20
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Chih-Wei Wu , Kyle Tacke , Shilpa Jois Rao , Boney Sekh , Andrew Swan , Raja Ranjan Senapati
IPC: H04N21/2343 , G06Q10/0631 , G11B27/031 , G11B27/10 , H04N21/234
CPC classification number: H04N21/2343 , G06Q10/06312 , G11B27/031 , G11B27/10 , H04N21/23412 , H04N21/23418
Abstract: The disclosed computer-implemented method may include (1) accessing a first media data object and a different, second media data object that, when played back, each render temporally sequenced content, (2) comparing first temporally sequenced content represented by the first media data object with second temporally sequenced content represented by the second media data object to identify a set of common temporal subsequences between the first media data object and the second media data object, (3) identifying a set of edits relative to the set of common temporal subsequences that describe a difference between the temporally sequenced content of the first media data object and the temporally sequenced content of the second media data object, and (4) executing a workflow relating to the first media data object and/or the second media data object based on the set of edits. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US20220115030A1
公开(公告)日:2022-04-14
申请号:US17555175
申请日:2021-12-17
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Shilpa Jois Rao , Murthy Parthasarathi , Kyle Tacke
Abstract: The disclosed computer-implemented method may include obtaining an audio sample from a content source, inputting the obtained audio sample into a trained machine learning model, obtaining the output of the trained machine learning model, wherein the output is a profile of an environment in which the input audio sample was recorded, obtaining an acoustic impulse response corresponding to the profile of the environment in which the input audio sample was recorded, obtaining a second audio sample, processing the obtained acoustic impulse response with the second audio sample, and inserting a result of processing the obtained acoustic impulse response and the second audio sample into an audio track. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US20220021911A1
公开(公告)日:2022-01-20
申请号:US17245252
申请日:2021-04-30
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Chih-Wei Wu , Kyle Tacke , Shilpa Jois Rao , Boney Sekh , Andrew Swan , Raja Ranjan Senapati
IPC: H04N21/2343 , H04N21/234 , G06Q10/06
Abstract: The disclosed computer-implemented method may include (1) accessing a first media data object and a different, second media data object that, when played back, each render temporally sequenced content, (2) comparing first temporally sequenced content represented by the first media data object with second temporally sequenced content represented by the second media data object to identify a set of common temporal subsequences between the first media data object and the second media data object, (3) identifying a set of edits relative to the set of common temporal subsequences that describe a difference between the temporally sequenced content of the first media data object and the temporally sequenced content of the second media data object, and (4) executing a workflow relating to the first media data object and/or the second media data object based on the set of edits. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US20210151082A1
公开(公告)日:2021-05-20
申请号:US16747314
申请日:2020-01-20
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Murthy Parthasarathi , Andrew Swan , Raja Ranjan Senapati , Shilpa Jois Rao , Anjali Chablani , Kyle Tacke
IPC: G11B27/036 , G11B27/034 , H04N21/84 , H04N21/81 , H04N21/485 , G10L13/04 , G10L13/08
Abstract: The disclosed computer-implemented method may include accessing an audio track that is associated with a video recording, identifying a section of the accessed audio track having a specific audio characteristic, reducing a volume level of the audio track in the identified section, accessing an audio segment that includes a synthesized voice and inserting the accessed audio segment into the identified section of the audio track, where the inserted segment has a higher volume level than the reduced volume level of the audio track in the identified section. The synthesized voice description can be used to provide additional information to a visually impaired viewer without interrupting the audio track that is associated with the video recording, typically by inserting the synthesized voice description into a segment of the audio track in which there is no dialog. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US20230232055A1
公开(公告)日:2023-07-20
申请号:US18186366
申请日:2023-03-20
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Chih-Wei Wu , Kyle Tacke , Shilpa Jois Rao , Boney Sekh , Andrew Swan , Raja Ranjan Senapati
IPC: H04N21/2343 , H04N21/234 , G06Q10/0631 , G11B27/10 , G11B27/031
CPC classification number: H04N21/2343 , H04N21/23412 , G06Q10/06312 , H04N21/23418 , G11B27/10 , G11B27/031
Abstract: The disclosed computer-implemented method may include (1) accessing a first media data object and a different, second media data object that, when played back, each render temporally sequenced content, (2) comparing first temporally sequenced content represented by the first media data object with second temporally sequenced content represented by the second media data object to identify a set of common temporal subsequences between the first media data object and the second media data object, (3) identifying a set of edits relative to the set of common temporal subsequences that describe a difference between the temporally sequenced content of the first media data object and the temporally sequenced content of the second media data object, and (4) executing a workflow relating to the first media data object and/or the second media data object based on the set of edits. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US20210201931A1
公开(公告)日:2021-07-01
申请号:US16732142
申请日:2019-12-31
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Shilpa Jois Rao , Murthy Parthasarathi , Kyle Tacke
Abstract: The disclosed computer-implemented method may include obtaining an audio sample from a content source, inputting the obtained audio sample into a trained machine learning model, obtaining the output of the trained machine learning model, wherein the output is a profile of an environment in which the input audio sample was recorded, obtaining an acoustic impulse response corresponding to the profile of the environment in which the input audio sample was recorded, obtaining a second audio sample, processing the obtained acoustic impulse response with the second audio sample, and inserting a result of processing the obtained acoustic impulse response and the second audio sample into an audio track. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US11983923B1
公开(公告)日:2024-05-14
申请号:US18063107
申请日:2022-12-08
Applicant: NETFLIX, INC.
Inventor: Yadong Wang , Kyle Tacke , Shilpa Jois Rao
IPC: G06V20/40 , G10L25/57 , G10L25/60 , G11B27/031
CPC classification number: G06V20/41 , G06V20/48 , G10L25/57 , G10L25/60 , G11B27/031 , G06V2201/10
Abstract: The disclosed computer-implemented method may include receiving, as input, an audio/video data object; isolating a video stream of a visible potential speaker over a plurality of frames of the audio/video data object; isolating an audio stream over the plurality of frames; providing the isolated video stream and the isolated audio stream to a machine learning model trained with contrastive learning, the contrastive learning using (i) a corpus of video segments of visible speakers with corresponding original audio for positive samples; and (ii) a corpus of video segments of visible speakers with corresponding dubbed audio for negative samples; and evaluating a match between the isolated audio stream and the isolated video stream based at least in part on an output of the machine learning model. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US11659214B2
公开(公告)日:2023-05-23
申请号:US17245252
申请日:2021-04-30
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Chih-Wei Wu , Kyle Tacke , Shilpa Jois Rao , Boney Sekh , Andrew Swan , Raja Ranjan Senapati
IPC: H04N21/2343 , G11B27/10 , G11B27/031 , H04N21/234 , G06Q10/0631
CPC classification number: H04N21/2343 , G06Q10/06312 , G11B27/031 , G11B27/10 , H04N21/23412 , H04N21/23418
Abstract: The disclosed computer-implemented method may include (1) accessing a first media data object and a different, second media data object that, when played back, each render temporally sequenced content, (2) comparing first temporally sequenced content represented by the first media data object with second temporally sequenced content represented by the second media data object to identify a set of common temporal subsequences between the first media data object and the second media data object, (3) identifying a set of edits relative to the set of common temporal subsequences that describe a difference between the temporally sequenced content of the first media data object and the temporally sequenced content of the second media data object, and (4) executing a workflow relating to the first media data object and/or the second media data object based on the set of edits. Various other methods, systems, and computer-readable media are also disclosed.
-
公开(公告)号:US11430485B2
公开(公告)日:2022-08-30
申请号:US16747314
申请日:2020-01-20
Applicant: Netflix, Inc.
Inventor: Yadong Wang , Murthy Parthasarathi , Andrew Swan , Raja Ranjan Senapati , Shilpa Jois Rao , Anjali Chablani , Kyle Tacke
IPC: G11B27/00 , H04N5/93 , G11B27/036 , G11B27/034 , H04N21/84 , G10L13/08 , H04N21/485 , H04N21/81 , G10L13/00 , H04N9/80
Abstract: The disclosed computer-implemented method may include accessing an audio track that is associated with a video recording, identifying a section of the accessed audio track having a specific audio characteristic, reducing a volume level of the audio track in the identified section, accessing an audio segment that includes a synthesized voice and inserting the accessed audio segment into the identified section of the audio track, where the inserted segment has a higher volume level than the reduced volume level of the audio track in the identified section. The synthesized voice description can be used to provide additional information to a visually impaired viewer without interrupting the audio track that is associated with the video recording, typically by inserting the synthesized voice description into a segment of the audio track in which there is no dialog. Various other methods, systems, and computer-readable media are also disclosed.
-
-
-
-
-
-
-
-
-