SYSTEMS AND METHODS FOR DETECTING MOMENTS WITHIN VIDEOS

    公开(公告)号:US20200176030A1

    公开(公告)日:2020-06-04

    申请号:US16784782

    申请日:2020-02-07

    Applicant: GoPro, Inc.

    Abstract: Video information defining video content may be obtained. The video content may include video frames and may have a progress length. The video frames may be encoded into video packets, with the video packets being of particular sizes. One or more size criteria for detecting a given moment within the video content may be obtained. The sizes of the video packets may be compared with the one or more size criteria. One or more sets of the video packets that satisfy the one or more size criteria may be identified. One or more portions of the video content having video frames defined by the set(s) of video packets that satisfy the one or more size criteria may be identified as the given moment within the video content. Storage of the identification of the given moment within the video content in a storage medium may be effectuated.

    Systems and methods for identifying speech based on spectral features

    公开(公告)号:US10546598B2

    公开(公告)日:2020-01-28

    申请号:US16542871

    申请日:2019-08-16

    Applicant: GoPro, Inc.

    Inventor: Tom Médioni

    Abstract: Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. The audio segments may include a first audio segment corresponding to a first portion of the duration. Energy features, entropy features, frequency features, and/or other features of the audio segments may be determined. Energy features may characterize energy of the audio segments. Entropy features may characterize spectral flatness of the audio segments. Frequency features may characterize highest frequencies of the audio segments. One or more of the audio segments may be identified as containing speech based on the energy features, the entropy features, the frequency features, and/or other information. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.

    Systems and methods for identifying speech based on cepstral coefficients and support vector machines

    公开(公告)号:US10403303B1

    公开(公告)日:2019-09-03

    申请号:US15802115

    申请日:2017-11-02

    Applicant: GoPro, Inc.

    Abstract: Audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. Mel frequency spectral power features, Mel frequency cepstral coefficient features, and energy features of the audio segments may be determined. Feature vectors of the audio segments may be determined based on the Mel frequency spectral power features, the Mel frequency cepstral coefficient features, and the energy features. The feature vectors may be processed through a support vector machine. The support vector machine may output predictions on whether the audio segments contain speech. One or more of the audio segments may be identified as containing speech based on filtering the predictions and comparing the filtered predictions to a threshold. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.

    Systems and methods for identifying speech based on spectral features

    公开(公告)号:US10431242B1

    公开(公告)日:2019-10-01

    申请号:US15802145

    申请日:2017-11-02

    Applicant: GoPro, Inc.

    Inventor: Tom Médioni

    Abstract: Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. The audio segments may include a first audio segment corresponding to a first portion of the duration. Energy features, entropy features, frequency features, and/or other features of the audio segments may be determined. Energy features may characterize energy of the audio segments. Entropy features may characterize spectral flatness of the audio segments. Frequency features may characterize highest frequencies of the audio segments. One or more of the audio segments may be identified as containing speech based on the energy features, the entropy features, the frequency features, and/or other information. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.

    Systems and methods for detecting moments within videos

    公开(公告)号:US10403326B1

    公开(公告)日:2019-09-03

    申请号:US15874407

    申请日:2018-01-18

    Applicant: GoPro, Inc.

    Abstract: Video information defining video content may be obtained. The video content may include video frames and may have a progress length. The video frames may be encoded into video packets, with the video packets being of particular sizes. One or more size criteria for detecting a given moment within the video content may be obtained. The sizes of the video packets may be compared with the one or more size criteria. One or more sets of the video packets that satisfy the one or more size criteria may be identified. One or more portions of the video content having video frames defined by the set(s) of video packets that satisfy the one or more size criteria may be identified as the given moment within the video content. Storage of the identification of the given moment within the video content in a storage medium may be effectuated.

    Three-dimensional convolutional neural networks for video highlight detection

    公开(公告)号:US09836853B1

    公开(公告)日:2017-12-05

    申请号:US15256874

    申请日:2016-09-06

    Applicant: GOPRO, INC.

    Inventor: Tom Médioni

    Abstract: A three-dimensional convolutional neural network may include a preliminary layer group, one or more intermediate layer groups, a final layer group, and/or other layers/layer groups. The preliminary layer group may include an input layer, a preliminary three-dimensional padding layer, a preliminary three-dimensional convolution layer, a preliminary activation layer, a preliminary normalization layer, and a preliminary downsampling layer. One or more intermediate layer groups may include an intermediate three-dimensional squeeze layer, a first intermediate normalization layer, an intermediate three-dimensional padding layer, a first intermediate three-dimensional expand layer, a second intermediate three-dimensional expand layer, an intermediate concatenation layer, a second intermediate normalization layer, an intermediate activation layer, and an intermediate combination layer. The final layer group may include a final dropout layer, a final three-dimensional convolution layer, a final activation layer, a final normalization layer, a final three-dimensional downsampling layer, and a final flatten layer.

    Systems and methods for detecting moments within videos

    公开(公告)号:US10957359B2

    公开(公告)日:2021-03-23

    申请号:US16784782

    申请日:2020-02-07

    Applicant: GoPro, Inc.

    Abstract: Video information defining video content may be obtained. The video content may include video frames and may have a progress length. The video frames may be encoded into video packets, with the video packets being of particular sizes. One or more size criteria for detecting a given moment within the video content may be obtained. The sizes of the video packets may be compared with the one or more size criteria. One or more sets of the video packets that satisfy the one or more size criteria may be identified. One or more portions of the video content having video frames defined by the set(s) of video packets that satisfy the one or more size criteria may be identified as the given moment within the video content. Storage of the identification of the given moment within the video content in a storage medium may be effectuated.

    Systems and methods for detecting moments within videos

    公开(公告)号:US10559325B2

    公开(公告)日:2020-02-11

    申请号:US16541461

    申请日:2019-08-15

    Applicant: GoPro, Inc.

    Abstract: Video information defining video content may be obtained. The video content may include video frames and may have a progress length. The video frames may be encoded into video packets, with the video packets being of particular sizes. One or more size criteria for detecting a given moment within the video content may be obtained. The sizes of the video packets may be compared with the one or more size criteria. One or more sets of the video packets that satisfy the one or more size criteria may be identified. One or more portions of the video content having video frames defined by the set(s) of video packets that satisfy the one or more size criteria may be identified as the given moment within the video content. Storage of the identification of the given moment within the video content in a storage medium may be effectuated.

    SYSTEMS AND METHODS FOR DETECTING MOMENTS WITHIN VIDEOS

    公开(公告)号:US20190371365A1

    公开(公告)日:2019-12-05

    申请号:US16541461

    申请日:2019-08-15

    Applicant: GoPro, Inc.

    Abstract: Video information defining video content may be obtained. The video content may include video frames and may have a progress length. The video frames may be encoded into video packets, with the video packets being of particular sizes. One or more size criteria for detecting a given moment within the video content may be obtained. The sizes of the video packets may be compared with the one or more size criteria. One or more sets of the video packets that satisfy the one or more size criteria may be identified. One or more portions of the video content having video frames defined by the set(s) of video packets that satisfy the one or more size criteria may be identified as the given moment within the video content. Storage of the identification of the given moment within the video content in a storage medium may be effectuated.

    SYSTEMS AND METHODS FOR IDENTIFYING SPEECH BASED ON SPECTRAL FEATURES

    公开(公告)号:US20190371358A1

    公开(公告)日:2019-12-05

    申请号:US16542871

    申请日:2019-08-16

    Applicant: GoPro, Inc.

    Inventor: Tom Médioni

    Abstract: Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. The audio segments may include a first audio segment corresponding to a first portion of the duration. Energy features, entropy features, frequency features, and/or other features of the audio segments may be determined. Energy features may characterize energy of the audio segments. Entropy features may characterize spectral flatness of the audio segments. Frequency features may characterize highest frequencies of the audio segments. One or more of the audio segments may be identified as containing speech based on the energy features, the entropy features, the frequency features, and/or other information. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.

Patent Agency Ranking