-
公开(公告)号:US20240203456A1
公开(公告)日:2024-06-20
申请号:US18590607
申请日:2024-02-28
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Golan Pundak
IPC: G11B20/10 , G06F3/16 , G10L17/00 , G10L21/0208 , G10L21/028 , G10L21/0364 , G10L25/51 , G10L25/84 , H04M3/56
CPC classification number: G11B20/10527 , G06F3/16 , G06F3/165 , G10L17/00 , G10L21/0364 , G10L25/51 , H04M3/56 , H04M3/568 , G10L21/0208 , G10L21/028 , G10L25/84 , G11B2020/10546
Abstract: Various arrangements for enhancing audio are detailed herein. An audio stream and a second audio stream can be received. From these audio streams, a first audio source and a second audio source are extracted. A conversation between the first audio source and a third audio source that occurs within the audio streams is identified. An updated audio stream is generated that enhances the first audio source and diminishes the second audio source extracted from the audio stream and the second audio stream.
-
公开(公告)号:US11443769B2
公开(公告)日:2022-09-13
申请号:US17194827
申请日:2021-03-08
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Golan Pundak
IPC: G11B20/10 , G06F3/16 , G10L17/00 , G10L21/0364 , H04M3/56 , G10L25/51 , G10L21/0208 , G10L21/028 , G10L25/84
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for identifying that a first audio stream includes first, second, and third sources of audio. A computing system identifies that a second audio stream includes the first, second, and third sources of audio. The computing system determines that the first and second sources of audio are part of a first conversation. The computing system generates a third audio stream that combines the first source of audio from the first audio stream, the first source of audio from the second audio stream, the second source of audio from the first audio stream, and the second source of audio from the second audio stream, and diminishes the third source of audio from the first audio stream, and the third source of audio from the second audio stream.
-
公开(公告)号:US20210405756A1
公开(公告)日:2021-12-30
申请号:US17468465
申请日:2021-09-07
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Artem Dementyev
IPC: G06F3/01
Abstract: Techniques, apparatuses, and systems are described for integrating haptic actuators into accessories for mobile computing device. Current accessory devices and their associated housings lack the necessary space to implement multiple haptic actuators for conveying complex haptic information. The accessory device attached to the mobile computing device provides additional space for integrating haptic actuators, which provide complex haptic information to a user of the mobile computing device. Using the haptic device accessory, the mobile computing device can convey haptic information to a user of the accessory device. In this way, a haptic device accessory can improve the otherwise limited haptic experience for a user of the accessory device.
-
4.
公开(公告)号:US10430835B2
公开(公告)日:2019-10-01
申请号:US15174668
申请日:2016-06-06
Applicant: Google LLC
Inventor: Ayşe Seza Doğruöz , Natalia Ponomareva , Christoph Urs Oehler , Dimitri Kanevsky
Abstract: Methods, systems, and media for language identification of a media content item based on comments are provided. In some embodiments, the method includes: obtaining a plurality of comments associated with a media content item; selecting a subset of the plurality of comments based on one or more criteria; assigning, for each comment in the subset of the plurality of comments, a vector of language probabilities, wherein each component of the vector is assigned a language probability that indicates the likelihood that the comment includes content in a language from a plurality of languages; combining the vector of language probabilities for each comment in the subset of the plurality of comments to generate a combined language vector; identifying a language associated with the media content item based on the combined language vector; and performing an action based on the identified language.
-
公开(公告)号:US10158616B1
公开(公告)日:2018-12-18
申请号:US15214754
申请日:2016-07-20
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Marcel Yung
Abstract: Systems and methods for online access credential transition are described, including receiving a first string of elements associated with a subsequent online access credential, during a credential transition period, receiving a second string of elements associated with an attempted subsequent online access credential, performing a matching operation to determine a degree of matching between the first string of elements and the second string of elements, and based on the degree of matching between the first string of elements and the second string of elements, providing online feedback, and prompting another attempted subsequent online access credential.
-
公开(公告)号:US20240402812A1
公开(公告)日:2024-12-05
申请号:US18691575
申请日:2021-09-13
Applicant: Google LLC
Inventor: Artem Dementyev , Pascal Tom Getreuer , Dimitri Kanevsky , Richard Francis Lyon
Abstract: A vibrotactile device including a first actuator channel having a vibrotactile actuator and a resistor with a predetermined resistance positioned at an input of the vibrotactile actuator, a processor configured to output a driving signal for driving the vibrotactile actuator, and a voltage sensor configured to measure a voltage drop across the resistor. A current drawn by the vibrotactile actuator varies according to a load applied to the vibrotactile actuator and passes through the resistor. The processor is configured to receive voltage drop measurement data from the voltage sensor, detect a load applied to the vibrotactile actuator based on the measured voltage drop, and control the driving signal based on the detected load.
-
公开(公告)号:US10943619B2
公开(公告)日:2021-03-09
申请号:US16812760
申请日:2020-03-09
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Golan Pundak
IPC: G11B20/10 , G06F3/16 , G10L17/00 , G10L21/0364 , H04M3/56 , G10L25/51 , G10L21/0208 , G10L21/028 , G10L25/84
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for identifying that a first audio stream includes first, second, and third sources of audio. A computing system identifies that a second audio stream includes the first, second, and third sources of audio. The computing system determines that the first and second sources of audio are part of a first conversation. The computing system generates a third audio stream that combines the first source of audio from the first audio stream, the first source of audio from the second audio stream, the second source of audio from the first audio stream, and the second source of audio from the second audio stream, and diminishes the third source of audio from the first audio stream, and the third source of audio from the second audio stream.
-
公开(公告)号:US10586569B2
公开(公告)日:2020-03-10
申请号:US15954105
申请日:2018-04-16
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Golan Pundak
IPC: G06F17/00 , G11B20/10 , G06F3/16 , G10L17/00 , G10L21/0364 , H04M3/56 , G10L25/51 , G10L21/02 , G10L21/0208 , G10L21/028 , G10L25/84
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for identifying that a first audio stream includes first, second, and third sources of audio. A computing system identifies that a second audio stream includes the first, second, and third sources of audio. The computing system determines that the first and second sources of audio are part of a first conversation. The computing system generates a third audio stream that combines the first source of audio from the first audio stream, the first source of audio from the second audio stream, the second source of audio from the first audio stream, and the second source of audio from the second audio stream, and diminishes the third source of audio from the first audio stream, and the third source of audio from the second audio stream.
-
公开(公告)号:US20180233173A1
公开(公告)日:2018-08-16
申请号:US15954105
申请日:2018-04-16
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Golan Pundak
CPC classification number: G11B20/10527 , G06F3/16 , G06F3/165 , G10L17/00 , G10L21/0202 , G10L21/0208 , G10L21/028 , G10L21/0364 , G10L25/51 , G10L25/84 , G11B2020/10546 , H04M3/56 , H04M3/568
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for identifying that a first audio stream includes first, second, and third sources of audio. A computing system identifies that a second audio stream includes the first, second, and third sources of audio. The computing system determines that the first and second sources of audio are part of a first conversation. The computing system generates a third audio stream that combines the first source of audio from the first audio stream, the first source of audio from the second audio stream, the second source of audio from the first audio stream, and the second source of audio from the second audio stream, and diminishes the third source of audio from the first audio stream, and the third source of audio from the second audio stream.
-
公开(公告)号:US20250166633A1
公开(公告)日:2025-05-22
申请号:US18511095
申请日:2023-11-16
Applicant: Google LLC
Inventor: Dimitri Kanevsky , Sharlene Yuan , Artem Dementyev , Sagar Savla , Vinton Gray Cerf
IPC: G10L17/06 , G06F3/0482 , G06F3/0484 , G09B21/00 , H04H20/86
Abstract: Systems and methods for generating transcriptions of audio data for presentation at a client device are provided. One or more audio streams, provided by one or more audio sources of one or more client devices of a plurality of users, are received by a broadcasting system. Sensory modality information comprising one or more of auditory, visual, or haptic characteristics of a first user is determined. First audio data from the one or more audio streams corresponding to the first user and additional audio data from the one or more audio streams corresponding to other users are determined using one or more machine learning models. At least one of a first transcription of the first audio data or one or more additional transcription of one or more of the additional audio data are provided for presentation at a first client device according to the auditory, visual, or haptic characteristics.
-
-
-
-
-
-
-
-
-