-
1.
公开(公告)号:US20180061437A1
公开(公告)日:2018-03-01
申请号:US15720498
申请日:2017-09-29
Applicant: Google Inc.
Inventor: Erik Kay , Jonas Erik Lindberg , Serge Lachapelle , Henrik Lundin
IPC: G10L21/043 , G10L21/0208 , H04L29/06 , G10L15/20 , G10L25/78
CPC classification number: G10L21/043 , G10L15/20 , G10L19/005 , G10L21/0208 , G10L21/0232 , G10L25/78 , H04L65/1069 , H04L65/80 , H04L67/141
Abstract: A computer-implemented technique can include establishing an audio communication session between first and second computing devices and obtaining, by the first computing device, an audio input signal using audio data captured by a microphone. The first computing device can analyze the audio input signal to detect a speech input by its first user and can determine a duration of a detection period from when the audio input signal was obtained until the analyzing has completed. The first computing device can then transmit, to the second computing device, (i) a portion of the audio input signal beginning at a start of the speech input and (ii) the detection period duration, wherein receipt of the portion of the audio input signal and the detection period duration causes the second computing device to accelerate playback of the portion of the audio input signal to compensate for the detection period duration.
-
公开(公告)号:US09661208B1
公开(公告)日:2017-05-23
申请号:US14941290
申请日:2015-11-13
Applicant: Google Inc.
Inventor: Serge Lachapelle , Jens Fredrik Oja
IPC: H04N7/14 , H04N5/232 , H04N7/15 , H04N21/4227 , H04N21/4223
CPC classification number: H04N5/23206 , H04N5/23222 , H04N5/2352 , H04N5/2357 , H04N7/15 , H04N7/152 , H04N7/183 , H04N21/4223 , H04N21/4227
Abstract: Implementations generally relate to enhancing video conferences. In some implementations, a method includes determining one or more characteristics of a video stream provided by a first camera. The method further includes determining one or more functions of the first camera based on the one or more characteristics. The method further includes enabling a browser to control the one or more functions of the first camera, and wherein the browser is remote relative to the first camera.
-
公开(公告)号:US09742921B2
公开(公告)日:2017-08-22
申请号:US15336629
申请日:2016-10-27
Applicant: GOOGLE INC.
Inventor: Serge Lachapelle , Alexander Kjeldaas
IPC: H04M3/56 , A61B90/00 , A61B18/24 , A61B17/3211 , A61B17/3207 , A61B17/3203 , G10L21/00 , A61B17/50 , A61N1/05 , H04L29/06 , G10L17/00 , A61B17/32
CPC classification number: H04M3/568 , A61B17/3203 , A61B17/3207 , A61B17/320725 , A61B17/3211 , A61B17/50 , A61B18/245 , A61B90/02 , A61B2017/320044 , A61N1/056 , G10L21/00 , H04L65/403
Abstract: Systems and methods are provided for handling concurrent speech in which first speech data is received from a first participant of a session and second speech data is received from a second participant of the session. The second speech data includes a pause. The second speech data temporally overlaps the first speech data. A determination is made as to whether the first speech data exceeds a predetermined length. When the first speech data exceeds the predetermined length, the first speech data is outputted and then the second speech data of the second participant is outputted without the pause. When the first speech data does not exceed the predetermined length, the first speech data is outputted and then the second speech data is outputted with the pause.
-
公开(公告)号:US09313335B2
公开(公告)日:2016-04-12
申请号:US14027061
申请日:2013-09-13
Applicant: Google Inc.
Inventor: Serge Lachapelle , Alexander Kjeldaas
IPC: H04M3/56 , A61B18/24 , A61B19/00 , A61B17/3211 , A61B17/3207 , A61B17/3203 , G10L21/00 , G10L17/00
CPC classification number: H04M3/568 , A61B17/3203 , A61B17/3207 , A61B17/320725 , A61B17/3211 , A61B17/50 , A61B18/245 , A61B90/02 , A61B2017/320044 , A61N1/056 , G10L21/00 , H04L65/403
Abstract: A system having one or more processors and a memory, receives both speech data from first and second participants of a session. The system outputs the speech of the first participant. The system outputs the speech of the second participant in accordance with an adjustment of the speech of a participant of the session when the speech of the second participant temporally overlaps less than a first predetermined threshold amount of a terminal portion of the speech of the first participant. The system drops the speech of the second participant when the speech of the second participant temporally overlaps more than the first predetermined threshold amount of the terminal portion of the speech of the first participant. Optionally, the system adjusts the speech of a participant of the session by delaying output of the speech of the second participant.
Abstract translation: 具有一个或多个处理器和存储器的系统从会话的第一和第二参与者接收语音数据。 系统输出第一个参与者的语音。 当第二参与者的语音时间上重叠小于第一参与者的语音的终端部分的第一预定阈值量时,系统根据会话的与会者的语音的调整来输出第二参与者的语音 。 当第二参与者的语音时间上重叠多于第一参与者的语音的终端部分的第一预定阈值量时,系统丢弃第二参与者的语音。 可选地,系统通过延迟第二参与者的语音的输出来调整会话的参与者的语音。
-
公开(公告)号:US09210058B2
公开(公告)日:2015-12-08
申请号:US13791878
申请日:2013-03-08
Applicant: GOOGLE INC.
Inventor: Alexander Kjeldaas , Serge Lachapelle
IPC: H04L12/28 , H04L12/26 , H04L12/24 , H04L29/06 , H04L12/841 , H04J3/06 , H04L12/825 , H04J1/16
CPC classification number: H04L43/087 , H04J3/0632 , H04L41/5038 , H04L43/16 , H04L47/25 , H04L47/283 , H04L65/1083 , H04L65/80
Abstract: A system having one or more processors and a memory, sends a plurality of test audio packets at a level of signal complexity deviating from a model level of signal complexity to a destination device through one or more networks. The system then receives a response to the plurality of test audio packets, where the response is indicative of a value for a quality of service characteristic associated with the one or more networks, and where the value for the quality of service characteristic is determined by how the plurality of test audio packets deviate from the model level of signal complexity when received by a remote device. In response to receiving the response to the plurality of test audio packets, the system activates a signal correction action when the value for the quality of service characteristic fails to meet a performance threshold.
Abstract translation: 具有一个或多个处理器和存储器的系统通过一个或多个网络将信号复杂度偏离信号复杂度的信号复杂度水平的多个测试音频分组发送到目的地设备。 然后,系统接收对多个测试音频分组的响应,其中响应指示与一个或多个网络相关联的服务质量特征的值,并且其中服务质量特征的值由如何 当由远程设备接收时,多个测试音频分组偏离信号复杂度的模型级别。 响应于接收对多个测试音频分组的响应,当服务质量特性的值不能满足性能阈值时,系统激活信号校正动作。
-
公开(公告)号:US09886160B2
公开(公告)日:2018-02-06
申请号:US13843721
申请日:2013-03-15
Applicant: Google Inc.
Inventor: Shijing Xian , Serge Lachapelle , Yuri James Wiitala , Jiao Yang Lin , Hin-Chung Lam
IPC: G06F3/0482 , G06F3/0481 , G06F3/16 , G06F3/0483 , G06F3/0484
CPC classification number: G06F3/0481 , G06F3/0483 , G06F3/0484 , G06F3/167
Abstract: According to one general aspect, a method may include executing, by a processor of a computing device, at least a portion of an application that includes a plurality of tabs, each tab associated with a respective document that is configured to be rendered for display by the application. The method may also include determining a particular tab of the plurality of tabs that is recording an audio and/or visual signal derived from an environment of the computing device. The method may further include providing a graphical indication, associated with the particular tab, that indicates to a user of the computing device that the particular tab is recording the audio and/or visual signal.
-
公开(公告)号:US20170318158A1
公开(公告)日:2017-11-02
申请号:US15653324
申请日:2017-07-18
Applicant: GOOGLE INC.
Inventor: Serge Lachapelle , Alexander Kjeldaas
IPC: H04M3/56 , G10L21/00 , A61N1/05 , A61B17/50 , A61B90/00 , A61B17/3211 , A61B17/3207 , A61B17/3203 , H04L29/06 , A61B18/24 , A61B17/32
CPC classification number: H04M3/568 , A61B17/3203 , A61B17/3207 , A61B17/320725 , A61B17/3211 , A61B17/50 , A61B18/245 , A61B90/02 , A61B2017/320044 , A61N1/056 , G10L21/00 , H04L65/403
Abstract: Systems and methods are provided for handling concurrent speech in which temporally overlapping first speech data and second speech data is received from respective first and second participants of a session. A speech policy applied to the speech data specifies dropping the second speech when it interrupts the first speech within a first interval of the first speech data. The first interval is temporally bounded by the beginning of the first speech and a first predetermined amount of time after the beginning of the first speech. The speech policy specifies outputting the first speech data and then outputting the second speech data when the second speech data interrupts a second interval of the first speech data. The second interval of the first speech data is temporally bounded by the end of the first speech data and a second predetermined amount of time before the end of the first speech data.
-
公开(公告)号:US09491300B2
公开(公告)日:2016-11-08
申请号:US15059222
申请日:2016-03-02
Applicant: GOOGLE INC.
Inventor: Serge Lachapelle , Alexander Kjeldaas
IPC: H04M3/56 , A61B18/24 , A61B17/3211 , A61B17/3207 , A61B17/3203 , G10L21/00 , A61B17/50 , A61N1/05 , H04L29/06 , G10L17/00 , A61B17/32
CPC classification number: H04M3/568 , A61B17/3203 , A61B17/3207 , A61B17/320725 , A61B17/3211 , A61B17/50 , A61B18/245 , A61B90/02 , A61B2017/320044 , A61N1/056 , G10L21/00 , H04L65/403
Abstract: A system having one or more processors and a memory receives both speech data from first and second participants of a session. The system outputs the speech of the first participant. The system outputs the speech of the second participant concurrent with the speech of the first participant when the length of time of the speech data of the first participant is more than a predetermined threshold amount. The system outputs the speech data of the second participant in accordance with an adjustment of the speech of one or more participants of the session that includes delaying output of the speech data of the second participant until after the speech data of the first participant has been outputted when the length of time of the speech data of the first participant is less than the predetermined threshold amount.
Abstract translation: 具有一个或多个处理器和存储器的系统接收来自会话的第一和第二参与者的语音数据。 系统输出第一个参与者的语音。 当第一参与者的语音数据的时间长度大于预定阈值量时,系统与第一参与者的语音同时输出第二参与者的语音。 系统根据会话的一个或多个参与者的语音的调整输出第二参与者的语音数据,其中包括延迟第二参与者的语音数据的输出,直到第一参与者的语音数据被输出 当第一参与者的语音数据的长度小于预定阈值量时。
-
公开(公告)号:US09288435B2
公开(公告)日:2016-03-15
申请号:US13894232
申请日:2013-05-14
Applicant: GOOGLE INC.
Inventor: Serge Lachapelle , André Susano Pinto , Åsa Persson , Magnus Flodman
IPC: H04N7/15
CPC classification number: H04N7/15
Abstract: Provided are methods for switching active speakers during a video conferencing session. An image of an active speaker in a video conference is provided for presentation in a main display area of a display screen, where the active speaker is one of a plurality of users participating in the video conference over a network. When, a new active speaker out of the users participating in the video conference is detected, resolutions of available video streams received for each of the users are determined. In response to determining that the resolution of the video stream received for the new active speaker is below a threshold resolution, the image of the active speaker continues to be provided for presentation in the main display area until a predetermined period of time has elapsed.
Abstract translation: 提供了在视频会议会话期间切换主动扬声器的方法。 视频会议中的主动扬声器的图像被提供用于呈现在显示屏幕的主显示区域中,其中主动扬声器是通过网络参与视频会议的多个用户之一。 当检测到参与视频会议的用户中的新的主动扬声器时,确定为每个用户接收的可用视频流的分辨率。 响应于确定为新的主动扬声器接收的视频流的分辨率低于阈值分辨率,则主动扬声器的图像继续被提供用于在主显示区域中呈现,直到经过预定时间段。
-
公开(公告)号:US10015385B2
公开(公告)日:2018-07-03
申请号:US15599291
申请日:2017-05-18
Applicant: Google Inc.
Inventor: Serge Lachapelle , Jens Fredrik Oja
IPC: H04N7/14 , H04N5/232 , H04N7/15 , H04N21/4227 , H04N21/4223
CPC classification number: H04N5/23206 , H04N5/23222 , H04N5/2352 , H04N5/2357 , H04N7/15 , H04N7/152 , H04N7/183 , H04N21/4223 , H04N21/4227
Abstract: Implementations generally relate to enhancing video conferences. In some implementations, a method includes determining one or more characteristics of a video stream provided by a first camera. The method further includes determining one or more functions of the first camera based on the one or more characteristics. The method further includes enabling a browser to control the one or more functions of the first camera, and wherein the browser is remote relative to the first camera.
-
-
-
-
-
-
-
-
-