Abstract:
Computer-implemented techniques can include establishing, between first and second computing devices, an audio communication session via a network. The first computing device can receive and temporarily store audio packets corresponding to an audio input signal obtained by the second computing device. When the audio packets cause the buffer to fill beyond or at a rate exceeding a loss threshold indicative of a temporary loss of communication via the network, the techniques can decode the audio packets to obtain an audio output signal, determine whether the audio output signal includes speech, and when the audio output signal includes speech, remove a redundant portion of the audio output signal corresponding to a pitch period to obtain a modified audio output signal. The modified audio output signal can then be output via a speaker of the first computing device.
Abstract:
A computer-implemented technique can include establishing an audio communication session between first and second computing devices and obtaining, by the first computing device, an audio input signal using audio data captured by a microphone. The first computing device can analyze the audio input signal to detect a speech input by its first user and can determine a duration of a detection period from when the audio input signal was obtained until the analyzing has completed. The first computing device can then transmit, to the second computing device, (i) a portion of the audio input signal beginning at a start of the speech input and (ii) the detection period duration, wherein receipt of the portion of the audio input signal and the detection period duration causes the second computing device to accelerate playback of the portion of the audio input signal to compensate for the detection period duration.
Abstract:
Systems and methods are provided for handling concurrent speech in which first speech data is received from a first participant of a session and second speech data is received from a second participant of the session. The second speech data includes a pause. The second speech data temporally overlaps the first speech data. A determination is made as to whether the first speech data exceeds a predetermined length. When the first speech data exceeds the predetermined length, the first speech data is outputted and then the second speech data of the second participant is outputted without the pause. When the first speech data does not exceed the predetermined length, the first speech data is outputted and then the second speech data is outputted with the pause.
Abstract:
Implementations generally relate to enhancing video conferences. In some implementations, a method includes determining one or more characteristics of a video stream provided by a first camera. The method further includes determining one or more functions of the first camera based on the one or more characteristics. The method further includes enabling a browser to control the one or more functions of the first camera, and wherein the browser is remote relative to the first camera.
Abstract:
A method and computing system for monitoring an AV synchronous communication session of a user. The AV synchronous communication session includes an audio channel and a video channel. A performance characteristic is determined for the audio channel. The performance characteristic is compared to a defined audio performance threshold. If the performance characteristic does not meet the defined audio performance threshold, the video channel is adjusted to reduce the bandwidth requirements of the video channel.
Abstract:
According to one general aspect, a method may include executing, by a processor of a computing device, at least a portion of an application that includes a plurality of tabs, each tab associated with a respective document that is configured to be rendered for display by the application. The method may also include determining a particular tab of the plurality of tabs that is recording an audio and/or visual signal derived from an environment of the computing device. The method may further include providing a graphical indication, associated with the particular tab, that indicates to a user of the computing device that the particular tab is recording the audio and/or visual signal.
Abstract:
A system having one or more processors and a memory, receives both speech data from first and second participants of a session. The system outputs the speech of the first participant. The system outputs the speech of the second participant in accordance with an adjustment of the speech of a participant of the session when the speech of the second participant temporally overlaps less than a first predetermined threshold amount of a terminal portion of the speech of the first participant. The system drops the speech of the second participant when the speech of the second participant temporally overlaps more than the first predetermined threshold amount of the terminal portion of the speech of the first participant. Optionally, the system adjusts the speech of a participant of the session by delaying output of the speech of the second participant.
Abstract:
A system having one or more processors and a memory, sends a plurality of test audio packets at a level of signal complexity deviating from a model level of signal complexity to a destination device through one or more networks. The system then receives a response to the plurality of test audio packets, where the response is indicative of a value for a quality of service characteristic associated with the one or more networks, and where the value for the quality of service characteristic is determined by how the plurality of test audio packets deviate from the model level of signal complexity when received by a remote device. In response to receiving the response to the plurality of test audio packets, the system activates a signal correction action when the value for the quality of service characteristic fails to meet a performance threshold.