System and method for generating spatial audio with uniform reverberation in real-time communication

    公开(公告)号:US11950088B2

    公开(公告)日:2024-04-02

    申请号:US17859181

    申请日:2022-07-07

    申请人: Agora Lab, Inc.

    摘要: A computer-implemented method for generating spatial audio with uniform reverberation in a real-time communication session is performed by a real-time communication software application running on an electronic communication device. The method includes removing the reverberation of recorded speech signals from far-end participants by the dereverberation approach, rendering the direct sound parts by filtering the output signals by head-related transfer functions of desired directions, generating reverberant sound parts by convolving the output signals from with uniform room impulse responses or an artificial reverberator, combining direct and reverberant sound components to generate spatialized speech signals. When speakers and listeners are located in two virtual conference rooms, the reverberation of the two rooms are coupled. The reverberant sound parts are then generated by convolving the output signals and coupled RIRs from the two rooms.

    Highly scalable system for automated SDK testing

    公开(公告)号:US11650913B2

    公开(公告)日:2023-05-16

    申请号:US17891324

    申请日:2022-08-19

    申请人: Agora Lab, Inc.

    发明人: Jiyu Bai Yuhua Gong

    摘要: A highly scalable automated testing system includes an automated testing controller, an automated testing message server and an automated testing message terminal running on a set of automated testing target devices. The controller and the terminals register themselves with the message server. A testing case is programmed for testing an SDK on a number of target devices concurrently and transformed into a set of command messages in JSON message format. The controller sends the set of messages to the message server. The message server then distributes the set of command messages to the terminals. In response, the terminal calls corresponding APIs of the SDK. The APIs called can be the same or different between the devices within the set of target devices. The SDK returns a result that is forwarded to the server. The server sends the results from the target devices to the controller. The controller verifies the results.

    Gain control for multiple description coding

    公开(公告)号:US11645079B2

    公开(公告)日:2023-05-09

    申请号:US16735021

    申请日:2020-01-06

    申请人: Agora Lab, Inc.

    摘要: A method for encoding multiple descriptions for a media stream includes: determining, for a current block of the media stream, a first number of duplicate blocks, determining, for the current block, a plurality of complementary forward gain factors, and determining a first number of descriptions for the current block by applying the plurality of complementary forward gain factors to the first number of duplicate blocks. A method for decoding multiple descriptions for a media stream includes: determining, for a current block of the media stream, a first number of descriptions from the media stream, in which each description comprises a scaled block corresponding to the current block, determining backward gain factors for the first number of scaled blocks, and determining the current block by applying the backward gain factors to the first number of scaled blocks.

    System and method for automatically switching real-time communication device to new basic service set

    公开(公告)号:US11470531B2

    公开(公告)日:2022-10-11

    申请号:US16926855

    申请日:2020-07-13

    申请人: Agora Lab, Inc.

    发明人: Yue Cao

    摘要: An improved Wi-Fi access point dynamically determines the interference level of each channel. The access point determines that a new channel is better suited to serve an RTC/RTE application over the present channel based on the interference levels and a set of interference measure thresholds. The access point automatically creates a new basic service set on the new channel and associates the RTC/RTE device running the RTC/RTE application with the new basic service set. The access point further refuses to associate the RTC/RTE device with the previous channel when the interference level on the new channel remains low based on a second set of interference measure thresholds. In addition, the access point allows additional RTC/RTE devices to join the new basic service set with the first RTC/RTE device when the interference level on the new channel remains low based on the second set of interference measure thresholds.

    WIRELESS ACCESS POINT SOFTWARE APPLICATION FOR CLASSIFYING NETWORK DATA PACKETS

    公开(公告)号:US20220248270A1

    公开(公告)日:2022-08-04

    申请号:US17727925

    申请日:2022-04-25

    申请人: Agora Lab, Inc.

    发明人: Yue Cao

    摘要: A wireless access point software application, adapted to be run within a wireless access point, classifies network data packets with classification identifiers provided by RTC/RTE devices without scanning the content of them. The wireless access point software application receives inbound and outbound packet classification identifiers from the RTC/RTE applications running on the RTC/RTE devices. The identifiers are provided to a hardware data packet classifier. The hardware data packet classifier applies the identifiers against ingress packets and egress packets respectively. Data packets of the same class are assigned with a same priority. The prioritized network data packets are scheduled for transmission based on their respective priorities. The scheduled network data packets are forwarded to respective network interfaces for transmission.

    Real-time voice timbre style transform

    公开(公告)号:US11380345B2

    公开(公告)日:2022-07-05

    申请号:US17071454

    申请日:2020-10-15

    申请人: Agora Lab, Inc.

    IPC分类号: G10L21/013 G10L25/51

    摘要: Transforming a voice of a speaker to a reference timbre includes converting a first portion of a source signal of the voice of the speaker into a time-frequency domain to obtain a time-frequency signal; obtaining frequency bin means of magnitudes over time of the time-frequency signal; converting the frequency bin magnitude means into a Bark domain to obtain a source frequency response curve (SR), where SR(i) corresponds to magnitude mean of the ith frequency bin; obtaining respective gains of frequency bins of the Bark domain with respect to a reference frequency response curve (Rf); obtaining equalizer parameters using the respective gains of the frequency bins of the Bark domain; and transforming the first portion to the reference timbre using the equalizer parameters.