摘要:
According to one embodiment of the invention, a method for managing time-sensitive packetized data streams at a receiver includes receiving a time-sensitive packet of a data stream, analyzing an energy level of a payload signal of the packet, and determining whether to drop the packet based on the energy level of the payload signal.
摘要:
Methods, devices, and software are provided for generating and sending data packets that contain redundant voice data over VoIP connections made under an unreliable network protocol. The redundant data is packaged either in redundant data packets, or in expanded original packets, to repeat data that originally belongs in other packets. Generation of the redundant voice data is either from the transmitting device or from a retransmitting device, such as a router in the network. Generation is triggered either when errors are detected, or simply when the network resources permit it, or both. The received voice data is processed by the second party to the connection, which is typically a telephone call. The redundant voice data that is actually received is discarded. The invention thus ensures that less voice data is lost than in the prior art over VoIP connections made under an unreliable network protocol.
摘要:
According to one embodiment of the invention, a method for managing communication impairments between Internet Protocol devices is provided. The method includes determining a transmission of a signal comprising a comfort noise, where the signal is transmitted from a first endpoint to a second endpoint. The method also includes sending a notice signal from the first endpoint to the second endpoint indicating that the signal is transmitted. The method also includes suppressing the signal at the second endpoint in response to the notice signal. According to another embodiment of the invention, a method for managing communication impairments between an Internet Protocol (“IP”) phone and an IP device is provided. The method provides sending a status signal to the device indicating that the phone is operating as a speakerphone. The method also includes suppressing the transmission of any comfort noise to the phone in response to the status signal.
摘要:
A method for synchronizing media in a call includes receiving a first input stream of packets of a first media type at a first call resource, generating a first output stream in response to the media streams, receiving synchronization information from a second call resource, and communicating the output stream from the first call resource to a endpoint in synchronization with the second call resource. A video conference bridge includes a first interface operable to receive a video input stream, a processor operable to generate video output, and a second interface operable to receive synchronization information from an audio conference bridge.
摘要:
According to one embodiment of the invention, a method for managing call requests in a limited bandwidth environment includes receiving a call request from a user, determining if a voice channel on a communication link is available for the call request, and providing a feedback signal for the user indicating when a voice channel will be available.
摘要:
A system and method for identifying a participant during a conference call include the capability to receive a packet containing data that represents audible sounds spoken by one of a plurality of participants in a conference call and to determine a speaker of the audible sounds using voice profile information of the participants. The system and method further include the capability to provide identification information of the speaker to the other participants in the conference call contemporaneously with providing audible sounds based on the data to those participants.
摘要:
Multiple audio streams are spatially separated with a context switching system to allow a listener to mentally focus on individual point sources of auditory information in the presence of other sound sources. The switching system simultaneously directs incoming sound sources to different spatial processors. Each spatial processor moves the received sound sources to different audibly perceived point sources. The outputs from the spatial processors are mixed into a stereo signal with left and right outputs and then output to the listener. Important sound sources are moved to a foreground point source for increased intelligibility while less important source sources are moved to a background point source.
摘要:
Systems and methods are disclosed for packet voice conferencing. An encoding system accepts two sound field signals, representing the same sound field sampled at two spatially-separated points. The relative delay between the two sound field signals is detected over a given time interval. The sound field signals are combined and then encoded as a single audio signal, e.g., by a method suitable for monophonic VoIP. The encoded audio payload and the relative delay are placed in one or more packets and sent to a decoding device via the packet network.The decoding device uses the relative delay to drive a playout splitter—once the encoded audio payload has been decoded, the playout splitter creates multiple presentation channels by inserting the transmitted relative delay in the decoded signal for one (or more) of the presentation channels. The listener thus perceives a speaker's voice as originating from a location related to the speaker's physical position at the other end of the conference. An advantage of these embodiments is that a pseudo-stereo conference can be conducted with virtually the same bandwidth as a monophonic conference.
摘要:
A system and method are disclosed for packet voice conferencing. The system and method divide a conferencing presentation sound field into sectors, and allocate one or more sectors to each conferencing endpoint. At some point between capture and playout, the voice data from each endpoint is mapped into its designated sector or sectors. Thereafter, when the voice data from a plurality of participants from multiple endpoints is combined, a listener can identify a unique apparent location within the presentation sound field for each participant. The system allows a conference participant to increase their comprehension when multiple participants speak simultaneously, as well as alleviate confusion as to who is speaking at any given time.
摘要:
A PC-based server platform includes a first backplane bus used for transferring data and commands to various PC peripheral devices. A network router and a telephony endpoint card are coupled to the backplane bus and separately coupled through a second Time Division Multiplexed (TDM) bus. The router includes interfaces to various packet switched networks such as a Wide Area Network (WAN) and a Local Area Network (LAN). The TDM bus is used to route telephony data between the different Internet Protocol (IP)-based networks and the telephony card independently of the host system. The PC host processor also uses the router as a standard LAN interface for transferring data packets. A DSP voice processing card is coupled between the backplane bus and the TDM bus to compress and decompress the telephony data transferred on the TDM bus.