Abstract:
A method and apparatus that enhances a multicast information stream, such as an IP multicast session, in a communication network is provided. The stream is received through the communication network and is enhanced at substantially the time the first stream is received. The information stream may be enhanced by adding transcribed content, such as content generated by speech recognition software, or translated content, such as from a first language to a second language, to the stream. The information stream may also be enhanced by adding content to the first information stream, such as content related to the original content. The enhanced stream may be sent to a user as a second multicast information stream. The enhanced stream may be received by the user in place of, or along with, the original information stream. The enhanced content may be sent to the user at the conclusion of the information stream, if desired.
Abstract:
A network-based platform uses face recognition, speech recognition, background change detection and key scene events to index multimedia communications. Before the multimedia communication begins, active participants register their speech and face models with a server. Registration consists of creating a speech sample, capturing a sample image of the participant and storing the data in a database. The server provides an indexing function for the multimedia communication. During the multimedia communication, metadata including time stamps is retained along with the multimedia content. The time stamps are used to synchronize the multimedia elements. The multimedia communication is then processed through the server to identify the participants based on speaker and face recognition models. This allows the server to create an index table that becomes an index of the multimedia communication. In addition, through scene change detection and background recognition, certain backgrounds and key scene information can be used for indexing. Therefore, through this indexing apparatus and method, a specific participant can be recognized as speaking, and the content that the participant discussed can also be used for indexing.
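The index table described above can be pictured with a minimal Python sketch. It assumes recognition has already produced time-stamped segments labelled with a participant and some keywords; the `Segment` type, the `build_index` function and the sample data are hypothetical and are not taken from the abstract.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    start: float          # seconds from the start of the communication
    end: float
    participant: str      # label assumed to come from speaker/face recognition
    keywords: tuple = ()  # assumed to come from speech recognition


def build_index(segments):
    """Map each participant and each keyword to the time ranges where it occurs."""
    by_participant = defaultdict(list)
    by_keyword = defaultdict(list)
    for seg in sorted(segments, key=lambda s: s.start):
        by_participant[seg.participant].append((seg.start, seg.end))
        for word in seg.keywords:
            by_keyword[word.lower()].append((seg.start, seg.end, seg.participant))
    return dict(by_participant), dict(by_keyword)


# Hypothetical sample data for illustration only.
segments = [
    Segment(0.0, 12.5, "alice", ("budget",)),
    Segment(12.5, 30.0, "bob", ("schedule", "budget")),
    Segment(30.0, 41.0, "alice", ("schedule",)),
]
participants, keywords = build_index(segments)
```

Looking up `participants["alice"]` gives the time ranges in which that participant spoke, and `keywords["budget"]` gives who mentioned the topic and when.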
Abstract:
A method and apparatus that enhances a multicast information stream, such as an IP multicast session, in a communication network is provided. The stream is received through the communication network and is enhanced at substantially the time the first stream is received. The information stream may be enhanced by adding transcribed content, such as content generated by speech recognition software, or translated content, such as from a first language to a second language, to the stream. The information stream may also be enhanced by adding content to the first information stream, such as content related to the original content. The enhanced stream may be sent to a user as a second multicast information stream. The enhanced stream may be received by the user in place of, or along with, the original information stream. The enhanced content may be sent to the user at the conclusion of the information stream, if desired.
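As a rough illustration of enhancing a stream as it arrives, the following Python sketch interleaves transcript packets with the original packets. The packet tuples, the `enhance` generator and the `fake_transcribe` stand-in are assumptions made for this example; real speech recognition and IP multicast transport are out of scope here.

```python
def enhance(stream, transcribe):
    """Yield each original packet, followed by a transcript packet when one exists.

    `stream` is an iterable of (sequence_number, payload) pairs.
    `transcribe` is any callable returning text for a payload, or "" for none.
    """
    for seq, payload in stream:
        yield ("original", seq, payload)
        text = transcribe(payload)
        if text:
            # The added packet reuses the sequence number so that a receiver
            # can line it up with the original content.
            yield ("transcript", seq, text)


def fake_transcribe(payload):
    """Hypothetical stand-in for speech recognition software."""
    return {b"\x01": "hello", b"\x02": ""}.get(payload, "")


enhanced = list(enhance([(1, b"\x01"), (2, b"\x02")], fake_transcribe))
```

A receiver that wants only the added content can filter on the first tuple field; one that wants both simply consumes the enhanced stream as is.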
Abstract:
The invention provides an on-hold switching device that permits a subscriber to engage in other activities through a telephone network while placed on hold by another party. When the subscriber is placed on hold, the on-hold switching device disconnects the subscriber from the other party and connects the subscriber to the telephone network so that the subscriber may engage in other activities. The on-hold switching device monitors the signal bus connected to the other party to determine whether the on-hold condition is removed. When the on-hold condition is removed, the subscriber is reconnected to the other party. Thus, the on-hold switching device permits a subscriber to engage in other activities when placed on hold by the other party.
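The switching behaviour described above amounts to a two-state machine. The Python sketch below is one way to picture it, assuming the monitored signal bus has been reduced to a boolean `hold_active` reading; the class, state and log names are hypothetical.

```python
from enum import Enum, auto


class State(Enum):
    WITH_PARTY = auto()       # subscriber is connected to the other party
    FREE_ON_NETWORK = auto()  # subscriber is connected to the telephone network


class OnHoldSwitch:
    def __init__(self):
        self.state = State.WITH_PARTY
        self.log = []  # actions the device would take, recorded for inspection

    def on_signal(self, hold_active: bool) -> State:
        """Process one observation of the line to the other party."""
        if self.state is State.WITH_PARTY and hold_active:
            self.state = State.FREE_ON_NETWORK
            self.log.append("connect subscriber to network")
        elif self.state is State.FREE_ON_NETWORK and not hold_active:
            self.state = State.WITH_PARTY
            self.log.append("reconnect subscriber to other party")
        return self.state


switch = OnHoldSwitch()
for reading in [False, True, True, False]:
    switch.on_signal(reading)
```

Repeated readings of the same hold condition cause no action, so the device switches only on a change.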
Abstract:
The present invention provides an apparatus and method for conducting a video conference. The video conference apparatus is connected to at least one network. The video conference apparatus includes an MCU, an environment processor, a user database interface and an environment database interface. When users log onto the video conference apparatus, it is determined whether each user has designated an alternative to the environment normally detected by the camera device during the video conference. If the user has designated an alternative environment, the environment processor obtains the environment from the environment database and the video conference apparatus uses the designated environment during the video conference. However, if the user has not designated an alternative environment, the environment processor sends a request message providing a listing of possible environments which may be used during the video conference. Thus, the user may select a desired environment from the listing and use it during the video conference. If the user does not wish to select an alternative environment, a default environment corresponding to the environment normally detected through the camera device is used during the video conference.
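The selection logic can be summarised in a short Python sketch. It assumes the user database is a dictionary of designated environments and that the request message is modelled by a callback; `choose_environment`, `ask_user` and the environment names are hypothetical.

```python
DEFAULT_ENVIRONMENT = "camera"  # stands for the scene the camera actually detects


def choose_environment(user, designated, available, ask_user):
    """Return the environment to use for `user` during the conference.

    designated: dict mapping user -> environment chosen in advance
    available:  collection of environments offered in the listing
    ask_user:   callable(user, listing) -> chosen environment, or None
    """
    if user in designated:
        return designated[user]
    choice = ask_user(user, list(available))
    if choice in available:
        return choice
    return DEFAULT_ENVIRONMENT  # the user declined to pick an alternative


available = {"beach", "office", "library"}
designated = {"ann": "office"}

env_ann = choose_environment("ann", designated, available, lambda u, l: None)
env_bob = choose_environment("bob", designated, available, lambda u, l: "beach")
env_cat = choose_environment("cat", designated, available, lambda u, l: None)
```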
Abstract:
A network-based voice messaging system is provided. A voice message is received at a network. The network converts the voice message into a text message by utilizing speech recognition software. The text message is transmitted to the intended recipient as an electronic mail (e-mail) message or facsimile document and is received by the intended recipient on conventional text receiving equipment.
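A minimal Python sketch of the conversion step follows, assuming the e-mail delivery option. Speech recognition is represented by a caller-supplied function, since the abstract does not name any particular software; `voice_to_email`, the addresses and the subject line are hypothetical.

```python
from email.message import EmailMessage


def voice_to_email(audio, recipient, transcribe, sender="voicemail@example.invalid"):
    """Convert a voice message to text and wrap it in an e-mail message.

    `transcribe` is any callable that turns audio bytes into text; it stands
    in for the speech recognition software.
    """
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = "Transcribed voice message"
    msg.set_content(transcribe(audio))
    return msg


# Hypothetical recognizer used only to make the example self-contained.
message = voice_to_email(b"...", "user@example.invalid", lambda audio: "Please call me back.")
```

The resulting object could be handed to any standard mail transport; the facsimile option would replace only the final packaging step.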
Abstract:
A network information delivery system automatically determines end-user information output requirements based on predetermined data corresponding to each requesting end-user terminal. A user profile is maintained in a database associated either with a network information delivery system or with the end-user terminal and is accessed by the network information delivery device. If the network information delivery device has authority to access the end-user terminal, a program may be downloaded to the end-user terminal to determine the exact end-user terminal configuration. The program executing in the end-user terminal returns to the network information delivery device a user profile containing the end-user terminal capabilities so that the requested information may be formatted and delivered to the end-user in an optimal manner. The information to be delivered to end-users may be pre-stored in predetermined formats. The predetermined formats may be determined based on the volume of requests and the characteristics of the information. The information may also be stored in a generic format so that packaging the information for a specific user may be performed efficiently and in a timely manner.
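One way to picture the format selection is the Python sketch below. The abstract does not say what "optimal" means, so this example assumes it means the highest-bitrate pre-stored format that the terminal can render within its bandwidth, falling back to the generic format otherwise. The profile fields and format records are hypothetical.

```python
GENERIC = "generic"  # stands for the generic stored format, packaged on demand


def pick_format(profile, stored_formats):
    """Pick a pre-stored format that matches the terminal's capabilities."""
    usable = [
        f for f in stored_formats
        if f["kind"] in profile["kinds"] and f["kbps"] <= profile["max_kbps"]
    ]
    if not usable:
        return GENERIC
    return max(usable, key=lambda f: f["kbps"])["name"]


# Hypothetical pre-stored formats and terminal profiles.
stored_formats = [
    {"name": "video-hi", "kind": "video", "kbps": 1500},
    {"name": "video-lo", "kind": "video", "kbps": 300},
    {"name": "text", "kind": "text", "kbps": 5},
]
modem_terminal = {"kinds": {"video", "text"}, "max_kbps": 500}
audio_only_terminal = {"kinds": {"audio"}, "max_kbps": 500}

choice_modem = pick_format(modem_terminal, stored_formats)
choice_audio = pick_format(audio_only_terminal, stored_formats)
```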
Abstract:
Real-time delivery of multimedia information, accessed through the Internet or otherwise, to one or more users, either simultaneously or with a sequential time delay, is enabled by delivering the multimedia information over a switched network via a multipoint control unit. A client establishes a connection with a server, or other remote location where desired multimedia information is resident, identifies the desired multimedia information and provides client information identifying the locations of the users. The client information may include the telephone numbers or other access numbers of each of the multiple users. The multimedia information is then delivered by a multimedia server to a bridging apparatus through a switched network guaranteeing high quality of service, secure connection and billing control. Connecting one or more users to a live agent is also made possible by providing the telephone numbers of the users and the live agent directly to the multipoint control unit. The delivery of the multimedia information can be secured by comparing the client information to a segmented "to call" list to determine whether the client is authorized to receive the requested multimedia information. Alternatively, or in addition, security can be implemented by mounting a video camera at the client's endpoint and connecting a database running face recognition software to the multimedia server. The multimedia server's call to the user triggers the camera to take a picture of the user. The selected content is restricted to authorized users by comparing the picture to pictures of authorized faces stored in the database.
Abstract:
A mobility support technique provides home agents and foreign agents in mobility aware access networks. Participating mobile hosts are assigned a home address that is used by other hosts as the mobile host's address. The home address actually addresses the home agent provided in the mobility aware access network. The home address provides additional privacy to the mobile host because it does not identify the mobile host's home premises network, where the mobile host resides permanently absent any mobility of the mobile host. By providing home agents and foreign agents in a mobility aware access network, the agents may cooperatively establish optimal routing paths for data transmitted to a mobile host. The agents may identify a pseudo home agent, an agent in a mobility aware access network located near a transmitting mobile host, that acts as the home agent of a destination mobile host. The pseudo home agent tunnels data directly to the destination mobile host without requiring the data to be routed first to the true home agent. In this regard, the pseudo home agent establishes a more direct routing path between the transmitting and destination mobile hosts.
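The routing shortcut can be illustrated with a small Python sketch. It assumes each agent keeps a table that binds a home address to the destination's current address; `delivery_path`, the table layouts and the sample addresses are hypothetical and greatly simplify real mobility protocols.

```python
def delivery_path(dest_home, pseudo_bindings, home_agents):
    """Return the hops a packet takes to reach the destination mobile host.

    pseudo_bindings: home address -> current address, as known to the agent
                     near the sender (the pseudo home agent)
    home_agents:     home address -> (true home agent name, current address)
    """
    if dest_home in pseudo_bindings:
        # The nearby agent tunnels directly, bypassing the true home agent.
        return ["pseudo_home_agent", pseudo_bindings[dest_home]]
    agent, current = home_agents[dest_home]
    return [agent, current]


# Documentation-range addresses used purely as sample data.
home_agents = {"10.0.0.7": ("home_agent_A", "192.0.2.44")}

path_without_shortcut = delivery_path("10.0.0.7", {}, home_agents)
path_with_shortcut = delivery_path("10.0.0.7", {"10.0.0.7": "192.0.2.44"}, home_agents)
```

Both paths end at the same current address; the difference is whether traffic first travels to the true home agent.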