摘要:
A network information delivery system automatically determines end-user information output requirements based on predetermined data corresponding to each requesting end-user terminal. A user profile is maintained in a database either associated with a network information delivery system or with the end-user terminal and is accessed by the network information delivery device. If the network information delivery device has authority to access the end-user terminal, a program may be downloaded to the end-user terminal to determine the exact end-user terminal configuration. The program executing in the end-user terminal returns to the network information delivery device a user profile containing the end-user terminal capabilities so that the requested information may be formatted and delivered to the end-user in an optimal manner. The information to be delivered to end-users may be pre-stored in predetermined formats. The predetermined formats may be determined based on a volume of requests and the characteristic of the information. The information may also be stored in a generic format so that packaging the information for a specific user may be efficiently and timely performed.
摘要:
Real time delivery of multimedia information accessed either through the Internet, or otherwise, simultaneously, or sequentially time delayed to one more users, is enabled by delivering the multimedia information over a switched network via a multipoint control unit. A client establishes a connection with a server, or other remote location where desired multimedia information is resident, identifies the desired multimedia information and provides client information identifying the locations of the users. The client information may include the telephone numbers or other access numbers of each of the multiple users. The multimedia information is then delivered by a multimedia server to a bridging apparatus through a switched network guaranteeing high quality of service, secure connection and billing control. Connecting one or more users to a live agent is also made possible by providing the telephone numbers of the users and the live agent directly to the multipoint control unit. The delivery of the multimedia information can be secured by comparing the client information to a segmented "to call" list to determine whether the client is authorized to receive the requested multimedia information. Alternatively, or in addition, security can be implemented by mounting a video camera at the client's endpoint and connecting a database running face recognition software to the multimedia server. The multimedia server's call to the user triggers the camera to take a picture of the user. The selected content is restricted to authorized users by comparing the picture to pictures of authorized faces stored in the database.
摘要:
A communications system uses a World Wide Web (Web) server to provide multimedia messaging functions over the Internet. Multimedia workstations are interconnected via the public switched telephone network (PSTN). Parties are provided with multimedia mailboxes on message servers that are connected to the PSTN and the Internet. In order to identify the message server on which a called party's mailbox is located, the Web server provides the multimedia number of the called party's message server when a call is made. In addition, the Web server provides the multimedia number of the called party. When a multimedia call is unanswered, the system uses the multimedia number of the message server and the called party multimedia number provided by the Web server to record and store a message for the called party in the called party's mailbox.
摘要:
A system and method for establishing a communication path for a call between a video telephone/teleconference call and a packet network telephone terminal through a packet network. The call is routed through a multimedia gateway which performs a conversion process between a video telephone/teleconference domain and a packet network telephone domain.
摘要:
An enduser at a POTS analog voice-only endpoint (136) and endusers at H.320 standard multimedia terminals (101, 102, 103, 104), which each communicate over separate voice, video and data streams, engage in a videoconference with each other in a pseudo multimedia manner through a central platform (135) that provides call conversion capabilities. A document to be shared by a user at the POTS endpoint with users at the multimedia endpoints is transmitted as a data signal from a facsimile machine (137) or PC terminal (138) associated with the POTS user to a server (146) in the platform. The received data signal is then inputted to a multimedia bridge (124) and transmitted on the data stream to each multimedia endpoint for display on a window on each multimedia terminal. Similarly, a document to be shared by a multimedia endpoint is transmitted on a data stream to the multimedia bridge, where it is bridged on the data stream transmitted to the other multimedia endpoints and to the server. The document is then transmitted from the server to the facsimile machine or PC terminal associated with the POTS endpoint. In conventional multimedia conferencing arrangements, voice-activated switching is used to determine which user's video image is bridged onto the video stream transmitted to each multimedia terminal. When the audio signal from the POTS user would cause a video signal from that user's terminal to be bridged to all the multimedia endpoints if in fact that user was at a multimedia terminal, a stored image of that user is retrieved from a database (151) and outputted by the bridge on the video stream transmitted to each multimedia terminal to enable the multimedia participants to visually identify the presently talking enduser.
摘要:
Multimedia content is provided based on a set of assignment weights. A portion of a first multimedia object and a portion of a second multimedia object are buffered. The portion of multimedia content of the first multimedia object and the portion of multimedia content of the second multimedia object are accepted. The portion of multimedia content of the first multimedia object corresponds to the first assignment weight; the portion of multimedia content of the second multimedia object corresponds to the second assignment weight. The portion of multimedia content of the first multimedia object and the portion of multimedia content of the second multimedia object are stored.
摘要:
A method and apparatus that enhances a multicast information stream, such as an IP multicast session, in a communication network is provided. The stream is received through the communication network and is enhanced at substantially the time the first stream is received. The information stream may be enhanced by adding transcribed content, such as content generated by speech recognition software, or translated content, such as from a first language to a second language, to the stream. The information stream may also be enhanced by adding content to the first information stream, such as content is related to the original content. The enhanced stream may be sent to a user as a second multicast information stream. The enhanced stream may be received by the user in place of, or along with, the original information stream. The enhanced content may be sent to the user at the conclusion of the information stream, if desired.
摘要:
A network-based voice messaging system is provided. A voice message is received at a network. The network converts the voice message into a text message by utilizing speech recognition software. The text message is transmitted to the intended recipient as an electronic mail (e-mail) message or facsimile document and is received by the intended recipient on conventional text receiving equipment.
摘要:
A network based platform uses face recognition, speech recognition, background change detection and key scene events to index multimedia communications. Before the multimedia communication begins, active participants register their speech and face models with a server. The process consists of creating a speech sample, capturing a sample image of the participant and storing the data in a database. The server provides an indexing function for the multimedia communication. During the multimedia communication, metadata including time stamping is retained along with the multimedia content. The time stamping information is used for synchronizing the multimedia elements. The multimedia communication is then processed through the server to identify the multimedia communication participants based on speaker and face recognition models. This allows the server to create an index table that becomes an index of the multimedia communication. In addition, through scene change detection and background recognition, certain backgrounds and key scene information can be used for indexing. Therefore, through this indexing apparatus and method, a specific participant can be recognized as speaking and the content that the participant discussed can also be used for indexing.
摘要:
The present invention provides an apparatus and method for conducting a video conference. The video conference apparatus is connected to at least one network. The video conference apparatus includes a MCU, an environment processor, a user database interface and an environment database interface. When users log onto the video conference apparatus, it is determined whether each user has designated an alternative environment from that normally detected by the camera device during the video conference. If the user has designated an alternative environment, the environment processor obtains the environment from the environment database and the video conference apparatus uses the designated environment during the video conference. However, if the user has not designated an alternative environment, the environment processor sends a request message providing a listing of possible environments which may be used during the video conference. Thus, the user may select a desired environment from the listing and use it during the video conference. If the user does not wish to select an alternative environment, a default environment corresponding to the environment normally detected through the camera device is used during the video conference.