Method and apparatus to enhance a multicast information stream in a communication network
    1.
    发明授权
    Method and apparatus to enhance a multicast information stream in a communication network 有权
    在通信网络中增强多播信息流的方法和装置

    公开(公告)号:US06845399B2

    公开(公告)日:2005-01-18

    申请号:US10124307

    申请日:2002-04-18

    摘要: A method and apparatus that enhances a multicast information stream, such as an IP multicast session, in a communication network is provided. The stream is received through the communication network and is enhanced at substantially the time the first stream is received. The information stream may be enhanced by adding transcribed content, such as content generated by speech recognition software, or translated content, such as from a first language to a second language, to the stream. The information stream may also be enhanced by adding content to the first information stream, such as content is related to the original content. The enhanced stream may be sent to a user as a second multicast information stream. The enhanced stream may be received by the user in place of, or along with, the original information stream. The enhanced content may be sent to the user at the conclusion of the information stream, if desired.

    摘要翻译: 提供了一种在通信网络中增强诸如IP多播会话之类的多播信息流的方法和装置。 通过通信网络接收流,并且在基本上在接收到第一流时被增强。 信息流可以通过将诸如由语音识别软件生成的内容或诸如从第一语言到第二语言的翻译内容的转录内容添加到流来增强。 还可以通过将内容添加到第一信息流来增强信息流,诸如内容与原始内容有关。 可以将增强流作为第二多播信息流发送给用户。 增强流可以由用户代替原始信息流或与原始信息流一起接收。 如果需要,增强内容可以在信息流的结束时发送给用户。

    Indexing multimedia communications
    2.
    发明授权
    Indexing multimedia communications 失效
    索引多媒体通信

    公开(公告)号:US06377995B2

    公开(公告)日:2002-04-23

    申请号:US09025940

    申请日:1998-02-19

    IPC分类号: G06F1516

    摘要: A network based platform uses face recognition, speech recognition, background change detection and key scene events to index multimedia communications. Before the multimedia communication begins, active participants register their speech and face models with a server. The process consists of creating a speech sample, capturing a sample image of the participant and storing the data in a database. The server provides an indexing function for the multimedia communication. During the multimedia communication, metadata including time stamping is retained along with the multimedia content. The time stamping information is used for synchronizing the multimedia elements. The multimedia communication is then processed through the server to identify the multimedia communication participants based on speaker and face recognition models. This allows the server to create an index table that becomes an index of the multimedia communication. In addition, through scene change detection and background recognition, certain backgrounds and key scene information can be used for indexing. Therefore, through this indexing apparatus and method, a specific participant can be recognized as speaking and the content that the participant discussed can also be used for indexing.

    摘要翻译: 基于网络的平台使用人脸识别,语音识别,背景变化检测和关键场景事件来索引多媒体通信。 在多媒体通信开始之前,主动参与者用服务器注册他们的语音和面部模型。 该过程包括创建语音样本,捕获参与者的样本图像并将数据存储在数据库中。 服务器为多媒体通信提供索引功能。 在多媒体通信期间,包含时间戳的元数据与多媒体内容一起被保留。 时间戳信息用于同步多媒体元素。 然后通过服务器处理多媒体通信,以基于扬声器和人脸识别模型识别多媒体通信参与者。 这允许服务器创建成为多媒体通信的索引的索引表。 另外,通过场景变化检测和背景识别,可以将某些背景和关键场景信息用于索引。 因此,通过该索引装置和方法,可以将特定的参与者识别为说话,并且参与者讨论的内容也可以用于索引。

    Method and apparatus to enhance a multicast information stream in a communication network
    3.
    发明授权
    Method and apparatus to enhance a multicast information stream in a communication network 有权
    在通信网络中增强多播信息流的方法和装置

    公开(公告)号:US06412011B1

    公开(公告)日:2002-06-25

    申请号:US09152404

    申请日:1998-09-14

    IPC分类号: G06F1516

    摘要: A method and apparatus that enhances a multicast information stream, such as an IP multicast session, in a communication network is provided. The stream is received through the communication network and is enhanced at substantially the time the first stream is received. The information stream may be enhanced by adding transcribed content, such as content generated by speech recognition software, or translated content, such as from a first language to a second language, to the stream. The information stream may also be enhanced by adding content to the first information stream, such as content related to the original content. The enhanced stream may be sent to a user as a second multicast information stream. The enhanced stream may be received by the user in place of, or along with, the original information stream. The enhanced content may be sent to the user at the conclusion of the information stream, if desired.

    摘要翻译: 提供了一种在通信网络中增强诸如IP多播会话之类的多播信息流的方法和装置。 通过通信网络接收流,并且在基本上在接收到第一流时被增强。 信息流可以通过将诸如由语音识别软件产生的内容或诸如从第一语言到第二语言的翻译内容的转录内容添加到流来增强。 还可以通过将内容添加到第一信息流(例如与原始内容相关的内容)来增强信息流。 可以将增强流作为第二多播信息流发送给用户。 增强流可以由用户代替原始信息流或与原始信息流一起接收。 如果需要,增强内容可以在信息流的结束时发送给用户。

    Method and apparatus for on-hold switching
    4.
    发明授权
    Method and apparatus for on-hold switching 有权
    保持开关的方法和装置

    公开(公告)号:US06208729B1

    公开(公告)日:2001-03-27

    申请号:US09174013

    申请日:1998-10-16

    IPC分类号: H04M300

    CPC分类号: H04M3/428

    摘要: The invention provides an on-hold switching device that permits a subscriber to be engaged in other activities through a telephone network while being placed on-hold by another party. When placed on-hold, the on-hold switching device disconnects the subscriber from the other party and connects the subscriber to the telephone network so that the subscriber may engage in other activities. The on-hold switching device monitors the signal bus connected to the other party to determine whether the on-hold condition is removed. When the on-hold condition is removed, the subscriber is reconnected to the other party. Thus, the on-hold switching device permits a subscriber to be engaged in other activities when placed on-hold by the other party.

    摘要翻译: 本发明提供了一种保持开关装置,其允许用户通过电话网络从事其他活动,同时被另一方置于保持状态。 当保持时,保持开关设备将用户与另一方断开连接,并将用户连接到电话网络,使得用户可以从事其他活动。 保持开关装置监视与另一方相连的信号总线,以确定是否取消保持状态。 当保持状态被移除时,用户被重新连接到另一方。 因此,保持开关装置允许用户在被另一方置于保持状态时从事其他活动。

    Apparatus and method for incorporating virtual video conferencing environments
    5.
    发明授权
    Apparatus and method for incorporating virtual video conferencing environments 失效
    用于结合虚拟视频会议环境的装置和方法

    公开(公告)号:US06414707B1

    公开(公告)日:2002-07-02

    申请号:US09174014

    申请日:1998-10-16

    IPC分类号: H04N714

    CPC分类号: H04N7/152 H04N7/147

    摘要: The present invention provides an apparatus and method for conducting a video conference. The video conference apparatus is connected to at least one network. The video conference apparatus includes a MCU, an environment processor, a user database interface and an environment database interface. When users log onto the video conference apparatus, it is determined whether each user has designated an alternative environment from that normally detected by the camera device during the video conference. If the user has designated an alternative environment, the environment processor obtains the environment from the environment database and the video conference apparatus uses the designated environment during the video conference. However, if the user has not designated an alternative environment, the environment processor sends a request message providing a listing of possible environments which may be used during the video conference. Thus, the user may select a desired environment from the listing and use it during the video conference. If the user does not wish to select an alternative environment, a default environment corresponding to the environment normally detected through the camera device is used during the video conference.

    摘要翻译: 本发明提供了一种用于进行视频会议的装置和方法。 视频会议装置连接至少一个网络。 视频会议装置包括MCU,环境处理器,用户数据库接口和环境数据库接口。 当用户登录到视频会议装置时,确定每个用户是否在视频会议期间通常由相机装置检测到的环境指定了替代环境。 如果用户已经指定了替代环境,则环境处理器从环境数据库获得环境,视频会议设备在视频会议期间使用指定的环境。 然而,如果用户没有指定替代环境,则环境处理器发送请求消息,提供在视频会议期间可能使用的可能环境的列表。 因此,用户可以从列表中选择期望的环境并在视频会议期间使用它。 如果用户不想选择替代环境,则在视频会议期间使用通常通过相机设备检测到的环境对应的默认环境。

    Network information delivery system for delivering information based on
end user terminal requirements
    7.
    发明授权
    Network information delivery system for delivering information based on end user terminal requirements 失效
    网络信息传递系统,用于根据最终用户终端要求提供信息

    公开(公告)号:US6035339A

    公开(公告)日:2000-03-07

    申请号:US816234

    申请日:1997-03-13

    IPC分类号: H04L29/06 H04L29/08 G06F13/00

    摘要: A network information delivery system automatically determines end-user information output requirements based on predetermined data corresponding to each requesting end-user terminal. A user profile is maintained in a database either associated with a network information delivery system or with the end-user terminal and is accessed by the network information delivery device. If the network information delivery device has authority to access the end-user terminal, a program may be downloaded to the end-user terminal to determine the exact end-user terminal configuration. The program executing in the end-user terminal returns to the network information delivery device a user profile containing the end-user terminal capabilities so that the requested information may be formatted and delivered to the end-user in an optimal manner. The information to be delivered to end-users may be pre-stored in predetermined formats. The predetermined formats may be determined based on a volume of requests and the characteristic of the information. The information may also be stored in a generic format so that packaging the information for a specific user may be efficiently and timely performed.

    摘要翻译: 网络信息传递系统基于与每个请求终端用户终端对应的预定数据,自动确定最终用户信息输出要求。 用户简档保存在与网络信息传递系统或终端用户终端相关联并由网络信息传递设备访问的数据库中。 如果网络信息传递设备具有访问终端用户终端的权限,则可以向最终用户终端下载程序以确定确切的终端用户终端配置。 在最终用户终端中执行的程序向网络信息传递设备返回包含最终用户终端能力的用户简档,以便所请求的信息可以被格式化并以最佳方式传送给最终用户。 要传送给最终用户的信息可以预先存储为预定格式。 可以基于请求的量和信息的特性来确定预定格式。 信息也可以以通用格式存储,从而可以有效和及时地执行用于特定用户的信息的打包。

    Selective noise/channel/coding models and recognizers for automatic speech recognition
    10.
    再颁专利
    Selective noise/channel/coding models and recognizers for automatic speech recognition 有权
    选择性噪声/信道/编码模型和自动语音识别识别器

    公开(公告)号:USRE45289E1

    公开(公告)日:2014-12-09

    申请号:US09978250

    申请日:2001-10-17

    IPC分类号: G10L15/20 G10L15/26

    CPC分类号: G10L15/20

    摘要: An apparatus and method for the robust recognition of speech during a call in a noisy environment is presented. Specific background noise models are created to model various background noises which may interfere in the error free recognition of speech. These background noise models are then used to determine which noise characteristics a particular call has. Once a determination has been made of the background noise in any given call, speech recognition is carried out using the appropriate background noise model.

    摘要翻译: 提出了一种用于在嘈杂环境中的呼叫期间鲁棒识别语音的装置和方法。 创建特定的背景噪声模型来模拟可能干扰语音的无错误识别的各种背景噪声。 然后使用这些背景噪声模型来确定特定呼叫的哪个噪声特性。 一旦确定了任何给定呼叫中的背景噪声,则使用适当的背景噪声模型进行语音识别。