Translingual visual speech synthesis
    1.
    发明授权
    Translingual visual speech synthesis 失效
    横向视觉语音综合

    公开(公告)号:US06813607B1

    公开(公告)日:2004-11-02

    申请号:US09494582

    申请日:2000-01-31

    IPC分类号: G10L1100

    摘要: A computer implemented method in a language independent system generates audio-driven facial animation given the speech recognition system for just one language. The method is based on the recognition that once alignment is generated, the mapping and the animation hardly have any language dependency in them. Translingual visual speech synthesis can be achieved if the first step of alignment generation can be made speech independent. Given a speech recognition system for a base language, the method synthesizes video with speech of any novel language as the input.

    摘要翻译: 语言独立系统中的计算机实现的方法产生音频驱动的面部动画,给出仅一种语言的语音识别系统。 该方法基于识别一旦生成对齐,映射和动画在它们中几乎没有任何语言依赖关系。 如果可以使语音不依赖于对准生成的第一步,则可以实现视觉语音合成。 给定基本语言的语音识别系统,该方法以任何新颖语言的语音合成视频作为输入。

    Late integration in audio-visual continuous speech recognition
    2.
    发明授权
    Late integration in audio-visual continuous speech recognition 有权
    视听连续语音识别的后期整合

    公开(公告)号:US06633844B1

    公开(公告)日:2003-10-14

    申请号:US09452919

    申请日:1999-12-02

    IPC分类号: G10L1504

    CPC分类号: G10L15/25

    摘要: The combination of audio and video speech recognition in a manner to improve the robustness of speech recognition systems in noisy environments. Contemplated are methods and apparatus in which a video signal associated with a video source and an audio signal associated with the video signal are processed, the most likely viseme associated with the audio signal and video signal is determined and, thereafter, the most likely phoneme associated with the audio signal and video signal is determined.

    摘要翻译: 音频和视频语音识别的组合以提高语音识别系统在嘈杂环境中的鲁棒性的方式。 考虑到其中处理与视频源相关联的视频信号和与视频信号相关联的音频信号的方法和装置,确定与音频信号和视频信号相关联的最可能的视觉,并且之后,最可能的音素相关联 音频信号和视频信号被确定。

    Method and system for adapter configuration in a data processing system
    5.
    发明授权
    Method and system for adapter configuration in a data processing system 失效
    数据处理系统中适配器配置的方法和系统

    公开(公告)号:US5619701A

    公开(公告)日:1997-04-08

    申请号:US631743

    申请日:1996-04-04

    申请人: Chalapathy Neti

    发明人: Chalapathy Neti

    IPC分类号: G06F12/06 G06F15/177 G06F9/00

    摘要: A method and system for sequence independent configuration of adapters installed in a data processing system. Adapters such as disk drive controllers, Token Ring adapters, terminal emulators and the like each include multiple choices associated therewith which specify selected memory allocations which must be utilized in configuring the adapters. A determination is first made of the number of possible combinations of such choices which exist, and if that number is not substantial, an exhaustive evaluation of each possible combination is made to determine if a conflict exists. In the absence of a conflict, each combination is examined for an optimum allocation of memory which maximizes the number of sixteen kilobyte free memory pages remaining within the system memory after configuration for utilization by an expanded memory system. If the number of possible combinations exceeds a predetermined number, only a predetermined number of random combinations are evaluated and an optimum allocation is selected from those random combinations. In order to minimize the probability of chosing combinations with conflicts arising from system utilization of duplicate adapters, a random choice for each successive adapter is selected which, with a high degree of probability, is not identical to a choice selected for a previous adapter.

    摘要翻译: 用于顺序独立配置安装在数据处理系统中的适配器的方法和系统。 诸如磁盘驱动器控制器,令牌环适配器,终端仿真器等的适配器各自包括与其相关联的多个选项,其指定必须在配置适配器中使用的所选择的存储器分配。 首先确定存在的这种选择的可能组合的数量,并且如果该数量不大,则进行每个可能组合的详尽评估以确定是否存在冲突。 在没有冲突的情况下,检查每个组合以便最佳地分配存储器,这使存储器内的十六千字节可用存储器页数保持在系统存储器中,以供扩展存储器系统利用后配置。 如果可能组合的数量超过预定数量,则仅评估预定数量的随机组合,并从这些随机组合中选择最佳分配。 为了最小化从系统利用重复适配器引起的冲突选择组合的可能性,选择每个连续适配器的随机选择,其高度概率与为先前适配器选择的选择不同。

    Method, apparatus, and program for cross-linking information sources using multiple modalities
    6.
    发明申请
    Method, apparatus, and program for cross-linking information sources using multiple modalities 审中-公开
    使用多种模式来交流信息源的方法,装置和程序

    公开(公告)号:US20050038814A1

    公开(公告)日:2005-02-17

    申请号:US10640894

    申请日:2003-08-13

    IPC分类号: G06F17/30 G06F7/00

    摘要: A mechanism is provided for cross-linking information sources using multiple modalities. Text documents, images, audio sources, video, and other media are analyzed to determine media descriptors, which are metadata describing the content of the media sources. The media descriptors from all modalities are collated and cross-linked. A query processing and presentation module, which receives queries and presents results, may also be provided. A query may consist of textual keywords from user input. Alternatively, a query may derive from a media source, such as a text document, image, audio source, or video source.

    摘要翻译: 提供了一种使用多种模式来交换信息源的机制。 分析文本文档,图像,音频源,视频和其他媒体以确定媒体描述符,其是描述媒体源的内容的元数据。 来自所有模式的媒体描述符进行整理和交互。 还可以提供接收查询和呈现结果的查询处理和呈现模块。 查询可以由用户输入的文本关键字组成。 或者,查询可以从诸如文本文档,图像,音频源或视频源的媒体源导出。

    System and method for presenting and browsing information
    8.
    发明申请
    System and method for presenting and browsing information 审中-公开
    用于呈现和浏览信息的系统和方法

    公开(公告)号:US20050203748A1

    公开(公告)日:2005-09-15

    申请号:US10797847

    申请日:2004-03-10

    IPC分类号: G06F7/00

    摘要: Disclosed is a system and method for presenting and browsing information, comprising the steps of classifying the information into a plurality of classes and sub-classes, each class having at least one sub-class; and presenting the plurality of classes of information to a user. The a system and method capable of interactively controlling the presentation of the sub-classes.

    摘要翻译: 公开了一种用于呈现和浏览信息的系统和方法,包括以下步骤:将信息分类成多个类和子类,每个类具有至少一个子类; 并向用户呈现多个类别的信息。 能够交互地控制子类的呈现的系统和方法。

    Device and method for trainable radio scanning
    9.
    发明授权
    Device and method for trainable radio scanning 失效
    无线电扫描的装置和方法

    公开(公告)号:US06611678B1

    公开(公告)日:2003-08-26

    申请号:US09677086

    申请日:2000-09-29

    IPC分类号: H04B116

    摘要: A trainable radio scanner, including a station monitoring circuit to scan a plurality of radio frequencies and extract audio samples of a predetermined duration from each one of the plurality of radio frequencies having a signal strength above a reception threshold; a memory storing audio classification data and the plurality of audio samples; and an audio analyzer to analyze each one of the plurality of audio samples using the audio classification data and classifies each audio sample into a musical style category; a style discriminator to control a radio station scanning operation of the radio receiver to tune only to preferred radio stations having a radio frequency at which the corresponding audio sample is classified in at least one preferred musical style category.

    摘要翻译: 一种可训练无线电扫描仪,包括:站监视电路,用于扫描多个射频,并从具有高于接收阈值的信号强度的多个射频中的每个射频提取预定持续时间的音频样本; 存储音频分类数据和多个音频样本的存储器; 以及音频分析器,用于使用音频分类数据分析多个音频样本中的每一个,并将每个音频样本分类为音乐风格类别; 用于控制无线电接收机的无线电台扫描操作的风格鉴别器,仅调谐到具有射频的优选无线电台,在该无线电台上相应的音频样本被分类到至少一个优选的音乐风格类别中。

    Automated decision making using time-varying stream reliability prediction
    10.
    发明授权
    Automated decision making using time-varying stream reliability prediction 失效
    使用时变流可靠性预测的自动决策

    公开(公告)号:US07228279B2

    公开(公告)日:2007-06-05

    申请号:US10397762

    申请日:2003-03-26

    IPC分类号: G10L17/00

    CPC分类号: G10L17/06 G10L17/20

    摘要: Automated decision making techniques are provided. For example, a technique for generating a decision associated with an individual or an entity includes the following steps. First, two or more data streams associated with the individual or the entity are captured. Then, at least one time-varying measure is computed in accordance with the two or more data streams. Lastly, a decision is computed based on the at least one time-varying measure. One form of the time-varying measure may include a measure of the coverage of a model associated with previously-obtained training data by at least a portion of the captured data. Another form of the time-varying measure may include a measure of the stability of at least a portion of the captured data. While either measure may be employed alone to compute a decision, preferably both the coverage and stability measures are employed. The technique may be used to authenticate a speaker.

    摘要翻译: 提供自动决策技术。 例如,用于生成与个体或实体相关联的决定的技术包括以下步骤。 首先,捕获与个体或实体相关联的两个或多个数据流。 然后,根据两个或多个数据流来计算至少一个时变度量。 最后,基于至少一个时变度量来计算决定。 时变测量的一种形式可以包括通过所捕获的数据的至少一部分与先前获得的训练数据相关联的模型的覆盖度的度量。 时变措施的另一种形式可以包括所捕获的数据的至少一部分的稳定性的度量。 尽管可以单独使用任一种方法来计算决策,但优选采用覆盖和稳定性度量。 该技术可用于认证扬声器。