Patent search: ap:("Hugh Adams Jr." OR "Giridharen Iyengar" OR "Ching-Yung Lin" OR "Chalapathy Neti" OR "John Smith" OR "Belle Tseng") AND inv:"Chalapathy Neti", page 1

1.

Patent application
System and method for annotating multi-modal characteristics in multimedia documents [Status: active]

Publication No.: US20060218481A1

Publication date: 2006-09-28

Application No.: US10539890

Filing date: 2003-12-19

Applicant: Hugh Adams Jr., Giridharen Iyengar, Ching-Yung Lin, Chalapathy Neti, John Smith, Belle Tseng

Inventor: Hugh Adams Jr., Giridharen Iyengar, Ching-Yung Lin, Chalapathy Neti, John Smith, Belle Tseng

IPC classification: G06F15/00

CPC classification: G06F17/30038, G06F17/241

Abstract: A manual annotation system for multi-modal characteristics in multimedia files. There is provided an arrangement for selecting an observation modality of video with audio, video without audio, audio with video, or audio without video, to be used to annotate multimedia content. While annotating video or audio features in isolation results in less confidence in the identification of features, observing both audio and video simultaneously and annotating that observation results in a higher confidence level.

2.

Patent application
Audio-visual codebook dependent cepstral normalization [Status: active]

Publication No.: US20080059181A1

Publication date: 2008-03-06

Application No.: US11932996

Filing date: 2007-10-31

Applicant: Sabine Deligne, Chalapathy Neti, Gerasimos Potamianos

Inventor: Sabine Deligne, Chalapathy Neti, Gerasimos Potamianos

IPC classification: G10L15/00

CPC classification: G10L15/20, G10L15/24

Abstract: An arrangement for yielding enhanced audio features towards the provision of enhanced audio-visual features for speech recognition. Input is provided in the form of noisy audio-visual features and noisy audio features related to the noisy audio-visual features.

3.

Granted patent
Method and system for adapter configuration in a data processing system [Status: lapsed]

Publication No.: US5619701A

Publication date: 1997-04-08

Application No.: US631743

Filing date: 1996-04-04

Applicant: Chalapathy Neti

Inventor: Chalapathy Neti

IPC classification: G06F12/06, G06F15/177, G06F9/00

CPC classification: G06F9/4411, G06F12/0661, G06F15/177

Abstract: A method and system for sequence-independent configuration of adapters installed in a data processing system. Adapters such as disk drive controllers, Token Ring adapters, terminal emulators and the like each include multiple choices associated therewith which specify selected memory allocations which must be utilized in configuring the adapters. A determination is first made of the number of possible combinations of such choices which exist, and if that number is not substantial, an exhaustive evaluation of each possible combination is made to determine if a conflict exists. In the absence of a conflict, each combination is examined for an optimum allocation of memory which maximizes the number of sixteen kilobyte free memory pages remaining within the system memory after configuration for utilization by an expanded memory system. If the number of possible combinations exceeds a predetermined number, only a predetermined number of random combinations are evaluated and an optimum allocation is selected from those random combinations. In order to minimize the probability of choosing combinations with conflicts arising from system utilization of duplicate adapters, a random choice for each successive adapter is selected which, with a high degree of probability, is not identical to a choice selected for a previous adapter.
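
The selection procedure this abstract describes (exhaustive evaluation when the number of combinations is small, random sampling otherwise, then picking the conflict-free combination that leaves the most free 16 KB pages) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the function names, the `(start, length)` representation of a memory allocation, the `limit` parameter, and the page-aligned `window` are all hypothetical, and the duplicate-adapter rule is simplified to "skip a choice already picked when an alternative exists".

```python
import itertools
import random

PAGE = 16 * 1024  # sixteen-kilobyte page size named in the abstract


def conflicts(ranges):
    """Return True if any two (start, length) memory ranges overlap."""
    ordered = sorted(ranges)
    return any(a_start + a_len > b_start
               for (a_start, a_len), (b_start, _) in zip(ordered, ordered[1:]))


def free_pages(ranges, window):
    """Count 16 KB pages in the (assumed page-aligned) window untouched by any range."""
    lo, hi = window
    count = 0
    for page in range(lo, hi, PAGE):
        if all(start + length <= page or start >= page + PAGE
               for start, length in ranges):
            count += 1
    return count


def random_combo(adapters, rng):
    """Pick one choice per adapter, avoiding choices already picked (simplified rule)."""
    picked = []
    for choices in adapters:
        fresh = [c for c in choices if c not in picked] or choices
        picked.append(rng.choice(fresh))
    return tuple(picked)


def configure(adapters, window, limit=1000, seed=0):
    """adapters: one list of candidate (start, length) choices per adapter.

    Returns (best_combination, free_page_count), or (None, -1) if every
    evaluated combination conflicts.
    """
    total = 1
    for choices in adapters:
        total *= len(choices)
    if total <= limit:
        # Few enough combinations: evaluate all of them.
        candidates = itertools.product(*adapters)
    else:
        # Too many: evaluate only `limit` random combinations.
        rng = random.Random(seed)
        candidates = (random_combo(adapters, rng) for _ in range(limit))
    best, best_free = None, -1
    for combo in candidates:
        if conflicts(combo):
            continue
        n = free_pages(combo, window)
        if n > best_free:
            best, best_free = combo, n
    return best, best_free
```

Because a choice that straddles a page boundary spoils two pages instead of one, the page count, rather than the raw number of free bytes, is what separates otherwise similar conflict-free combinations in this sketch.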

4.

Patent application
Method, apparatus, and program for cross-linking information sources using multiple modalities [Status: under examination, published]

Publication No.: US20050038814A1

Publication date: 2005-02-17

Application No.: US10640894

Filing date: 2003-08-13

Applicant: Giridharan Iyengar, Chalapathy Neti, Harriet Nock

Inventor: Giridharan Iyengar, Chalapathy Neti, Harriet Nock

IPC classification: G06F17/30, G06F7/00

CPC classification: G06F16/7834, G06F16/48, G06F16/784

Abstract: A mechanism is provided for cross-linking information sources using multiple modalities. Text documents, images, audio sources, video, and other media are analyzed to determine media descriptors, which are metadata describing the content of the media sources. The media descriptors from all modalities are collated and cross-linked. A query processing and presentation module, which receives queries and presents results, may also be provided. A query may consist of textual keywords from user input. Alternatively, a query may derive from a media source, such as a text document, image, audio source, or video source.

5.

Granted patent
Translingual visual speech synthesis [Status: lapsed]

Publication No.: US06813607B1

Publication date: 2004-11-02

Application No.: US09494582

Filing date: 2000-01-31

Applicant: Tanveer Afzal Faruquie, Chalapathy Neti, Nitendra Rajput, L. Venkata Subramaniam, Ashish Verma

Inventor: Tanveer Afzal Faruquie, Chalapathy Neti, Nitendra Rajput, L. Venkata Subramaniam, Ashish Verma

IPC classification: G10L11/00

CPC classification: G10L13/00, G10L15/00, G10L21/06, G10L2021/105

Abstract: A computer-implemented method in a language-independent system generates audio-driven facial animation given a speech recognition system for just one language. The method is based on the recognition that once alignment is generated, the mapping and the animation hardly have any language dependency in them. Translingual visual speech synthesis can be achieved if the first step of alignment generation can be made speech independent. Given a speech recognition system for a base language, the method synthesizes video with speech of any novel language as the input.

6.

Granted patent
Late integration in audio-visual continuous speech recognition [Status: active]

Publication No.: US06633844B1

Publication date: 2003-10-14

Application No.: US09452919

Filing date: 1999-12-02

Applicant: Ashish Verma, Sankar Basu, Chalapathy Neti

Inventor: Ashish Verma, Sankar Basu, Chalapathy Neti

IPC classification: G10L15/04

CPC classification: G10L15/25

Abstract: The combination of audio and video speech recognition in a manner that improves the robustness of speech recognition systems in noisy environments. Contemplated are methods and apparatus in which a video signal associated with a video source and an audio signal associated with the video signal are processed, the most likely viseme associated with the audio signal and video signal is determined and, thereafter, the most likely phoneme associated with the audio signal and video signal is determined.

7.

Patent application
Non-linear example ordering with cached lexicon and optional detail-on-demand in digital annotation [Status: under examination, published]

Publication No.: US20050246625A1

Publication date: 2005-11-03

Application No.: US10836843

Filing date: 2004-04-30

Applicant: Giridharan Iyengar, Chalapathy Neti, Harriet Nock

Inventor: Giridharan Iyengar, Chalapathy Neti, Harriet Nock

IPC classification: G06F17/24, G06F17/30

CPC classification: G06F17/241, G06F16/48

Abstract: Methods and arrangements for annotating digital input. Digital media input is accepted, with the input being arranged in frames, while, in annotating, at least one of the following is performed: the presentation of frames for annotation in non-linear fashion; and the employment of a cached annotation lexicon for applying labels to frames.

8.

Patent application
System and method for presenting and browsing information [Status: under examination, published]

Publication No.: US20050203748A1

Publication date: 2005-09-15

Application No.: US10797847

Filing date: 2004-03-10

Applicant: Anthony Levas, Chalapathy Neti, Joseph Branc, Terence West

Inventor: Anthony Levas, Chalapathy Neti, Joseph Branc, Terence West

IPC classification: G06F7/00

CPC classification: G06F16/358, G06F16/353, G06F16/93

Abstract: Disclosed is a system and method for presenting and browsing information, comprising the steps of classifying the information into a plurality of classes and sub-classes, each class having at least one sub-class; and presenting the plurality of classes of information to a user. The system and method are capable of interactively controlling the presentation of the sub-classes.

9.

Granted patent
Device and method for trainable radio scanning [Status: lapsed]

Publication No.: US06611678B1

Publication date: 2003-08-26

Application No.: US09677086

Filing date: 2000-09-29

Applicant: Geoffrey Zweig, Chalapathy Neti

Inventor: Geoffrey Zweig, Chalapathy Neti

IPC classification: H04B1/16

CPC classification: H04H60/65, H03J1/0075, H03J1/0091, H03J2200/20, H04H20/26, H04H60/27, H04H60/37, H04H60/46, H04H60/47, H04H60/58

Abstract: A trainable radio scanner, including a station monitoring circuit to scan a plurality of radio frequencies and extract audio samples of a predetermined duration from each one of the plurality of radio frequencies having a signal strength above a reception threshold; a memory storing audio classification data and the plurality of audio samples; an audio analyzer to analyze each one of the plurality of audio samples using the audio classification data and classify each audio sample into a musical style category; and a style discriminator to control a radio station scanning operation of the radio receiver to tune only to preferred radio stations having a radio frequency at which the corresponding audio sample is classified in at least one preferred musical style category.
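
The scanning flow in this abstract (keep only frequencies above the reception threshold, classify each extracted sample into a musical style, then tune only to stations in a preferred style) might look roughly like the sketch below. Everything named here is an assumption made for illustration: the `Station` record, the `scan` function, and in particular the `classify` callable, which stands in for the audio analyzer and its classification data, neither of which the abstract specifies.

```python
from dataclasses import dataclass


@dataclass
class Station:
    frequency_mhz: float
    signal_strength: float
    sample: object  # placeholder for an audio sample of predetermined duration


def scan(stations, classify, threshold, preferred):
    """Return the frequencies worth tuning to.

    classify: hypothetical callable mapping an audio sample to a style label.
    threshold: reception threshold; weaker stations are skipped unclassified.
    preferred: set of preferred musical style labels.
    """
    tuned = []
    for station in stations:
        if station.signal_strength <= threshold:
            continue  # below the reception threshold: no sample is analyzed
        style = classify(station.sample)
        if style in preferred:
            tuned.append(station.frequency_mhz)
    return tuned
```

With a toy classifier such as `lambda sample: sample.split("-")[0]` and samples labelled like `"jazz-sample"`, calling `scan(stations, classify, 0.5, {"jazz"})` returns only the frequencies of sufficiently strong jazz stations.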

10.

Granted patent
Automated decision making using time-varying stream reliability prediction [Status: lapsed]

Publication No.: US07228279B2

Publication date: 2007-06-05

Application No.: US10397762

Filing date: 2003-03-26

Applicant: Upendra V. Chaudhari, Chalapathy Neti, Gerasimos Potamianos, Ganesh N. Ramaswamy

Inventor: Upendra V. Chaudhari, Chalapathy Neti, Gerasimos Potamianos, Ganesh N. Ramaswamy

IPC classification: G10L17/00

CPC classification: G10L17/06, G10L17/20

Abstract: Automated decision making techniques are provided. For example, a technique for generating a decision associated with an individual or an entity includes the following steps. First, two or more data streams associated with the individual or the entity are captured. Then, at least one time-varying measure is computed in accordance with the two or more data streams. Lastly, a decision is computed based on the at least one time-varying measure. One form of the time-varying measure may include a measure of the coverage of a model associated with previously-obtained training data by at least a portion of the captured data. Another form of the time-varying measure may include a measure of the stability of at least a portion of the captured data. While either measure may be employed alone to compute a decision, preferably both the coverage and stability measures are employed. The technique may be used to authenticate a speaker.
