-
公开(公告)号:US07567722B2
公开(公告)日:2009-07-28
申请号:US11087226
申请日:2005-03-22
CPC分类号: H04N19/192 , H04N19/115 , H04N19/124 , H04N19/132 , H04N19/136 , H04N19/147 , H04N19/149 , H04N19/15 , H04N19/172 , H04N19/18 , H04N19/60
摘要: A dynamically scaled file encoding method and apparatus are disclosed. A file encoding system using JPEG encoding can be configured to produce relatively constant compressed file sizes irrespective of the initial file size and file contents. The system retrieves an initial file or image that is to be compressed and determines a target bit rate corresponding to the compressed file. The target bit rate is used to determine an initial scaling factor. The initial file is encoded using a JPEG encoder having coefficients scaled by the initial scaling factor. The resultant bit rate can be adjusted in a second loop if greater than the desired bit rate. To adjust the bit rate, a recomputed scaling factor is determined from the resultant bit rate. The initial file is then encoded with coefficients scaled by the recomputed scaling factor to achieve a bit rate that is within the target bit rate.
摘要翻译: 公开了一种动态缩放的文件编码方法和装置。 使用JPEG编码的文件编码系统可以被配置为产生相对恒定的压缩文件大小,而与初始文件大小和文件内容无关。 系统检索要压缩的初始文件或图像,并确定与压缩文件对应的目标比特率。 目标比特率用于确定初始缩放因子。 使用具有由初始缩放因子缩放的系数的JPEG编码器对初始文件进行编码。 如果大于所需比特率,则可以在第二循环中调整所得比特率。 为了调整比特率,从所得比特率确定重新计算的比例因子。 然后用由重新计算的缩放因子缩放的系数对初始文件进行编码,以实现在目标比特率内的比特率。
-
公开(公告)号:US06836758B2
公开(公告)日:2004-12-28
申请号:US09757713
申请日:2001-01-09
申请人: Ning Bi , Andrew P. DeJaco , Harinath Garudadri , Chienchung Chang , William Yee-Ming Huang , Narendranath Malayath , Suhail Jalil , David Puig Oses , Yingyong Qi
发明人: Ning Bi , Andrew P. DeJaco , Harinath Garudadri , Chienchung Chang , William Yee-Ming Huang , Narendranath Malayath , Suhail Jalil , David Puig Oses , Yingyong Qi
IPC分类号: G10L1500
CPC分类号: G10L15/32 , G10L15/12 , G10L15/142
摘要: A method and system for speech recognition combines different types of engines in order to recognize user-defined digits and control words, predefined digits and control words, and nametags. Speaker-independent engines are combined with speaker-dependent engines. A Hidden Markov Model (HMM) engine is combined with Dynamic Time Warping (DTW) engines.
摘要翻译: 用于语音识别的方法和系统组合不同类型的引擎,以便识别用户定义的数字和控制词,预定义的数字和控制词以及名称。 扬声器独立引擎与扬声器相关的引擎相结合。 隐马尔可夫模型(HMM)引擎与动态时间扭曲(DTW)引擎相结合。
-
公开(公告)号:US5937381A
公开(公告)日:1999-08-10
申请号:US632723
申请日:1996-04-10
CPC分类号: G10L17/24
摘要: A system and a method is disclosed for verifying a voice of a user conducting a telephone transaction. The system and method includes a mechanism for prompting the user to speak in a limited vocabulary. A feature extractor converts the limited vocabulary into a plurality of speech frames. A pre-processor is coupled to the feature extractor for processing the plurality of speech frames to produce a plurality of processed frames. The processing includes frame selection, which eliminates each of the plurality of speech frames having an absence of words. A Viterbi decoder is also coupled to said feature extractor for assigning a frame label to each of the plurality of speech frames to produce a plurality of frame labels. The processed frames and frame labels are then combined to produce a voice model, which includes each of the plurality of frame labels that correspond to the number of plurality of processed frames. A mechanism is also provided for comparing the voice model with the claimant's voice model, derived during a previous enrollment session. The voice model also is compared with an alternate voice model set, derived during previous enrollment sessions. The identity claimed is accepted if the voice model matches the claimant's voice model better than the alternative voice model set.
摘要翻译: 公开了一种用于验证进行电话交易的用户的语音的系统和方法。 该系统和方法包括用于提示用户以有限的词汇表达的机制。 特征提取器将有限词汇转换成多个语音帧。 预处理器耦合到特征提取器,用于处理多个语音帧以产生多个经处理的帧。 该处理包括帧选择,其消除了不存在字的多个语音帧中的每一个。 维特比解码器还耦合到所述特征提取器,用于将帧标签分配给多个语音帧中的每一个以产生多个帧标签。 然后,处理的帧和帧标签被组合以产生语音模型,其包括与多个处理帧的数量相对应的多个帧标签中的每一个。 还提供了一种机制,用于将语音模型与在先前注册会话期间派生的索赔人的语音模型进行比较。 语音模型也与以前的注册会话中派生的替代语音模型集进行比较。 如果语音模型比替代语音模型集更好地与索赔人的语音模型匹配,则所接受的身份被接受。
-
-