METHOD AND APPARATUS FOR SEEDED USER INTEREST MODELING
    61.
    发明申请
    METHOD AND APPARATUS FOR SEEDED USER INTEREST MODELING 有权
    种子用户兴趣建模的方法与装置

    公开(公告)号:US20130013644A1

    公开(公告)日:2013-01-10

    申请号:US13637001

    申请日:2010-03-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867 G06F17/30734

    摘要: Methods and apparatuses are provided for user interest modeling. A method may include receiving an input from a user for specifying one or more topics from among a predetermined hierarchy of topics and subtopics. The method may additionally include retrieving one or more documents associated with the user and extracting language tokens from the documents based, at least in part, on the specified topics. Corresponding apparatuses are also provided.

    摘要翻译: 为用户兴趣建模提供了方法和设备。 一种方法可以包括从用户接收用于从主题和子主题的预定层级中指定一个或多个主题的输入。 该方法可以另外包括:至少部分地基于指定的主题来检索与用户相关联的一个或多个文档和从文档中提取语言令牌。 还提供了相应的装置。

    Method and Apparatus for Providing Context Attributes and Informational Links for Media Data
    62.
    发明申请
    Method and Apparatus for Providing Context Attributes and Informational Links for Media Data 审中-公开
    为媒体数据提供上下文属性和信息链接的方法和装置

    公开(公告)号:US20120303452A1

    公开(公告)日:2012-11-29

    申请号:US13576755

    申请日:2010-02-03

    IPC分类号: G06F3/01 G06F15/16 G06Q30/02

    CPC分类号: H04L51/02 H04L51/04

    摘要: Various embodiments include a method including receiving media data on an apparatus, and receiving one or more context attributes related to the apparatus or accessed by the apparatus. The method further includes determining whether the one or more context attributes relate to the media data, and causing, at least in part, display of the media data with the one or more context attributes that are determined to relate to the media data. Also, a method is provided that includes receiving media data on an apparatus, parsing the media data into one or more structured elements, determining one or more informational links that relate to the one or more structured elements of the media data, and causing, at least in part, display of the media data with the one or more informational links that are determined to relate to the one or more structured elements.

    摘要翻译: 各种实施例包括一种方法,包括在装置上接收媒体数据,以及接收与该装置相关或由该装置访问的一个或多个上下文属性。 所述方法还包括确定所述一个或多个上下文属性是否与所述媒体数据相关联,以及至少部分地使用所确定的与所述媒体数据相关的一个或多个上下文属性显示所述媒体数据。 此外,提供了一种方法,其包括在装置上接收媒体数据,将媒体数据解析为一个或多个结构化元素,确定与媒体数据的一个或多个结构化元素相关的一个或多个信息链接, 至少部分地,与被确定为与一个或多个结构化元素相关联的一个或多个信息链接显示媒体数据。

    Method, device, and computer program product for multi-lingual speech recognition
    63.
    发明授权
    Method, device, and computer program product for multi-lingual speech recognition 有权
    用于多语言语音识别的方法,设备和计算机程序产品

    公开(公告)号:US07840399B2

    公开(公告)日:2010-11-23

    申请号:US11100991

    申请日:2005-04-07

    摘要: A method of multi-lingual speech recognition can include determining whether characters in a word are in a source list of a language-specific alphabet mapping table for a language, converting each character not in the source list according to a general alphabet mapping table, converting each converted character according to the language-specific alphabet mapping table, verifying that each character in the word is in a character set of the language, removing characters not in the character set of the language, and identifying a pronunciation of the word.

    摘要翻译: 一种多语言语音识别的方法可以包括确定单词中的字符是否在用于语言的语言特定字母映射表的源列表中,根据一般字母映射表转换不在源列表中的每个字符,转换 每个转换的字符根据语言特定的字母映射表,验证该单词中的每个字符是该语言的字符集,去除不在该语言的字符集中的字符,以及识别该单词的发音。

    Method, apparatus, mobile terminal and computer program product for providing data clustering and mode selection
    64.
    发明授权
    Method, apparatus, mobile terminal and computer program product for providing data clustering and mode selection 失效
    用于提供数据聚类和模式选择的方法,装置,移动终端和计算机程序产品

    公开(公告)号:US07725411B2

    公开(公告)日:2010-05-25

    申请号:US11396831

    申请日:2006-04-03

    IPC分类号: G06E1/00 G06E3/00 G06G7/00

    摘要: An apparatus for providing data clustering and mode selection includes a training element and a transformation element. The training element is configured to receive a first training data set, a second training data set and auxiliary data extracted from the same material as the first training data set. The training element is also configured to train a classifier to group the first training data set into M clusters based on the auxiliary data and the first training data set and train M processing schemes corresponding to the M clusters for transforming the first training data set into the second training data set. The transformation element is in communication with the training element and is configured to cluster the second training data set into M clusters based on features associated with the second training data set.

    摘要翻译: 一种用于提供数据聚类和模式选择的装置包括训练元素和变换元素。 训练元件被配置为接收从与第一训练数据集相同的材料提取的第一训练数据集,第二训练数据集和辅助数据。 训练元素还被配置为训练分类器,以基于辅助数据和第一训练数据集以及对应于M个簇的训练M个处理方案将第一训练数据集合分组成M个群集,以将第一训练数据集变换为 第二训练数据集。 转换元件与训练元素通信,并且被配置为基于与第二训练数据集相关联的特征将第二训练数据集聚集成M个群集。

    METHODS, APPARATUSES, AND COMPUTER PROGRAM PRODUCTS FOR PROVIDING A MIXED LANGUAGE ENTRY SPEECH DICTATION SYSTEM
    65.
    发明申请
    METHODS, APPARATUSES, AND COMPUTER PROGRAM PRODUCTS FOR PROVIDING A MIXED LANGUAGE ENTRY SPEECH DICTATION SYSTEM 审中-公开
    用于提供混合语言入口词汇系统的方法,设备和计算机程序产品

    公开(公告)号:US20090326945A1

    公开(公告)日:2009-12-31

    申请号:US12146987

    申请日:2008-06-26

    申请人: Jilei Tian

    发明人: Jilei Tian

    IPC分类号: G10L15/04 G10L15/18

    摘要: An apparatus may include a processor configured to receive vocabulary entry data. The processor may be further configured to determine a class for the received vocabulary entry data. The processor may be additionally configured to identify one or more languages for the vocabulary entry data based upon the determined class. The processor may also be configured to generate a phoneme sequence for the vocabulary entry data for each identified language. Corresponding methods and computer program products are also provided.

    摘要翻译: 设备可以包括被配置为接收词汇条目数据的处理器。 处理器还可以被配置为确定所接收的词汇条目数据的类别。 处理器可以被附加地配置为基于所确定的类来识别用于词汇条目数据的一种或多种语言。 处理器还可以被配置为为每个识别的语言的词汇表数据生成音素序列。 还提供了相应的方法和计算机程序产品。

    Hybrid Approach in Voice Conversion
    66.
    发明申请
    Hybrid Approach in Voice Conversion 失效
    语音转换中的混合方法

    公开(公告)号:US20090171657A1

    公开(公告)日:2009-07-02

    申请号:US11966255

    申请日:2007-12-28

    IPC分类号: G10L19/04

    CPC分类号: G10L21/00 G10L2021/0135

    摘要: A hybrid approach is described for combining frequency warping and Gaussian Mixture Modeling (GMM) to achieve better speaker identity and speech quality. To train the voice conversion GMM model, line spectral frequency and other features are extracted from a set of source sounds to generate a source feature vector and from a set of target sounds to generate a target feature vector. The GMM model is estimated based on the aligned source feature vector and the target feature vector. A mixture specific warping function is generated each set of mixture mean pairs of the GMM model, and a warping function is generated based on a weighting of each of the mixture specific warping functions. The warping function can be used to convert sounds received from a source speaker to approximate speech of a target speaker.

    摘要翻译: 描述了混合方法,用于组合频率扭曲和高斯混合建模(GMM),以实现更好的扬声器身份和语音质量。 为了训练语音转换GMM模型,从一组源声音中提取线谱频率和其他特征以产生源特征向量和从一组目标声音生成目标特征向量。 基于对齐的源特征向量和目标特征向量来估计GMM模型。 每个GMM模型的混合均值对都产生混合特定的翘曲函数,并且基于每个混合特定翘曲函数的加权产生翘曲函数。 翘曲功能可用于将从源扬声器接收的声音转换为目标扬声器的近似语音。

    Method, apparatus, mobile terminal and computer program product for providing efficient evaluation of feature transformation
    67.
    发明授权
    Method, apparatus, mobile terminal and computer program product for providing efficient evaluation of feature transformation 有权
    方法,装置,移动终端和计算机程序产品,用于提供特征转换的有效评估

    公开(公告)号:US07480641B2

    公开(公告)日:2009-01-20

    申请号:US11400629

    申请日:2006-04-07

    摘要: An apparatus for providing efficient evaluation of feature transformation includes a training module and a transformation module. The training module is configured to train a Gaussian mixture model (GMM) using training source data and training target data. The transformation module is in communication with the training module. The transformation module is configured to produce a conversion function in response to the training of the GMM. The training module is further configured to determine a quality of the conversion function prior to use of the conversion function by calculating a trace measurement of the GMM.

    摘要翻译: 用于提供特征变换的有效评估的装置包括训练模块和变换模块。 训练模块被配置为使用训练源数据和训练目标数据训练高斯混合模型(GMM)。 变换模块与训练模块通信。 转换模块被配置为响应于GMM的训练而产生转换功能。 训练模块还被配置为通过计算GMM的跟踪测量来确定在使用转换功能之前的转换功能的质量。

    Method, apparatus and computer program product for providing voice conversion using temporal dynamic features
    68.
    发明申请
    Method, apparatus and computer program product for providing voice conversion using temporal dynamic features 有权
    用于使用时间动态特征提供语音转换的方法,装置和计算机程序产品

    公开(公告)号:US20080262838A1

    公开(公告)日:2008-10-23

    申请号:US11788263

    申请日:2007-04-17

    IPC分类号: G10L21/00

    CPC分类号: G10L13/033

    摘要: An apparatus for providing voice conversion using temporal dynamic features includes a feature extractor and a transformation element. The feature extractor may be configured to extract dynamic feature vectors from source speech. The transformation element may be in communication with the feature extractor and configured to apply a first conversion function to a signal including the extracted dynamic feature vectors to produce converted dynamic feature vectors. The first conversion function may have been trained using at least dynamic feature data associated with training source speech and training target speech. The transformation element may be further configured to produce converted speech based on an output of applying the first conversion function.

    摘要翻译: 用于使用时间动态特征提供语音转换的装置包括特征提取器和变换元件。 特征提取器可以被配置为从源语音提取动态特征向量。 变换元件可以与特征提取器通信并且被配置为将第一转换函数应用于包括所提取的动态特征向量的信号以产生转换的动态特征向量。 可以使用至少与训练源语音和训练目标语音相关联的动态特征数据来训练第一转换功能。 转换元件还可以被配置为基于应用第一转换函数的输出来产生转换的语音。

    Voice Conversion Training and Data Collection
    69.
    发明申请
    Voice Conversion Training and Data Collection 有权
    语音转换培训和数据收集

    公开(公告)号:US20080255827A1

    公开(公告)日:2008-10-16

    申请号:US11733329

    申请日:2007-04-10

    IPC分类号: G10L19/00

    摘要: It may be desirable to provide a way to collect high quality speech training data without undue burden to the user. Speech training data may be collected during normal usage of a device. In this way, the collection of speech training data may be effectively transparent to the user, without the need for a distinct collection mode from the user's point of view. For example, where the device is or includes a phone (such as a cellular phone), when the user makes or receives a phone call to/from another party, speech training data may be automatically collected from one or both of the parties during the phone call.

    摘要翻译: 可能需要提供一种收集高质量语音训练数据的方式,而不会对用户造成不必要的负担。 在设备的正常使用期间可以收集语音训练数据。 以这种方式,语音训练数据的收集对于用户来说可能是有效的透明的,而不需要用户的观点的不同的收集模式。 例如,当设备是或包括电话(例如蜂窝电话)时,当用户向/从另一方进行或接收电话呼叫时,可以在一方或两方当中自动收集语音训练数据 电话。