VOICE TRANSFORMATION WITH ENCODED INFORMATION
    2.
    发明申请
    VOICE TRANSFORMATION WITH ENCODED INFORMATION 有权
    语音转换与编码信息

    公开(公告)号:US20120239387A1

    公开(公告)日:2012-09-20

    申请号:US13049924

    申请日:2011-03-17

    IPC分类号: G10L19/02

    CPC分类号: G10L21/003 G10L19/018

    摘要: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.

    摘要翻译: 提供语音转换的方法,系统和计算机程序产品。 该方法包括使用变换参数来变换源语言,以及使用隐写术对输入语音中的变换参数对信息进行编码,其中可以使用输出语音和关于变换参数的信息来重构源语音。 还提供了一种用于重建语音变换的方法,包括:接收语音转换系统的输出语音,其中输出语音是使用隐写术编码关于变换参数的信息的变换语音; 提取变换参数信息; 并执行输出语音的逆变换以获得原始源语音的近似。

    Document session replay for multimodal applications
    3.
    发明授权
    Document session replay for multimodal applications 有权
    文档会话重播多模态应用程序

    公开(公告)号:US07801728B2

    公开(公告)日:2010-09-21

    申请号:US11678830

    申请日:2007-02-26

    IPC分类号: G10L21/00 G06F3/16

    CPC分类号: G10L15/26 G10L15/22

    摘要: Methods, apparatus, and computer program products are described for document session replay for multimodal applications. including identifying, by a multimodal browser in dependence upon a log produced by a Form Interpretation Algorithm (‘FIA’) during a previous document session with a user, a speech prompt provided by a multimodal application in the previous document session; identifying, by a multimodal browser in replay mode in dependence upon the log, a response to the prompt provided by a user of the multimodal application in the previous document session; retrieving, by the multimodal browser in dependence upon the log, an X+V page of the multimodal application associated with the speech prompt and the response; rendering, by the multimodal browser, the visual elements of the retrieved X+V page; replaying, by the multimodal browser, the speech prompt; and replaying, by a multimodal browser, the response.

    摘要翻译: 描述用于多模式应用的文档会话重放的方法,装置和计算机程序产品。 包括通过多模式浏览器根据在与用户的先前文档会话期间由表单解释算法(“FIA”)产生的日志来识别由多模式应用在先前文档会话中提供的语音提示; 通过多模式浏览器根据日志在重放模式下识别由先前文档会话中的多模式应用的用户提供的提示的响应; 通过多模式浏览器根据日志检索与语音提示相关联的多模式应用的X + V页面和响应; 通过多模式浏览器呈现检索到的X + V页面的视觉元素; 由多模式浏览器重播演讲提示; 并通过多模式浏览器重播响应。

    CDMA multi-user detection with a real symbol constellation

    公开(公告)号:US07035354B2

    公开(公告)日:2006-04-25

    申请号:US09917837

    申请日:2001-07-31

    IPC分类号: H04L27/06

    CPC分类号: H04L25/067

    摘要: A method for multi-user detection includes receiving a complex input signal due to a superposition of waveforms encoding symbols in a real-valued constellation, which are transmitted respectively by a plurality of transmitters in a common frequency band. The complex input signal is sampled at sampling intervals over the duration of an observation period to provide a sequence of complex samples. The sequence of complex samples is processed to determine soft decision values corresponding to the symbols transmitted by the plurality of the transmitters in the observation period, while constraining the soft decision values to be real values. The soft decision values are then projected onto the constellation to estimate the transmitted symbols.

    Voice transformation with encoded information
    6.
    发明授权
    Voice transformation with encoded information 有权
    具有编码信息的语音变换

    公开(公告)号:US08930182B2

    公开(公告)日:2015-01-06

    申请号:US13049924

    申请日:2011-03-17

    CPC分类号: G10L21/003 G10L19/018

    摘要: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.

    摘要翻译: 提供语音转换的方法,系统和计算机程序产品。 该方法包括使用变换参数来变换源语言,以及使用隐写术对输入语音中的变换参数对信息进行编码,其中可以使用输出语音和关于变换参数的信息来重构源语音。 还提供了一种用于重建语音变换的方法,包括:接收语音转换系统的输出语音,其中输出语音是使用隐写术编码关于变换参数的信息的变换语音; 提取变换参数信息; 并执行输出语音的逆变换以获得原始源语音的近似。

    Synchronization of Data Streams with Associated Metadata Streams
    7.
    发明申请
    Synchronization of Data Streams with Associated Metadata Streams 有权
    数据流与相关元数据流的同步

    公开(公告)号:US20130019121A1

    公开(公告)日:2013-01-17

    申请号:US13184542

    申请日:2011-07-17

    IPC分类号: G06F1/12

    摘要: Synchronizing a data stream with an associated metadata stream by receiving a data stream and a metadata stream having a plurality of metadata events associated with the data stream, identifying within the data stream a plurality of data events, matching each of the data events to one of the metadata events in accordance with a matching criterion, and synchronizing the data stream with the metadata stream by effecting a relative time shift between the metadata stream and the data stream in accordance with a time shift adjustment value that results in the smallest sum of absolute differences between time indices of each matched data event and metadata event.

    摘要翻译: 通过接收数据流和具有与数据流相关联的多个元数据事件的元数据流来同步数据流与相关联的元数据流,在数据流内识别多个数据事件,将每个数据事件匹配到 根据匹配标准的元数据事件,并且通过根据导致绝对差的最小和的时移调整值来实现元数据流和数据流之间的相对时移,使数据流与元数据流同步 在每个匹配的数据事件和元数据事件的时间索引之间。

    Vectorization in a SIMdD DSP architecture
    8.
    发明授权
    Vectorization in a SIMdD DSP architecture 失效
    SIMdD DSP架构中的向量化

    公开(公告)号:US07313788B2

    公开(公告)日:2007-12-25

    申请号:US10695970

    申请日:2003-10-29

    IPC分类号: G06F9/45 G06F7/00 G06F15/80

    CPC分类号: G06F8/41

    摘要: A method for determining vectorization configurations in a computer processor architecture, the method including identifying a vectorizable loop in a computer program, identifying a memory access pattern of data required for implementing the loop in the architecture, computing a set of candidate configurations of resources required for vectorizing the data in the architecture, where the computing step includes configuring a vector pointer register of the architecture in support of either of reorder-on-read use and reorder-on-write use of a vector element file of the architecture, selecting one of the candidates in accordance with predefined selection criteria, and implementing the selected vectorization configuration in the architecture.

    摘要翻译: 一种用于确定计算机处理器架构中的向量化配置的方法,所述方法包括识别计算机程序中的可矢量化循环,识别在所述架构中实现所述循环所需的数据的存储器访问模式,计算所述资源所需的一组候选配置 对架构中的数据进行向量化,其中计算步骤包括配置架构的向量指针寄存器,以支持对读取结构的重新排序使用和对该体系结构的向量元素文件进行重写顺序的使用, 候选人根据预定义的选择标准,并在架构中实现所选择的向量化配置。

    SYNCHRONIZING DISTRIBUTED SPEECH RECOGNITION
    9.
    发明申请
    SYNCHRONIZING DISTRIBUTED SPEECH RECOGNITION 有权
    同步分布式语音识别

    公开(公告)号:US20070265851A1

    公开(公告)日:2007-11-15

    申请号:US11382573

    申请日:2006-05-10

    IPC分类号: G10L11/00

    CPC分类号: G10L15/30

    摘要: Methods, apparatus, and computer program products are disclosed for synchronizing distributed speech recognition (‘DSR’) that include receiving in a DSR client notification from a voice server of readiness to conduct speech recognition and, responsive to the receiving, transmitting by the DSR client, from the DSR client to the voice server, speech for recognition.

    摘要翻译: 公开了用于同步分布式语音识别(“DSR”)的方法,装置和计算机程序产品,其包括在来自语音服务器的DSR客户端通知中接收准备进行语音识别的响应于DSR客户端的接收,发送 ,从DSR客户端到语音服务器,语音识别。

    Load-balancing metrics for adaptive dispatching of long asynchronous network requests
    10.
    发明申请
    Load-balancing metrics for adaptive dispatching of long asynchronous network requests 审中-公开
    用于自适应调度长异步网络请求的负载平衡度量

    公开(公告)号:US20070143460A1

    公开(公告)日:2007-06-21

    申请号:US11311790

    申请日:2005-12-19

    IPC分类号: G06F15/173

    摘要: Methods and systems are provided for load-balancing a data network, which is configured with a plurality of servers for servicing client requests asynchronously, and with a network dispatcher for assigning each new request to a selected server. The servers generate metrics indicative of their currently assigned workloads. The network dispatcher receives the metrics, and allocates requests according to weighted server probabilities reflecting the servers' capabilities and the metrics. Connections with the client are thereupon terminated, and reinstated after service of the request. The servers may be weighted in accordance with their respective capabilities, and the metrics adjusted by the weights.

    摘要翻译: 提供了用于负载平衡数据网络的方法和系统,所述数据网络被配置有用于异步地服务客户端请求的多个服务器,以及用于将每个新请求分配给所选择的服务器的网络调度器。 服务器产生指示其当前分配的工作负载的度量。 网络调度员接收度量,并根据反映服务器能力和度量的加权服务器概率分配请求。 与客户端的连接随后终止,并在请求服务后恢复。 服务器可以根据其各自的能力进行加权,并且根据权重来调整度量。