Sequential multimodal input
    81.
    Granted invention patent
    Status: not in force

    Publication No.: US07158779B2

    Publication date: 2007-01-02

    Application No.: US10705155

    Filing date: 2003-11-11

    IPC classification: H04Q7/22

    Abstract: A method of interacting with a client/server architecture using a 2G mobile phone is provided. The 2G phone includes a data channel for transmitting data and a voice channel for transmitting speech. The method includes receiving a web page from a web server pursuant to an application through the data channel and rendering the web page on the 2G phone. Speech is received from the user corresponding to at least one data field on the web page. A call is established from the 2G phone to a telephony server over the voice channel. The telephony server is remote from the 2G phone and is adapted to process speech. The telephony server obtains a speech-enabled web page from the web server corresponding to the web page provided to the 2G phone. Speech is transmitted from the 2G phone to the telephony server. The speech is processed in accordance with the speech-enabled web page to obtain textual data. The textual data is transmitted to the web server. The 2G phone obtains a new web page through the data channel and renders the new web page having the textual data.

    Method and apparatus for performing plan-based dialog

    Publication No.: US06785651B1

    Publication date: 2004-08-31

    Application No.: US09662242

    Filing date: 2000-09-14

    Applicant: Kuansan Wang

    Inventor: Kuansan Wang

    IPC classification: G10L15/00

    Abstract: The present invention provides a dialog system in which the subsystems are integrated under a single technology model. In particular, each of the sub-systems uses stochastic modeling to identify a probability for its respective output. The combined probabilities identify a most probable action to be taken by the dialog system given the latest input from the user and the past dialog states. An additional aspect of the present invention is an embodiment in which the sub-systems communicate with one another through XML pages, thus allowing the sub-systems to be distributed across a distributed network.

    Temporal pattern recognition method and apparatus utilizing segment and frame-based models
    84.
    Granted invention patent
    Status: in force

    Publication No.: US06662158B1

    Publication date: 2003-12-09

    Application No.: US09560506

    Filing date: 2000-04-27

    IPC classification: G10L15/04

    CPC classification: G10L15/148

    Abstract: A method and apparatus are provided for identifying patterns from a series of feature vectors representing a time-varying signal. The method and apparatus use both a frame-based model and a segment model in a unified framework. The frame-based model determines the probability of an individual feature vector given a frame state. The segment model determines the probability of sub-sequences of feature vectors given a single segment state. The probabilities from the frame-based model and the segment model are then combined to form a single path score that is indicative of the probability of a sequence of patterns. Another aspect of the invention is the use of a frame-based model and a segment model to segment feature vectors during model training. Under this aspect of the invention, the frame-based model and the segment model are used together to identify probabilities associated with different segmentations.

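    Illustrative sketch: the abstract above describes combining frame-level and segment-level probabilities into a single path score. The Python below is a minimal toy version of that idea, not the patented method. Everything specific in it is an assumption made for illustration: one-dimensional features, Gaussian models, the state means in `STATE_MEANS`, scoring a segment by its mean, and combining the two models by adding log-probabilities.

```python
import math

# Hypothetical per-state means; the abstract does not specify any model form.
STATE_MEANS = {"a": 0.0, "b": 5.0}

def log_gauss(x, mean, var=1.0):
    """Log-density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def frame_score(frames, state):
    """Frame-based model: each feature is scored individually given the state."""
    return sum(log_gauss(x, STATE_MEANS[state]) for x in frames)

def segment_score(frames, state):
    """Segment model (assumed form): score the whole sub-sequence via its mean."""
    seg_mean = sum(frames) / len(frames)
    return log_gauss(seg_mean, STATE_MEANS[state], var=1.0 / len(frames))

def path_score(features, segmentation):
    """Add frame and segment log-probabilities over all (state, start, end) segments."""
    total = 0.0
    for state, start, end in segmentation:
        frames = features[start:end]
        total += frame_score(frames, state) + segment_score(frames, state)
    return total

features = [0.1, -0.2, 0.0, 5.1, 4.9, 5.2]
candidates = {
    "split_at_2": [("a", 0, 2), ("b", 2, 6)],
    "split_at_3": [("a", 0, 3), ("b", 3, 6)],
    "split_at_4": [("a", 0, 4), ("b", 4, 6)],
}
# Pick the candidate segmentation with the highest combined path score.
best = max(candidates, key=lambda name: path_score(features, candidates[name]))
print(best)
```

    Comparing candidate segmentations by one combined score is also the shape of the training-time use mentioned in the abstract, where both models together rate different segmentations.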

    Entity recognition using probabilities for out-of-collection data
    85.
    Granted invention patent
    Status: in force

    Publication No.: US09104979B2

    Publication date: 2015-08-11

    Application No.: US13162563

    Filing date: 2011-06-16

    IPC classification: G06F15/18 G06N99/00

    CPC classification: G06N99/005

    Abstract: A classifier that disambiguates among entities based on a dictionary, such as a corpus of documents about those entities, is built by incorporating probabilities that an entity exists that is not in the dictionary. Given a document, the classifier associates it with an entity. By incorporating out-of-collection probabilities into the classifier, a higher level of confidence in the match between an entity and a document is achieved.

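    Illustrative sketch: one way to picture the out-of-collection idea in the abstract above is a classifier with an extra class for "some entity not in the dictionary". The Python below is a toy under stated assumptions, not the patented classifier: smoothed unigram models per entity, a background model standing in for unknown entities, and a hand-picked prior `ooc_prior` are all hypothetical choices, as are the example entities and texts.

```python
import math
from collections import Counter

OOC = "<out-of-collection>"  # hypothetical label for "entity not in the dictionary"

def unigram_model(tokens, vocab, alpha=1.0):
    """Add-alpha smoothed unigram probabilities over a fixed vocabulary."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}

def classify(doc, entity_models, background, ooc_prior=0.2):
    """Return (best label, posterior), with the out-of-collection class competing."""
    known_prior = (1.0 - ooc_prior) / len(entity_models)
    log_scores = {
        name: math.log(known_prior) + sum(math.log(model[w]) for w in doc if w in model)
        for name, model in entity_models.items()
    }
    log_scores[OOC] = math.log(ooc_prior) + sum(
        math.log(background[w]) for w in doc if w in background
    )
    top = max(log_scores.values())
    norm = sum(math.exp(s - top) for s in log_scores.values())
    posteriors = {name: math.exp(s - top) / norm for name, s in log_scores.items()}
    best = max(posteriors, key=posteriors.get)
    return best, posteriors[best]

# Toy dictionary: two known entities plus unrelated text for the background model.
car_text = "car engine speed car luxury".split()
animal_text = "cat jungle prey cat spots".split()
other_text = "guitar band album guitar tour".split()
vocab = set(car_text + animal_text + other_text)

entity_models = {
    "Jaguar (car)": unigram_model(car_text, vocab),
    "Jaguar (animal)": unigram_model(animal_text, vocab),
}
background = unigram_model(car_text + animal_text + other_text, vocab)

in_collection = classify("car engine".split(), entity_models, background)
out_of_collection = classify("guitar guitar band tour".split(), entity_models, background)
print(in_collection, out_of_collection)
```

    Because the out-of-collection class takes part in the normalization, a document that fits no known entity is not forced onto the least-bad match, and the posterior reported for a genuine match reflects that alternative.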

    Auto answer in voice over internet protocol
    86.
    Granted invention patent
    Status: in force

    Publication No.: US09025587B2

    Publication date: 2015-05-05

    Application No.: US11504924

    Filing date: 2006-08-16

    Abstract: An auto-answer feature is implemented in SIP by configuring a receiving device to automatically acknowledge and answer an incoming call or session from a specific trusted third party. The receiving device may skip to an OK response to an INVITE request when the call is routed through the trusted third party. When the device can automatically answer the incoming call, advanced features such as Push To Talk, Information Tone, Click to Call, and Remote Monitoring may be easily implemented.

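    Illustrative sketch: the decision described in the abstract above (skip straight to an OK response when the INVITE arrives through a trusted third party) can be pictured as the small Python function below. This is a simplified model, not a SIP stack and not the patented implementation. The allow-list `TRUSTED_PARTIES`, the host names, and the assumption that routing is recognized from the hosts listed in the request's Via headers are all hypothetical.

```python
# Hypothetical allow-list of trusted third parties (not taken from the patent).
TRUSTED_PARTIES = {"callcontrol.example.com"}

def respond_to_invite(via_hosts, user_accepts, trusted=TRUSTED_PARTIES):
    """Return the SIP responses a receiving device sends for one INVITE.

    via_hosts: hosts the request passed through (assumed to come from Via headers).
    user_accepts: whether the user picks up when the device rings.
    """
    if any(host in trusted for host in via_hosts):
        # Auto-answer: no ringing phase, answer immediately.
        return ["200 OK"]
    # Ordinary call: ring first, then answer or decline based on the user.
    final = "200 OK" if user_accepts else "603 Decline"
    return ["180 Ringing", final]

auto = respond_to_invite(
    ["proxy.example.net", "callcontrol.example.com"], user_accepts=False
)
manual = respond_to_invite(["proxy.example.net"], user_accepts=True)
print(auto, manual)
```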

    Assessing gateway quality using audio systems
    87.
    Granted invention patent
    Status: in force

    Publication No.: US08599704B2

    Publication date: 2013-12-03

    Application No.: US11656606

    Filing date: 2007-01-23

    Abstract: Described is the automatic testing of the quality of an audio channel between a caller and a callee, where the channel includes a device under test such as a VoIP or other gateway. An analyzer receives timestamps from a caller and callee during a calling session, including timestamps for when the callee initially provides audio (e.g., speech) to the caller, when the caller initially detects sound, when the caller initially provides audio to the callee, and when the callee initially detects sound. The analyzer uses the relative timing of the timestamps and the speech recognizer's outcome to determine whether the audio channel is experiencing interference or echo. When the audio includes speech, a confidence level corresponding to accuracy of speech recognition also may establish the audio channel's quality. Random selection and timing of output may be employed, such as to vary the testing patterns during repetitive tests.

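    Illustrative sketch: the abstract above lists four timestamps and says their relative timing indicates interference or echo, without giving the rule. The Python below shows one plausible rule, purely as an assumption: if a party detects sound before the far end has produced any, the sound is labeled echo when that party had already been speaking itself, and interference otherwise. The class and function names are hypothetical, and the speech-recognition confidence mentioned in the abstract is left out.

```python
from dataclasses import dataclass

@dataclass
class CallTimestamps:
    """Seconds from call start; field names are hypothetical."""
    callee_speaks: float  # callee first provides audio
    caller_hears: float   # caller first detects sound
    caller_speaks: float  # caller first provides audio
    callee_hears: float   # callee first detects sound

def diagnose(ts):
    """Flag parties that detected sound before the far end produced any."""
    findings = []
    for listener, heard, remote_spoke, own_spoke in (
        ("caller", ts.caller_hears, ts.callee_speaks, ts.caller_speaks),
        ("callee", ts.callee_hears, ts.caller_speaks, ts.callee_speaks),
    ):
        if heard < remote_spoke:
            # Assumed rule: own earlier speech suggests echo, otherwise interference.
            cause = "echo" if own_spoke <= heard else "interference"
            findings.append(f"{listener}: {cause}")
    return findings or ["ok"]

clean = diagnose(CallTimestamps(0.0, 0.2, 3.0, 3.2))
echo = diagnose(CallTimestamps(0.0, 0.2, 3.0, 0.3))
noise = diagnose(CallTimestamps(2.0, 0.5, 5.0, 5.2))
print(clean, echo, noise)
```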

    Adaptive construction of a statistical language model
    88.
    Granted invention patent
    Status: in force

    Publication No.: US08577670B2

    Publication date: 2013-11-05

    Application No.: US12684749

    Filing date: 2010-01-08

    IPC classification: G06F17/27

    Abstract: A statistical language model (SLM) may be iteratively refined by considering N-gram counts in new data, and blending the information contained in the new data with the existing SLM. A first group of documents is evaluated to determine the probabilities associated with the different N-grams observed in the documents. An SLM is constructed based on these probabilities. A second group of documents is then evaluated to determine the probabilities associated with each N-gram in that second group. The existing SLM is then evaluated to determine how well it explains the probabilities in the second group of documents, and a weighting parameter is calculated from that evaluation. Using the weighting parameter, a new SLM is then constructed as a weighted average of the existing SLM and the new probabilities.

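    Illustrative sketch: the refinement loop in the abstract above (estimate probabilities from new data, measure how well the existing SLM explains that data, derive a weight, take a weighted average) is sketched below in Python. It uses unigrams (N = 1) with add-alpha smoothing to stay short. The abstract does not give the formula for the weighting parameter, so the one used here, the existing model's geometric-mean per-token likelihood relative to that of the new estimate, is an assumption chosen only so that a better fit yields a larger weight.

```python
import math
from collections import Counter

def estimate(tokens, vocab, alpha=0.5):
    """Add-alpha smoothed unigram probabilities over a shared vocabulary."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}

def avg_log_likelihood(model, tokens):
    """Mean per-token log-probability of the tokens under the model."""
    return sum(math.log(model[w]) for w in tokens) / len(tokens)

def adapt(old_slm, new_tokens, vocab):
    """Blend the existing SLM with probabilities estimated from new data."""
    new_probs = estimate(new_tokens, vocab)
    fit_old = math.exp(avg_log_likelihood(old_slm, new_tokens))
    fit_new = math.exp(avg_log_likelihood(new_probs, new_tokens))
    weight = fit_old / (fit_old + fit_new)  # assumed formula, see lead-in
    blended = {w: weight * old_slm[w] + (1 - weight) * new_probs[w] for w in vocab}
    return blended, weight

first_group = "the cat sat on the mat".split()
similar_group = "the cat sat on the mat the cat".split()
different_group = "stock prices fell sharply today".split()
vocab = set(first_group + similar_group + different_group)

slm = estimate(first_group, vocab)
blend_similar, weight_similar = adapt(slm, similar_group, vocab)
blend_different, weight_different = adapt(slm, different_group, vocab)
print(round(weight_similar, 3), round(weight_different, 3))
```

    With this choice the existing model keeps more influence when the second group resembles the first and less when it does not, and the blended probabilities still sum to one because both inputs do.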

    Inferring view sequence and relevance data
    89.
    Granted invention patent
    Status: in force

    Publication No.: US08489533B2

    Publication date: 2013-07-16

    Application No.: US12499092

    Filing date: 2009-07-08

    IPC classification: G06F17/00

    CPC classification: G06F9/451

    Abstract: Technologies pertaining to inferring a view sequence of a user are described herein. A view sequence is the order in which graphical objects on a graphical user interface are viewed by a user. A view sequence with respect to graphical objects presented on a graphical user interface is inferred based upon historically observed user actions, such as selection of a link or hovering over respective graphical objects. The view sequence is inferred without employment of sensor equipment that tracks eye movements of users.

    Distributed speech service
    90.
    Granted invention patent
    Status: not in force

    Publication No.: US08396973B2

    Publication date: 2013-03-12

    Application No.: US11058892

    Filing date: 2005-02-16

    Applicant: Kuansan Wang

    Inventor: Kuansan Wang

    IPC classification: G06F15/16 G06F15/177

    Abstract: The present invention relates to establishing a media channel and a signaling channel between a client and a server. The media channel uses a chosen codec and protocol for communication. Through the media channel and signaling channel, an application on the client can utilize speech services on the server.
