Iteratively Locating A Position Corresponding To A Desired Seek Time
    15.
    Patent application
    Legal status: active

    Publication No.: US20080276173A1

    Publication date: 2008-11-06

    Application No.: US11743482

    Filing date: 2007-05-02

    IPC classification: G06F3/00

    Abstract: Techniques enable locating a position within a file that corresponds to a desired seek time without having access to an index specifying the desired seek time's position. An iterative process may be used to estimate the position that corresponds to the desired seek time. The process may iterate through multiple estimations until a difference between a time corresponding to an estimated position and the desired seek time is within an acceptable amount or until the process reaches an iteration threshold. The file may then be played beginning at or near the desired seek time. The techniques may therefore allow a user to seek within a file while the user progressively downloads or streams the file.
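
The iterative process described in the abstract can be sketched as follows. This is a minimal illustration rather than the patented method: the abstract does not say how each estimate is produced, so the sketch assumes linear interpolation between the closest known positions. `time_at` is a hypothetical callback that returns the timestamp found at a byte offset (for example, by reading a nearby packet header), and `simulated_time_at` is a toy stand-in for a variable-bitrate file.

```python
from typing import Callable, Tuple


def estimate_seek_position(
    time_at: Callable[[int], float],
    file_size: int,
    duration: float,
    target_time: float,
    tolerance: float = 0.5,
    max_iterations: int = 10,
) -> Tuple[int, float, int]:
    """Return (byte position, time at that position, iterations used)."""
    # Known bounds: start of file and end of file.
    lo_pos, lo_time = 0, 0.0
    hi_pos, hi_time = file_size, duration

    # First estimate assumes a constant bitrate (an assumption of this sketch).
    pos = int(file_size * target_time / duration)
    found = time_at(pos)
    iterations = 1

    # Stop when close enough or when the iteration threshold is reached.
    while abs(found - target_time) > tolerance and iterations < max_iterations:
        if found < target_time:
            lo_pos, lo_time = pos, found
        else:
            hi_pos, hi_time = pos, found
        if hi_time <= lo_time:
            break  # bounds carry no usable time span; give up refining
        fraction = (target_time - lo_time) / (hi_time - lo_time)
        pos = lo_pos + int((hi_pos - lo_pos) * fraction)
        found = time_at(pos)
        iterations += 1

    return pos, found, iterations


def simulated_time_at(pos: int) -> float:
    """Toy variable-bitrate file: 1,000,000 bytes lasting 100 seconds."""
    return 100.0 * (pos / 1_000_000) ** 2
```

Calling `estimate_seek_position(simulated_time_at, 1_000_000, 100.0, 25.0)` refines the estimate until the time found is within the default half-second tolerance; passing a small `max_iterations` shows the iteration threshold ending the search early.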


    Method for adding phonetic descriptions to a speech recognition lexicon
    16.
    Granted patent
    Legal status: lapsed

    Publication No.: US06973427B2

    Publication date: 2005-12-06

    Application No.: US09748453

    Filing date: 2000-12-26

    CPC classification: G10L15/063 G10L2015/0636

    Abstract: A method and computer-readable medium convert the text of a word and a user's pronunciation of the word into a phonetic description to be added to a speech recognition lexicon. Initially, two possible phonetic descriptions are generated. One phonetic description is formed from the text of the word. The other phonetic description is formed by decoding a speech signal representing the user's pronunciation of the word. Both phonetic descriptions are scored based on their correspondence to the user's pronunciation. The phonetic description with the highest score is then selected for entry in the speech recognition lexicon.
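
The selection step in the abstract (score each candidate, keep the best) might look like the sketch below. Everything here is illustrative: the phone sequences and the `toy_scores` table are made up, and a real system would compute each score from the recorded speech signal rather than look it up.

```python
from typing import Callable, Sequence, Tuple

Phones = Tuple[str, ...]


def select_phonetic_description(
    candidates: Sequence[Phones],
    score: Callable[[Phones], float],
) -> Phones:
    """Return the candidate whose score against the user's speech is highest."""
    if not candidates:
        raise ValueError("at least one candidate description is required")
    return max(candidates, key=score)


# Hypothetical candidates: one derived from the spelling, one from decoded speech.
from_text: Phones = ("t", "ah", "m", "ey", "t", "ow")
from_speech: Phones = ("t", "ah", "m", "aa", "t", "ow")

# Made-up scores standing in for an acoustic match against the user's recording.
toy_scores = {from_text: -42.0, from_speech: -35.5}

best = select_phonetic_description([from_text, from_speech], toy_scores.__getitem__)
```

With these made-up scores, `best` is the speech-derived description, which would then be the entry added to the lexicon.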


    Media foundation media sink
    18.
    Granted patent
    Legal status: active

    Publication No.: US07725920B2

    Publication date: 2010-05-25

    Application No.: US10608869

    Filing date: 2003-06-27

    CPC classification: H04L65/604 H04L67/02

    Abstract: A method and system provides interfaces, data structures and events for representing a “sink” of multimedia data to interact with objects in a multimedia system to control multimedia objects. The interfaces and data structures enable efficient management for media objects that must interface directly with each other. One embodiment is directed to providing a common interface and a single API to a plurality of media objects. In an embodiment, the API is a control layer that isolates the media objects from each other and provides a single point of control, allowing media objects to be added or removed without affecting any other media objects. The control layer allows users to become familiar with only one API instead of many thereby facilitating the tasks of programming and documentation.


    Content publication
    19.
    Granted patent
    Legal status: active

    Publication No.: US07680937B2

    Publication date: 2010-03-16

    Application No.: US11426820

    Filing date: 2006-06-27

    IPC classification: G06F15/173 G06F15/16

    Abstract: Publishing content using a peer-to-peer content distribution system is described. A publisher is required to request authorization to publish from an authorization body. Resources such as tracker and seed nodes are allocated in a peer-to-peer content distribution system and pre-processing of content to be published is carried out. For example, a content description is generated for each item of content as well as a set of checksums or other items for validating blocks of content. Publication can be terminated in a variety of different ways. For example, by using expiry methods, by active revocation of publishers, authorization bodies, or individual items of content.
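
The pre-processing step mentioned in the abstract (a content description plus per-block checksums) could be sketched as below. The dictionary layout, the tiny block size, and the choice of SHA-256 are assumptions made for illustration; the abstract does not specify any of them.

```python
import hashlib
from typing import Dict, List, Union

Description = Dict[str, Union[int, List[str]]]


def describe_content(data: bytes, block_size: int = 4) -> Description:
    """Split content into fixed-size blocks and record one checksum per block."""
    blocks = [data[i:i + block_size] for i in range(0, len(data), block_size)]
    return {
        "length": len(data),
        "block_size": block_size,
        "checksums": [hashlib.sha256(block).hexdigest() for block in blocks],
    }


def validate_block(description: Description, index: int, block: bytes) -> bool:
    """Check a received block against the checksum published for its index."""
    checksums = description["checksums"]
    assert isinstance(checksums, list)
    return hashlib.sha256(block).hexdigest() == checksums[index]


# Publisher side: describe a 12-byte item as three 4-byte blocks.
description = describe_content(b"hello world!", block_size=4)
```

A peer that receives block 1 would call `validate_block(description, 1, received)` and discard the block if the check fails.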


    Method and apparatus for constructing and using syllable-like unit language models
    20.
    Granted patent
    Legal status: active

    Publication No.: US07676365B2

    Publication date: 2010-03-09

    Application No.: US11110602

    Filing date: 2005-04-20

    IPC classification: G10L15/06

    CPC classification: G10L15/063 G10L2015/0636

    Abstract: A method and computer-readable medium use syllable-like units (SLUs) to decode a pronunciation into a phonetic description. The syllable-like units are generally larger than a single phoneme but smaller than a word. The present invention provides a means for defining these syllable-like units and for generating a language model based on these syllable-like units that can be used in the decoding process. As SLUs are longer than phonemes, they contain more acoustic contextual clues and better lexical constraints for speech recognition. Thus, the phoneme accuracy produced from SLU recognition is much better than all-phone sequence recognition.
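
To make the idea of a language model over syllable-like units concrete, the sketch below trains a bigram model with add-one smoothing on words that are already segmented into units. Both the bigram form and the smoothing are assumptions chosen for illustration (the abstract does not say what kind of model is built), and the unit strings in the toy corpus are invented.

```python
from collections import Counter
from typing import Callable, Sequence


def train_bigram_model(
    sequences: Sequence[Sequence[str]],
) -> Callable[[str, str], float]:
    """Return prob(prev, unit): add-one smoothed bigram probability."""
    context_counts: Counter = Counter()
    bigram_counts: Counter = Counter()
    for units in sequences:
        padded = ["<s>", *units, "</s>"]
        context_counts.update(padded[:-1])          # every unit that has a successor
        bigram_counts.update(zip(padded, padded[1:]))

    # Possible successors: every observed unit plus the end marker.
    vocab = {unit for units in sequences for unit in units} | {"</s>"}

    def prob(prev: str, unit: str) -> float:
        return (bigram_counts[(prev, unit)] + 1) / (context_counts[prev] + len(vocab))

    return prob


# Toy corpus: each word is already split into invented syllable-like units.
corpus = [["k ae", "t"], ["k ae", "p"], ["k ae", "t"]]
prob = train_bigram_model(corpus)
```

In this toy corpus `prob("k ae", "t")` is larger than `prob("k ae", "p")`, because "t" follows "k ae" twice and "p" only once; a decoder could use such probabilities to prefer likelier unit sequences.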
