Automatic organization of documents through email clustering
    1.
    发明授权
    Automatic organization of documents through email clustering 有权
    通过电子邮件聚类自动组织文档

    公开(公告)号:US07765212B2

    公开(公告)日:2010-07-27

    申请号:US11321963

    申请日:2005-12-29

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06Q10/107 H04L51/00

    摘要: A system that facilitates organization of emails comprises a clustering component that clusters a plurality of emails and creates topics for emails by assigning key phrases extracted from emails within one or more clusters. An organization component then utilizes the key phrases to organize documents. Furthermore, the organization component can comprise a probability component that determines a probability that a document belongs to a certain topic.

    摘要翻译: 促进电子邮件组织的系统包括:聚类组件,其聚集多个电子邮件,并通过分配从一个或多个集群内的电子邮件中提取的关键短语为电子邮件创建主题。 组织组件然后利用关键短语组织文档。 此外,组织组件可以包括确定文档属于某个主题的概率的概率组件。

    AUTOMATIC GENERATION OF EMAIL PREVIEWS AND SUMMARIES
    2.
    发明申请
    AUTOMATIC GENERATION OF EMAIL PREVIEWS AND SUMMARIES 审中-公开
    电子邮件预览和概述的自动生成

    公开(公告)号:US20080281922A1

    公开(公告)日:2008-11-13

    申请号:US11746149

    申请日:2007-05-09

    IPC分类号: G06F15/16

    CPC分类号: G06F16/345

    摘要: An incoming electronic communication is broken down into message portions. Features of the message portions are extracted and the message portions are converted into sparse feature vectors. The probabilities of the message portions being of interest of the user are calculated and the message portions are converted back into text. Message portions with a relatively high probability of being of interest to a user are presented to the user as a summary.

    摘要翻译: 传入的电子通信被分解成消息部分。 提取消息部分的特征,并将消息部分转换为稀疏特征向量。 计算用户感兴趣的消息部分的概率,并将消息部分转换回文本。 作为概要,以用户的兴趣的较高概率的消息部分呈现给用户。

    Audio duplicate detector
    3.
    发明授权
    Audio duplicate detector 有权
    音频重复检测器

    公开(公告)号:US07421305B2

    公开(公告)日:2008-09-02

    申请号:US10785561

    申请日:2004-02-24

    IPC分类号: G06F17/00

    摘要: The present invention relates to a system and methodology to facilitate automatic management and pruning of audio files residing in a database. Audio fingerprinting is a powerful tool for identifying streaming or file-based audio, using a database of fingerprints. Duplicate detection identifies duplicate audio clips in a set, even if the clips differ in compression quality or duration. The present invention can be provided as a self-contained application that it does not require an external database of fingerprints. Also, a user interface provides various options for managing and pruning the audio files.

    摘要翻译: 本发明涉及一种便于自动管理和修剪驻留在数据库中的音频文件的系统和方法。 音频指纹是使用指纹数据库识别流媒体或基于文件的音频的强大工具。 重复的检测识别集合中的重复音频剪辑,即使剪辑在压缩质量或持续时间上有所不同。 本发明可以作为独立应用来提供,其不需要外部指纹数据库。 此外,用户界面提供了管理和修剪音频文件的各种选项。

    COMPRESSION OF BI-LEVEL IMAGES WITH EXPLICIT REPRESENTATION OF INK CLUSTERS
    5.
    发明申请
    COMPRESSION OF BI-LEVEL IMAGES WITH EXPLICIT REPRESENTATION OF INK CLUSTERS 审中-公开
    压缩图像的双层图像与墨盒的突出表现

    公开(公告)号:US20080175501A1

    公开(公告)日:2008-07-24

    申请号:US11966167

    申请日:2007-12-28

    IPC分类号: G06K9/36

    摘要: A system and method facilitating compression of bi-level images with explicit representation of ink clusters is provided. The present invention includes a cluster shape estimator that analyzes connected component information, extracts clusters and stores the cluster in a global dictionary, a page dictionary or a store of unclustered shapes. A bitmap estimation from clusters component determines dictionary positions for clusters stored in the global dictionary which are then encoded. A cluster position estimator determines page positions of clusters of the global dictionary and/or the page dictionary that are then encoded. Further, the global dictionary, the page dictionary and the store of unclustered shapes are also encoded.

    摘要翻译: 提供了一种利用墨簇的显式表示促进双层图像压缩的系统和方法。 本发明包括分析连接的分量信息的群集形状估计器,提取群集并且将群集存储在全局词典,页面字典或非群集形状的存储中。 来自簇组件的位图估计确定存储在全局字典中的簇的字典位置,然后对其进行编码。 集群位置估计器确定然后被编码的全局字典和/或页字典的集群的页面位置。 此外,还编码了全局字典,页字典和未分簇形状的存储。

    Segmented layered image system
    6.
    发明授权
    Segmented layered image system 有权
    分段分层图像系统

    公开(公告)号:US07376266B2

    公开(公告)日:2008-05-20

    申请号:US11465087

    申请日:2006-08-16

    IPC分类号: G06K9/00 G06K9/36

    CPC分类号: H04N1/403 G06K9/00456

    摘要: Systems and methods for encoding and decoding document images are disclosed. Document images are segmented into multiple layers according to a mask. The multiple layers are non-binary. The respective layers can then be processed and compressed separately in order to achieve better compression of the document image overall. A mask is generated from a document image. The mask is generated so as to reduce an estimate of compression for the combined size of the mask and multiple layers of the document image. The mask is then employed to segment the document image into the multiple layers. The mask determines or allocates pixels of the document image into respective layers. The mask and the multiple layers are processed and encoded separately so as to improve compression of the document image overall and to improve the speed of so doing. The multiple layers are non-binary images and can, for example, comprise a foreground image and a background image.

    摘要翻译: 公开了用于编码和解码文档图像的系统和方法。 根据掩码将文档图像分割成多个图层。 多层是非二进制的。 然后可以分别对各个层进行处理和压缩,以便对整个文件图像实现更好的压缩。 从文档图像生成蒙版。 生成掩模,以减少对于掩模和文档图像的多个层的组合大小的压缩估计。 然后使用掩模将文档图像分割成多个层。 掩模将文档图像的像素确定或分配到各个图层中。 掩模和多层被单独处理和编码,以便整体上改善文档图像的压缩并提高这样做的速度。 多层是非二进制图像,并且可以例如包括前景图像和背景图像。

    ADAPTIVE COMPRESSION OF MULTI-LEVEL IMAGES
    7.
    发明申请
    ADAPTIVE COMPRESSION OF MULTI-LEVEL IMAGES 有权
    多级图像的自适应压缩

    公开(公告)号:US20120014596A1

    公开(公告)日:2012-01-19

    申请号:US13156991

    申请日:2011-06-09

    IPC分类号: G06K9/36 G06K9/00

    摘要: The invention facilitates adaptive compression of multi-level images, such as captured digital images of a whiteboard, etc., encoding a bitstream comprising a color image component and a black-and-white image component. Either or both of a color and a black-and-white image can be output to a user based on user desires, receiving device capabilities, etc.

    摘要翻译: 本发明有助于对包括彩色图像分量和黑白图像分量的比特流进行编码的诸如捕获的白板等的数字图像的多级图像的自适应压缩。 可以基于用户期望,接收设备能力等向用户输出颜色和黑白图像中的一个或两者。

    Compression of bi-level images with explicit representation of ink clusters
    8.
    发明授权
    Compression of bi-level images with explicit representation of ink clusters 有权
    用油墨簇的显式表示压缩双层图像

    公开(公告)号:US07206450B2

    公开(公告)日:2007-04-17

    申请号:US10133532

    申请日:2002-04-25

    IPC分类号: G06K9/36 G06K9/46

    摘要: A system and method facilitating compression of bi-level images with explicit representation of ink clusters is provided. The present invention includes a cluster shape estimator that analyzes connected component information, extracts clusters and stores the cluster in a global dictionary, a page dictionary or a store of unclustered shapes. A bitmap estimation from clusters component determines dictionary positions for clusters stored in the global dictionary which are then encoded. A cluster position estimator determines page positions of clusters of the global dictionary and/or the page dictionary that are then encoded. Further, the global dictionary, the page dictionary and the store of unclustered shapes are also encoded.

    摘要翻译: 提供了一种利用墨簇的显式表示促进双层图像压缩的系统和方法。 本发明包括分析连接的分量信息的群集形状估计器,提取群集并且将群集存储在全局词典,页面字典或非群集形状的存储中。 来自簇组件的位图估计确定存储在全局字典中的簇的字典位置,然后对其进行编码。 集群位置估计器确定然后被编码的全局字典和/或页字典的集群的页面位置。 此外,还编码了全局字典,页字典和未分簇形状的存储。

    Segmented layered image system
    9.
    发明授权

    公开(公告)号:US07120297B2

    公开(公告)日:2006-10-10

    申请号:US10180169

    申请日:2002-06-26

    IPC分类号: G06K9/00 G06K9/36

    CPC分类号: H04N1/403 G06K9/00456

    摘要: Systems and methods for encoding and decoding document images are disclosed. Document images are segmented into multiple layers according to a mask. The multiple layers are non-binary. The respective layers can then be processed and compressed separately in order to achieve better compression of the document image overall. A mask is generated from a document image. The mask is generated so as to reduce an estimate of compression for the combined size of the mask and multiple layers of the document image. The mask is then employed to segment the document image into the multiple layers. The mask determines or allocates pixels of the document image into respective layers. The mask and the multiple layers are processed and encoded separately so as to improve compression of the document image overall and to improve the speed of so doing. The multiple layers are non-binary images and can, for example, comprise a foreground image and a background image.