-
公开(公告)号:US20060101069A1
公开(公告)日:2006-05-11
申请号:US11264805
申请日:2005-11-01
Applicant: James Bell , Margaret Beynon , Benjamin Delo , Andrew Flegg , Julian Friedman , Philipp Offermann
Inventor: James Bell , Margaret Beynon , Benjamin Delo , Andrew Flegg , Julian Friedman , Philipp Offermann
IPC: G06F17/00
CPC classification number: G06F17/2785 , Y10S707/99942 , Y10S707/99943 , Y10S707/99944
Abstract: A method for generating a set of one or more elements of a fingerprint for a document, the document comprising a semantic construct having one or more ordered words, the method comprising the steps of: defining a range of sizes for a fingerprint element; dividing the ordered words of the semantic construct into a set of one or more mutually exclusive fingerprint elements, wherein each of the one or more mutually exclusive fingerprint elements includes a number of adjacent words, the number being within the range of sizes for a fingerprint element; and responsive to a determination that the set of mutually exclusive fingerprint elements excludes a word from the semantic construct, discarding the excluded word.
Abstract translation: 一种用于生成用于文档的指纹的一个或多个元素的集合的方法,所述文档包括具有一个或多个有序字的语义结构,所述方法包括以下步骤:定义指纹元素的尺寸范围; 将所述语义结构的有序单词划分成一组一个或多个相互排斥的指纹元素,其中所述一个或多个互斥指纹元素中的每一个包括多个相邻单词,所述多个相邻单词在所述指纹元素的大小范围内 ; 并且响应于所述一组相互排斥的指纹元素从所述语义构造中排除单词的确定,丢弃所排除的单词。
-
公开(公告)号:US07555489B2
公开(公告)日:2009-06-30
申请号:US11264805
申请日:2005-11-01
Applicant: James Bell , Megan A. Beynon , Benjamin P. Delo , Andrew J. Flegg , Julian Friedman , Philipp Offermann
Inventor: James Bell , Megan A. Beynon , Benjamin P. Delo , Andrew J. Flegg , Julian Friedman , Philipp Offermann
CPC classification number: G06F17/2785 , Y10S707/99942 , Y10S707/99943 , Y10S707/99944
Abstract: Mechanisms for generating a set of one or more elements of a fingerprint for a document, the document comprising a semantic construct having one or more ordered words, are provided. With these mechanisms, a range of sizes for a fingerprint element is defined and ordered words of the semantic construct are divided into a set of one or more mutually exclusive fingerprint elements. Each of the one or more mutually exclusive fingerprint elements includes a number of adjacent words, the number being within the range of sizes for a fingerprint element. Responsive to a determination that the set of mutually exclusive fingerprint elements excludes a word from the semantic construct, the excluded word is discarded.
Abstract translation: 提供了用于生成用于文档的指纹的一个或多个元素的集合的机制,所述文档包括具有一个或多个有序字的语义结构。 利用这些机制,定义指纹元素的大小范围,并将语义结构的排序词分成一组一个或多个相互排斥的指纹元素。 一个或多个相互排斥的指纹元素中的每一个包括多个相邻字,该数字在指纹元素的大小范围内。 响应于相互排斥的指纹元素的集合排除语义构造中的单词的确定,排除的字被丢弃。
-