-
公开(公告)号:US07870114B2
公开(公告)日:2011-01-11
申请号:US11818879
申请日:2007-06-15
申请人: Haidong Zhang , Guowei Liu , Yantao Li , Bing Sun , Jian Wang
发明人: Haidong Zhang , Guowei Liu , Yantao Li , Bing Sun , Jian Wang
IPC分类号: G06F17/30
CPC分类号: G06F17/30592
摘要: Described is a technology by which high dimensional source data corresponding to rows of records with identifiers, and columns comprising dimensions of data values, are processed into a file model for efficient access. An inverted index corresponding to any dimension is built by mapping data from raw dimension values to mapped values based on mapping entries in a dimension table. The record identifiers are arranged into subgroups based on their mapped value; a count and/or an offset may be maintained for locating each of the subgroups. The raw values for a dimension are maintained within a raw value file. For sparse data, the raw value file may be compressed, e.g., by excluding nulls and associating a record identifier with each non-null. A data manager provides access to data in the data files, such as by offering various functions, using caching for efficiency.
摘要翻译: 描述了一种技术,通过该技术将对应于具有标识符的记录行的高维源数据和包括数据值的维的列处理成用于有效访问的文件模型。 通过根据维度表中的映射条目将数据从原始维度值映射到映射值,构建对应于任何维度的反向索引。 记录标识符根据其映射值排列成子组; 可以维持计数和/或偏移以定位每个子组。 维度的原始值保持在原始值文件中。 对于稀疏数据,可以例如通过排除空值并将记录标识符与每个非空值相关联来压缩原始值文件。 数据管理器提供对数据文件中的数据的访问,例如通过提供各种功能,使用缓存来提高效率。
-
公开(公告)号:US20080312899A1
公开(公告)日:2008-12-18
申请号:US11818596
申请日:2007-06-15
申请人: Yantao Li , Bing Sun , Shuguang Ye , Guowei Liu , Wenli Zhu , Haidong Zhang , Min Wang , Jian Wang
发明人: Yantao Li , Bing Sun , Shuguang Ye , Guowei Liu , Wenli Zhu , Haidong Zhang , Min Wang , Jian Wang
CPC分类号: G06F11/28
摘要: Described is a technology by which software program feature usage is located within a sequence of commands collected during program usage sessions. For example, feature generally corresponds to a series of commands, such as copy and paste. A visual modeling component is controlled via drag-and-drop operations to describe a feature model, which is then compiled by a compiler into a finite state machine. Noise models may be used to exclude any command in the sequence that is irrelevant to the feature usage. A recognition process uses the finite state machine to locate program feature usage within the sequence of recorded commands by matching command sub-sequences corresponding to the feature model via the state machine. An analyzer may then use the located matches to provide an analysis report on feature usage.
摘要翻译: 描述了软件程序特征使用位于在程序使用会话期间收集的一系列命令中的技术。 例如,特征通常对应于一系列命令,例如复制和粘贴。 视觉建模组件通过拖放操作进行控制,以描述特征模型,然后将其由编译器编译成有限状态机。 噪声模型可用于排除序列中与特征使用无关的任何命令。 识别过程使用有限状态机通过状态机匹配与特征模型对应的命令子序列来定位记录命令序列内的节目特征使用。 然后,分析仪可以使用定位的匹配来提供关于特征使用的分析报告。
-
公开(公告)号:US08176476B2
公开(公告)日:2012-05-08
申请号:US11818611
申请日:2007-06-15
申请人: Yantao Li , Adnan Azfar Mahmud , Wenli Zhu , Haidong Zhang , Shuguang Ye , Bing Sun , Qiang Wang , Yingnong Dang , Guowei Liu , Min Wang , Jian Wang
发明人: Yantao Li , Adnan Azfar Mahmud , Wenli Zhu , Haidong Zhang , Shuguang Ye , Bing Sun , Qiang Wang , Yingnong Dang , Guowei Liu , Min Wang , Jian Wang
IPC分类号: G06F9/44
CPC分类号: G06F11/3414 , G06F11/3438 , G06F11/3447 , G06F11/3476
摘要: Described is a technology by which software instrumentation data collected from user program sessions are analyzed to output an analysis report or the like via example methods and an architecture configured for efficient operation. A client component queries a service for analysis related information. To process the query, the service works with a data manager, and via a high dimensional analysis component may use information processed from the software instrumentation data, such as in the form of one or more inverted indexes and/or raw value files. The service may include a usage analysis component, a feature recognition component that locates features from command sequences, a user recognition component and/or a program reliability component. One or more counterpart components at the client may generate analysis reports or the like based on the query results. The client also may maintain user libraries and feature libraries to facilitate analyses.
摘要翻译: 描述了通过对用户程序会话收集的软件仪表数据进行分析以通过示例方法和配置为有效操作的架构来输出分析报告等的技术。 客户端组件向服务查询分析相关信息。 为了处理查询,该服务与数据管理器一起工作,并且通过高维分析组件可以使用从软件工具数据处理的信息,例如以一个或多个反向索引和/或原始值文件的形式。 该服务可以包括使用分析组件,从命令序列定位特征的特征识别组件,用户识别组件和/或程序可靠性组件。 客户端上的一个或多个对应组件可以基于查询结果生成分析报告等。 客户端还可以维护用户库和功能库,以便于分析。
-
公开(公告)号:US20080313633A1
公开(公告)日:2008-12-18
申请号:US11818600
申请日:2007-06-15
申请人: Wenli Zhu , Yantao Li , Qiang Wang , Min Wang , Adnan Azfar Mahmud , Jian Wang , Haidong Zhang , Bing Sun , Guowei Liu , Shuguang Ye
发明人: Wenli Zhu , Yantao Li , Qiang Wang , Min Wang , Adnan Azfar Mahmud , Jian Wang , Haidong Zhang , Bing Sun , Guowei Liu , Shuguang Ye
IPC分类号: G06F9/46
CPC分类号: G06F11/3466 , G06F2201/865
摘要: Described is a technology for analyzing usage of a software program's features. Software instrumentation data is during actual user program usage sessions. The collected data is then processed to determine various feature usage counts and other information, cross-feature usage (e.g., among users who use a feature, how many use another feature or program), and characteristics of feature users, e.g., how long, how much, how often and how extensive feature users use a program. Session analysis may be performed to provide information about the number of sessions in which a set of features occur. Feature usage trends over time may also be determined via analysis. A user interface is described for facilitating selection of one or more features to analyze, for facilitating selection of a group of users, and/or for outputting results corresponding to the analysis.
摘要翻译: 描述了一种用于分析软件程序功能的使用的技术。 软件测试数据是在实际的用户程序使用会话期间。 然后处理所收集的数据以确定各种特征使用计数和其他信息,交叉特征使用(例如,在使用特征的用户中,使用另一个特征或程序的用户数量)以及特征用户的特征,例如多长时间, 用户使用程序多少,多长时间以及广泛的功能。 可以执行会话分析以提供关于一组特征出现的会话数量的信息。 随着时间的推移,功能使用趋势也可以通过分析来确定。 描述用户界面以便于选择一个或多个特征以分析,以便于选择一组用户,和/或用于输出对应于该分析的结果。
-
公开(公告)号:US07765216B2
公开(公告)日:2010-07-27
申请号:US11818607
申请日:2007-06-15
申请人: Yantao Li , Guowei Liu , Haidong Zhang , Adnan Azfar Mahmud , Bing Sun , Min Wang , Wenli Zhu , Jian Wang
发明人: Yantao Li , Guowei Liu , Haidong Zhang , Adnan Azfar Mahmud , Bing Sun , Min Wang , Wenli Zhu , Jian Wang
IPC分类号: G06F17/30
CPC分类号: G06F17/30592
摘要: Described is a technology by which high dimensional data may be efficiently analyzed, including by filtering, grouping, aggregating and/or sorting operations to provide an analysis result. For efficiency in the analysis, an inverted index may be built (e.g., as part of filtering), and/or a hash structure (e.g., as part of grouping). Analysis parameters specify dimensions, on which union and/or intersection operations are performed to provide a final dataset. The analysis tool provides a user interface for inputting analysis parameters and outputting information corresponding to an analysis result. The analysis tool may sort the information corresponding to the analysis result, e.g., to output the topmost or bottommost results.
摘要翻译: 描述了可以通过过滤,分组,聚合和/或排序操作来有效地分析高维度数据以提供分析结果的技术。 为了分析的效率,可以构建反向索引(例如,作为过滤的一部分)和/或散列结构(例如,作为分组的一部分)。 分析参数指定维度,执行联合和/或交集操作以提供最终数据集。 分析工具提供用于输入分析参数并输出与分析结果相对应的信息的用户界面。 分析工具可以对与分析结果相对应的信息进行排序,例如输出最上面或最底部的结果。
-
公开(公告)号:US20080313184A1
公开(公告)日:2008-12-18
申请号:US11818607
申请日:2007-06-15
申请人: Yantao Li , Guowei Liu , Haidong Zhang , Adnan Azfar Mahmud , Bing Sun , Min Wang , Wenli Zhu , Jian Wang
发明人: Yantao Li , Guowei Liu , Haidong Zhang , Adnan Azfar Mahmud , Bing Sun , Min Wang , Wenli Zhu , Jian Wang
IPC分类号: G06F17/30
CPC分类号: G06F17/30592
摘要: Described is a technology by which high dimensional data may be efficiently analyzed, including by filtering, grouping, aggregating and/or sorting operations to provide an analysis result. For efficiency in the analysis, an inverted index may be built (e.g., as part of filtering), and/or a hash structure (e.g., as part of grouping). Analysis parameters specify dimensions, on which union and/or intersection operations are performed to provide a final dataset. The analysis tool provides a user interface for inputting analysis parameters and outputting information corresponding to an analysis result. The analysis tool may sort the information corresponding to the analysis result, e.g., to output the topmost or bottommost results.
摘要翻译: 描述了可以通过过滤,分组,聚合和/或排序操作来有效地分析高维度数据以提供分析结果的技术。 为了分析的效率,可以构建反向索引(例如,作为过滤的一部分)和/或散列结构(例如,作为分组的一部分)。 分析参数指定维度,执行联合和/或交集操作以提供最终数据集。 分析工具提供用于输入分析参数并输出与分析结果相对应的信息的用户界面。 分析工具可以对与分析结果相对应的信息进行排序,例如输出最上面或最底部的结果。
-
公开(公告)号:US07747988B2
公开(公告)日:2010-06-29
申请号:US11818600
申请日:2007-06-15
申请人: Wenli Zhu , Yantao Li , Qiang Wang , Haidong Zhang , Bing Sun , Guowei Liu , Shuguang Ye , Min Wang , Adan Azfar Mahmud , Jian Wang
发明人: Wenli Zhu , Yantao Li , Qiang Wang , Haidong Zhang , Bing Sun , Guowei Liu , Shuguang Ye , Min Wang , Adan Azfar Mahmud , Jian Wang
IPC分类号: G06F9/44
CPC分类号: G06F11/3466 , G06F2201/865
摘要: Described is a technology for analyzing usage of a software program's features. Software instrumentation data is collected during actual user program usage sessions. The collected data is then processed to determine various feature usage counts and other information, cross-feature usage (e.g., among users who use a feature, how many use another feature or program), and characteristics of feature users, e.g., how long, how much, how often and how extensive feature users use a program. Session analysis may be performed to provide information about the number of sessions in which a set of features occur. Feature usage trends over time may also be determined via analysis. A user interface is described for facilitating selection of one or more features to analyze, for facilitating selection of a group of users, and/or for outputting results corresponding to the analysis.
摘要翻译: 描述了一种用于分析软件程序功能的使用的技术。 在实际用户程序使用会话期间收集软件仪器数据。 然后处理所收集的数据以确定各种特征使用计数和其他信息,交叉特征使用(例如,在使用特征的用户中,使用另一个特征或程序的用户数量)以及特征用户的特征,例如多长时间, 用户使用程序多少,多长时间以及广泛的功能。 可以执行会话分析以提供关于一组特征出现的会话数量的信息。 随着时间的推移,功能使用趋势也可以通过分析来确定。 描述用户界面以便于选择一个或多个特征以分析,以便于选择一组用户,和/或用于输出对应于该分析的结果。
-
公开(公告)号:US20080313213A1
公开(公告)日:2008-12-18
申请号:US11818879
申请日:2007-06-15
申请人: Haidong Zhang , Guowei Liu , Yantao Li , Bing Sun , Jian Wang
发明人: Haidong Zhang , Guowei Liu , Yantao Li , Bing Sun , Jian Wang
IPC分类号: G06F7/00
CPC分类号: G06F17/30592
摘要: Described is a technology by which high dimensional source data corresponding to rows of records with identifiers, and columns comprising dimensions of data values, are processed into a file model for efficient access. An inverted index corresponding to any dimension is built by mapping data from raw dimension values to mapped values based on mapping entries in a dimension table. The record identifiers are arranged into subgroups based on their mapped value; a count and/or an offset may be maintained for locating each of the subgroups. The raw values for a dimension are maintained within a raw value file. For sparse data, the raw value file may be compressed, e.g., by excluding nulls and associating a record identifier with each non-null. A data manager provides access to data in the data files, such as by offering various functions, using caching for efficiency.
摘要翻译: 描述了一种技术,通过该技术将对应于具有标识符的记录行的高维源数据和包括数据值的维的列处理成用于有效访问的文件模型。 通过根据维度表中的映射条目将数据从原始维度值映射到映射值,构建对应于任何维度的反向索引。 记录标识符根据其映射值排列成子组; 可以维持计数和/或偏移以定位每个子组。 维度的原始值保持在原始值文件中。 对于稀疏数据,可以例如通过排除空值并将记录标识符与每个非空值相关联来压缩原始值文件。 数据管理器提供对数据文件中的数据的访问,例如通过提供各种功能,使用缓存来提高效率。
-
公开(公告)号:US07680645B2
公开(公告)日:2010-03-16
申请号:US11818596
申请日:2007-06-15
申请人: Yantao Li , Bing Sun , Shuguang Ye , Guowei Liu , Wenli Zhu , Haidong Zhang , Min Wang , Jian Wang
发明人: Yantao Li , Bing Sun , Shuguang Ye , Guowei Liu , Wenli Zhu , Haidong Zhang , Min Wang , Jian Wang
IPC分类号: G06F9/45
CPC分类号: G06F11/28
摘要: Described is a technology by which software program feature usage is located within a sequence of commands collected during program usage sessions. For example, feature generally corresponds to a series of commands, such as copy and paste. A visual modeling component is controlled via drag-and-drop operations to describe a feature model, which is then compiled by a compiler into a finite state machine. Noise models may be used to exclude any command in the sequence that is irrelevant to the feature usage. A recognition process uses the finite state machine to locate program feature usage within the sequence of recorded commands by matching command sub-sequences corresponding to the feature model via the state machine. An analyzer may then use the located matches to provide an analysis report on feature usage.
摘要翻译: 描述了软件程序特征使用位于在程序使用会话期间收集的一系列命令中的技术。 例如,特征通常对应于一系列命令,例如复制和粘贴。 视觉建模组件通过拖放操作进行控制,以描述特征模型,然后将其由编译器编译成有限状态机。 噪声模型可用于排除序列中与特征使用无关的任何命令。 识别过程使用有限状态机通过状态机匹配与特征模型对应的命令子序列来定位记录命令序列内的节目特征使用。 然后,分析仪可以使用定位的匹配来提供关于特征使用的分析报告。
-
公开(公告)号:US20080313149A1
公开(公告)日:2008-12-18
申请号:US11818611
申请日:2007-06-15
申请人: Yanato Li , Adnan Azfar Mahmud , Wenli Zhu , Haidong Zhang , Shuguang Ye , Bing Sun , Qiang Wang , Yingnong Dang , Guowei Liu , Min Wang , Jian Wang
发明人: Yanato Li , Adnan Azfar Mahmud , Wenli Zhu , Haidong Zhang , Shuguang Ye , Bing Sun , Qiang Wang , Yingnong Dang , Guowei Liu , Min Wang , Jian Wang
IPC分类号: G06F17/30
CPC分类号: G06F11/3414 , G06F11/3438 , G06F11/3447 , G06F11/3476
摘要: Described is a technology by which software instrumentation data collected from user program sessions are analyzed to output an analysis report or the like via example methods and an architecture configured for efficient operation. A client component queries a service for analysis related information. To process the query, the service works with a data manager, and via a high dimensional analysis component may use information processed from the software instrumentation data, such as in the form of one or more inverted indexes and/or raw value files. The service may include a usage analysis component, a feature recognition component that locates features from command sequences, a user recognition component and/or a program reliability component. One or more counterpart components at the client may generate analysis reports or the like based on the query results. The client also may maintain user libraries and feature libraries to facilitate analyses.
摘要翻译: 描述了通过对用户程序会话收集的软件仪表数据进行分析以通过示例方法和配置为有效操作的架构来输出分析报告等的技术。 客户端组件向服务查询分析相关信息。 为了处理查询,该服务与数据管理器一起工作,并且通过高维分析组件可以使用从软件工具数据处理的信息,例如以一个或多个反向索引和/或原始值文件的形式。 该服务可以包括使用分析组件,从命令序列定位特征的特征识别组件,用户识别组件和/或程序可靠性组件。 客户端上的一个或多个对应组件可以基于查询结果生成分析报告等。 客户端还可以维护用户库和功能库,以便于分析。
-
-
-
-
-
-
-
-
-