Efficient data infrastructure for high dimensional data analysis
    1.
    发明授权
    Efficient data infrastructure for high dimensional data analysis 有权
    高维数据基础架构,用于高维数据分析

    公开(公告)号:US07870114B2

    公开(公告)日:2011-01-11

    申请号:US11818879

    申请日:2007-06-15

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30592

    摘要: Described is a technology by which high dimensional source data corresponding to rows of records with identifiers, and columns comprising dimensions of data values, are processed into a file model for efficient access. An inverted index corresponding to any dimension is built by mapping data from raw dimension values to mapped values based on mapping entries in a dimension table. The record identifiers are arranged into subgroups based on their mapped value; a count and/or an offset may be maintained for locating each of the subgroups. The raw values for a dimension are maintained within a raw value file. For sparse data, the raw value file may be compressed, e.g., by excluding nulls and associating a record identifier with each non-null. A data manager provides access to data in the data files, such as by offering various functions, using caching for efficiency.

    摘要翻译: 描述了一种技术,通过该技术将对应于具有标识符的记录行的高维源数据和包括数据值的维的列处理成用于有效访问的文件模型。 通过根据维度表中的映射条目将数据从原始维度值映射到映射值,构建对应于任何维度的反向索引。 记录标识符根据其映射值排列成子组; 可以维持计数和/或偏移以定位每个子组​​。 维度的原始值保持在原始值文件中。 对于稀疏数据,可以例如通过排除空值并将记录标识符与每个非空值相关联来压缩原始值文件。 数据管理器提供对数据文件中的数据的访问,例如通过提供各种功能,使用缓存来提高效率。

    Software feature modeling and recognition
    2.
    发明申请
    Software feature modeling and recognition 有权
    软件特征建模与识别

    公开(公告)号:US20080312899A1

    公开(公告)日:2008-12-18

    申请号:US11818596

    申请日:2007-06-15

    IPC分类号: G06F9/45 G06F3/048

    CPC分类号: G06F11/28

    摘要: Described is a technology by which software program feature usage is located within a sequence of commands collected during program usage sessions. For example, feature generally corresponds to a series of commands, such as copy and paste. A visual modeling component is controlled via drag-and-drop operations to describe a feature model, which is then compiled by a compiler into a finite state machine. Noise models may be used to exclude any command in the sequence that is irrelevant to the feature usage. A recognition process uses the finite state machine to locate program feature usage within the sequence of recorded commands by matching command sub-sequences corresponding to the feature model via the state machine. An analyzer may then use the located matches to provide an analysis report on feature usage.

    摘要翻译: 描述了软件程序特征使用位于在程序使用会话期间收集的一系列命令中的技术。 例如,特征通常对应于一系列命令,例如复制和粘贴。 视觉建模组件通过拖放操作进行控制,以描述特征模型,然后将其由编译器编译成有限状态机。 噪声模型可用于排除序列中与特征使用无关的任何命令。 识别过程使用有限状态机通过状态机匹配与特征模型对应的命令子序列来定位记录命令序列内的节目特征使用。 然后,分析仪可以使用定位的匹配来提供关于特征使用的分析报告。

    Analyzing software usage with instrumentation data
    3.
    发明授权
    Analyzing software usage with instrumentation data 有权
    使用仪器数据分析软件使用情况

    公开(公告)号:US08176476B2

    公开(公告)日:2012-05-08

    申请号:US11818611

    申请日:2007-06-15

    IPC分类号: G06F9/44

    摘要: Described is a technology by which software instrumentation data collected from user program sessions are analyzed to output an analysis report or the like via example methods and an architecture configured for efficient operation. A client component queries a service for analysis related information. To process the query, the service works with a data manager, and via a high dimensional analysis component may use information processed from the software instrumentation data, such as in the form of one or more inverted indexes and/or raw value files. The service may include a usage analysis component, a feature recognition component that locates features from command sequences, a user recognition component and/or a program reliability component. One or more counterpart components at the client may generate analysis reports or the like based on the query results. The client also may maintain user libraries and feature libraries to facilitate analyses.

    摘要翻译: 描述了通过对用户程序会话收集的软件仪表数据进行分析以通过示例方法和配置为有效操作的架构来输出分析报告等的技术。 客户端组件向服务查询分析相关信息。 为了处理查询,该服务与数据管理器一起工作,并且通过高维分析组件可以使用从软件工具数据处理的信息,例如以一个或多个反向索引和/或原始值文件的形式。 该服务可以包括使用分析组件,从命令序列定位特征的特征识别组件,用户识别组件和/或程序可靠性组件。 客户端上的一个或多个对应组件可以基于查询结果生成分析报告等。 客户端还可以维护用户库和功能库,以便于分析。

    Software feature usage analysis and reporting
    4.
    发明申请
    Software feature usage analysis and reporting 有权
    软件功能使用分析和报告

    公开(公告)号:US20080313633A1

    公开(公告)日:2008-12-18

    申请号:US11818600

    申请日:2007-06-15

    IPC分类号: G06F9/46

    CPC分类号: G06F11/3466 G06F2201/865

    摘要: Described is a technology for analyzing usage of a software program's features. Software instrumentation data is during actual user program usage sessions. The collected data is then processed to determine various feature usage counts and other information, cross-feature usage (e.g., among users who use a feature, how many use another feature or program), and characteristics of feature users, e.g., how long, how much, how often and how extensive feature users use a program. Session analysis may be performed to provide information about the number of sessions in which a set of features occur. Feature usage trends over time may also be determined via analysis. A user interface is described for facilitating selection of one or more features to analyze, for facilitating selection of a group of users, and/or for outputting results corresponding to the analysis.

    摘要翻译: 描述了一种用于分析软件程序功能的使用的技术。 软件测试数据是在实际的用户程序使用会话期间。 然后处理所收集的数据以确定各种特征使用计数和其他信息,交叉特征使用(例如,在使用特征的用户中,使用另一个特征或程序的用户数量)以及特征用户的特征,例如多长时间, 用户使用程序多少,多长时间以及广泛的功能。 可以执行会话分析以提供关于一组特征出现的会话数量的信息。 随着时间的推移,功能使用趋势也可以通过分析来确定。 描述用户界面以便于选择一个或多个特征以分析,以便于选择一组用户,和/或用于输出对应于该分析的结果。

    Multidimensional analysis tool for high dimensional data
    5.
    发明授权
    Multidimensional analysis tool for high dimensional data 有权
    用于高维数据的多维分析工具

    公开(公告)号:US07765216B2

    公开(公告)日:2010-07-27

    申请号:US11818607

    申请日:2007-06-15

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30592

    摘要: Described is a technology by which high dimensional data may be efficiently analyzed, including by filtering, grouping, aggregating and/or sorting operations to provide an analysis result. For efficiency in the analysis, an inverted index may be built (e.g., as part of filtering), and/or a hash structure (e.g., as part of grouping). Analysis parameters specify dimensions, on which union and/or intersection operations are performed to provide a final dataset. The analysis tool provides a user interface for inputting analysis parameters and outputting information corresponding to an analysis result. The analysis tool may sort the information corresponding to the analysis result, e.g., to output the topmost or bottommost results.

    摘要翻译: 描述了可以通过过滤,分组,聚合和/或排序操作来有效地分析高维度数据以提供分析结果的技术。 为了分析的效率,可以构建反向索引(例如,作为过滤的一部分)和/或散列结构(例如,作为分组的一部分)。 分析参数指定维度,执行联合和/或交集操作以提供最终数据集。 分析工具提供用于输入分析参数并输出与分析结果相对应的信息的用户界面。 分析工具可以对与分析结果相对应的信息进行排序,例如输出最上面或最底部的结果。

    Multidimensional analysis tool for high dimensional data
    6.
    发明申请
    Multidimensional analysis tool for high dimensional data 有权
    用于高维数据的多维分析工具

    公开(公告)号:US20080313184A1

    公开(公告)日:2008-12-18

    申请号:US11818607

    申请日:2007-06-15

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30592

    摘要: Described is a technology by which high dimensional data may be efficiently analyzed, including by filtering, grouping, aggregating and/or sorting operations to provide an analysis result. For efficiency in the analysis, an inverted index may be built (e.g., as part of filtering), and/or a hash structure (e.g., as part of grouping). Analysis parameters specify dimensions, on which union and/or intersection operations are performed to provide a final dataset. The analysis tool provides a user interface for inputting analysis parameters and outputting information corresponding to an analysis result. The analysis tool may sort the information corresponding to the analysis result, e.g., to output the topmost or bottommost results.

    摘要翻译: 描述了可以通过过滤,分组,聚合和/或排序操作来有效地分析高维度数据以提供分析结果的技术。 为了分析的效率,可以构建反向索引(例如,作为过滤的一部分)和/或散列结构(例如,作为分组的一部分)。 分析参数指定维度,执行联合和/或交集操作以提供最终数据集。 分析工具提供用于输入分析参数并输出与分析结果相对应的信息的用户界面。 分析工具可以对与分析结果相对应的信息进行排序,例如输出最上面或最底部的结果。

    Software feature usage analysis and reporting
    7.
    发明授权
    Software feature usage analysis and reporting 有权
    软件功能使用分析和报告

    公开(公告)号:US07747988B2

    公开(公告)日:2010-06-29

    申请号:US11818600

    申请日:2007-06-15

    IPC分类号: G06F9/44

    CPC分类号: G06F11/3466 G06F2201/865

    摘要: Described is a technology for analyzing usage of a software program's features. Software instrumentation data is collected during actual user program usage sessions. The collected data is then processed to determine various feature usage counts and other information, cross-feature usage (e.g., among users who use a feature, how many use another feature or program), and characteristics of feature users, e.g., how long, how much, how often and how extensive feature users use a program. Session analysis may be performed to provide information about the number of sessions in which a set of features occur. Feature usage trends over time may also be determined via analysis. A user interface is described for facilitating selection of one or more features to analyze, for facilitating selection of a group of users, and/or for outputting results corresponding to the analysis.

    摘要翻译: 描述了一种用于分析软件程序功能的使用的技术。 在实际用户程序使用会话期间收集软件仪器数据。 然后处理所收集的数据以确定各种特征使用计数和其他信息,交叉特征使用(例如,在使用特征的用户中,使用另一个特征或程序的用户数量)以及特征用户的特征,例如多长时间, 用户使用程序多少,多长时间以及广泛的功能。 可以执行会话分析以提供关于一组特征出现的会话数量的信息。 随着时间的推移,功能使用趋势也可以通过分析来确定。 描述用户界面以便于选择一个或多个特征以分析,以便于选择一组用户,和/或用于输出对应于该分析的结果。

    Efficient data infrastructure for high dimensional data analysis
    8.
    发明申请
    Efficient data infrastructure for high dimensional data analysis 有权
    高维数据基础架构,用于高维数据分析

    公开(公告)号:US20080313213A1

    公开(公告)日:2008-12-18

    申请号:US11818879

    申请日:2007-06-15

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30592

    摘要: Described is a technology by which high dimensional source data corresponding to rows of records with identifiers, and columns comprising dimensions of data values, are processed into a file model for efficient access. An inverted index corresponding to any dimension is built by mapping data from raw dimension values to mapped values based on mapping entries in a dimension table. The record identifiers are arranged into subgroups based on their mapped value; a count and/or an offset may be maintained for locating each of the subgroups. The raw values for a dimension are maintained within a raw value file. For sparse data, the raw value file may be compressed, e.g., by excluding nulls and associating a record identifier with each non-null. A data manager provides access to data in the data files, such as by offering various functions, using caching for efficiency.

    摘要翻译: 描述了一种技术,通过该技术将对应于具有标识符的记录行的高维源数据和包括数据值的维的列处理成用于有效访问的文件模型。 通过根据维度表中的映射条目将数据从原始维度值映射到映射值,构建对应于任何维度的反向索引。 记录标识符根据其映射值排列成子组; 可以维持计数和/或偏移以定位每个子组​​。 维度的原始值保持在原始值文件中。 对于稀疏数据,可以例如通过排除空值并将记录标识符与每个非空值相关联来压缩原始值文件。 数据管理器提供对数据文件中的数据的访问,例如通过提供各种功能,使用缓存来提高效率。

    Software feature modeling and recognition
    9.
    发明授权
    Software feature modeling and recognition 有权
    软件特征建模与识别

    公开(公告)号:US07680645B2

    公开(公告)日:2010-03-16

    申请号:US11818596

    申请日:2007-06-15

    IPC分类号: G06F9/45

    CPC分类号: G06F11/28

    摘要: Described is a technology by which software program feature usage is located within a sequence of commands collected during program usage sessions. For example, feature generally corresponds to a series of commands, such as copy and paste. A visual modeling component is controlled via drag-and-drop operations to describe a feature model, which is then compiled by a compiler into a finite state machine. Noise models may be used to exclude any command in the sequence that is irrelevant to the feature usage. A recognition process uses the finite state machine to locate program feature usage within the sequence of recorded commands by matching command sub-sequences corresponding to the feature model via the state machine. An analyzer may then use the located matches to provide an analysis report on feature usage.

    摘要翻译: 描述了软件程序特征使用位于在程序使用会话期间收集的一系列命令中的技术。 例如,特征通常对应于一系列命令,例如复制和粘贴。 视觉建模组件通过拖放操作进行控制,以描述特征模型,然后将其由编译器编译成有限状态机。 噪声模型可用于排除序列中与特征使用无关的任何命令。 识别过程使用有限状态机通过状态机匹配与特征模型对应的命令子序列来定位记录命令序列内的节目特征使用。 然后,分析仪可以使用定位的匹配来提供关于特征使用的分析报告。

    Analyzing software usage with instrumentation data
    10.
    发明申请
    Analyzing software usage with instrumentation data 有权
    使用仪器数据分析软件使用情况

    公开(公告)号:US20080313149A1

    公开(公告)日:2008-12-18

    申请号:US11818611

    申请日:2007-06-15

    IPC分类号: G06F17/30

    摘要: Described is a technology by which software instrumentation data collected from user program sessions are analyzed to output an analysis report or the like via example methods and an architecture configured for efficient operation. A client component queries a service for analysis related information. To process the query, the service works with a data manager, and via a high dimensional analysis component may use information processed from the software instrumentation data, such as in the form of one or more inverted indexes and/or raw value files. The service may include a usage analysis component, a feature recognition component that locates features from command sequences, a user recognition component and/or a program reliability component. One or more counterpart components at the client may generate analysis reports or the like based on the query results. The client also may maintain user libraries and feature libraries to facilitate analyses.

    摘要翻译: 描述了通过对用户程序会话收集的软件仪表数据进行分析以通过示例方法和配置为有效操作的架构来输出分析报告等的技术。 客户端组件向服务查询分析相关信息。 为了处理查询,该服务与数据管理器一起工作,并且通过高维分析组件可以使用从软件工具数据处理的信息,例如以一个或多个反向索引和/或原始值文件的形式。 该服务可以包括使用分析组件,从命令序列定位特征的特征识别组件,用户识别组件和/或程序可靠性组件。 客户端上的一个或多个对应组件可以基于查询结果生成分析报告等。 客户端还可以维护用户库和功能库,以便于分析。