Abstract:
Systems, methods, and computer storage media having computer-executable instructions embodied thereon are described that use images to generate identifiers of entities and to determine relationships between entities and web pages. A collection of class images is generated that represents the people and non-people entities deemed relevant for tracking user behavior. Given a class image that represents an entity, an image identifier comprising unique feature vectors for that class image is generated. Given an arbitrary web page, all the images embedded on the web page are analyzed. Each embedded image is compared to the image identifiers of the collection of class images to determine whether any entities of interest appear in the embedded images of the web page. If relevant entities do appear on the web page, associations can be made among the entities found on the page, and between those entities and the uniform resource locator (URL) of the web page where they appear.
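The comparison step can be illustrated with a minimal sketch. The abstract does not say how feature vectors are computed or compared, so everything here is an assumption made for illustration: each image identifier is reduced to a single numeric vector, similarity is measured with cosine similarity, and the names `cosine_similarity`, `entities_on_page`, and the `threshold` value are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def entities_on_page(
    page_url: str,
    embedded_vectors: List[Sequence[float]],
    class_identifiers: Dict[str, Sequence[float]],
    threshold: float = 0.9,  # assumed cut-off, not taken from the source
) -> Dict[str, List[str]]:
    """Associate a URL with every entity whose identifier matches an embedded image."""
    found = set()
    for vector in embedded_vectors:
        for entity, identifier in class_identifiers.items():
            if cosine_similarity(vector, identifier) >= threshold:
                found.add(entity)
    return {page_url: sorted(found)}


# Hypothetical identifiers for two class images and two images found on a page.
class_identifiers = {"entity_a": [1.0, 0.0, 0.0], "entity_b": [0.0, 1.0, 0.0]}
embedded = [[0.99, 0.05, 0.0], [0.0, 0.0, 1.0]]
associations = entities_on_page("https://example.com/page", embedded, class_identifiers)
```

In this sketch only the first embedded image is close enough to a class identifier, so the page is associated with `entity_a` alone.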
Abstract:
An influential persons identification system and method for identifying a set of influential persons (or influencers) in a social network (such as an online social network). The influential persons set is generated such that a message sent to the set propagates through the network with the greatest speed and coverage. A ranking of users is generated, and a pruning process is performed starting with the top-ranked user and working down the list. For each user on the list, the user is identified as an influencer, and then the user and each of the user's friends are deleted from the social network users list. Next, the same process is performed for the second-ranked user, the third-ranked user, and so forth. The process terminates when the list of users of the social network is exhausted or the desired number of influencers in the influential persons set is reached.
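The pruning process described above can be sketched as a greedy loop. This is an illustrative reading of the abstract, not the patented implementation: the ranking is taken as given, a user who was already deleted as someone's friend is assumed to be skipped, and the names `select_influencers`, `friends`, and `max_influencers` are hypothetical.

```python
from typing import Dict, List, Optional, Set


def select_influencers(
    ranked_users: List[str],
    friends: Dict[str, Set[str]],
    max_influencers: Optional[int] = None,
) -> List[str]:
    """Walk the ranking top-down, keeping a user and pruning that user's friends."""
    remaining = set(ranked_users)  # the social network users list
    influencers: List[str] = []
    for user in ranked_users:
        # Stop when the users list is exhausted or enough influencers are found.
        if not remaining:
            break
        if max_influencers is not None and len(influencers) >= max_influencers:
            break
        if user not in remaining:
            continue  # already deleted as a friend of an earlier influencer (assumption)
        influencers.append(user)
        remaining.discard(user)
        remaining -= friends.get(user, set())
    return influencers


# Hypothetical five-user network, listed from highest to lowest rank.
ranking = ["ann", "bob", "cat", "dan", "eve"]
friendships = {
    "ann": {"bob", "cat"},
    "bob": {"ann"},
    "cat": {"ann"},
    "dan": {"eve"},
    "eve": {"dan"},
}
chosen = select_influencers(ranking, friendships)
```

Here `ann` is selected first, which removes `bob` and `cat`; `dan` is selected next, which removes `eve` and exhausts the list.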
Abstract:
Methods and systems for displaying data retrieved from a multi-dimensional data source via an interactive data diagram. A graphical user interface is responsive to input from a user to retrieve multi-dimensional data for display via an interactive data diagram. The interactive data diagram displays multi-dimensional data in a hierarchical structure that includes a plurality of dimension levels and one or more member levels within each dimension level. A user specifies a change to the display structure by selecting a displayed member level in the hierarchical structure. The interactive data diagram is responsive to the user-specified change to generate a drilled-down data diagram displaying detailed dimension and member levels related to the selected member level.
Abstract:
Methods, systems, and computer-readable media are provided for facilitating personalization of web content while protecting the privacy of the user data used to personalize the user's experience. A privacy vault may collect user data including user activity data, demographic data, and user interests submitted by a user. In one embodiment, the privacy vault operates on a user client device. The privacy vault sends the user data to a community vault that collects user data from multiple users. The community vault generates segment rules that determine whether a user belongs to a user segment, which expresses a user's interest. The segment rules are then communicated back to the privacy vault, which assigns one or more user segments to the user based on the user data available to the privacy vault and the segment rules. The privacy vault may communicate user segments to one or more content providers that supply personalized content selected based on the user segments provided.
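The last step, in which the privacy vault applies segment rules to locally held user data, might look roughly like the sketch below. The abstract does not define the form of a segment rule, so the rule shape here (a set of interest keywords plus a minimum activity count) is purely an assumption, and `SegmentRule` and `assign_segments` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List


@dataclass(frozen=True)
class SegmentRule:
    """Assumed rule shape: a segment applies when keyword activity reaches a minimum."""
    segment: str
    keywords: FrozenSet[str]
    min_matches: int


def assign_segments(
    activity_counts: Dict[str, int], rules: List[SegmentRule]
) -> List[str]:
    """Evaluate rules on the client; only the resulting segment names are returned."""
    segments = []
    for rule in rules:
        matches = sum(activity_counts.get(keyword, 0) for keyword in rule.keywords)
        if matches >= rule.min_matches:
            segments.append(rule.segment)
    return segments


# Hypothetical rules received from the community vault.
rules = [
    SegmentRule("sports_fan", frozenset({"soccer", "tennis"}), 3),
    SegmentRule("traveler", frozenset({"flights", "hotels"}), 2),
]
# Hypothetical activity data held by the privacy vault.
activity = {"soccer": 2, "tennis": 1, "hotels": 1}
user_segments = assign_segments(activity, rules)
```

Only the segment names leave the function, which mirrors the idea that content providers receive user segments rather than the underlying user data.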
Abstract:
A social network visualization and mining system that includes a visualization application for mining social networks of users in an online social network. This visualization can be used to mine the social network for additional information and intelligence. The social network is displayed in graphical form, such as a node-link graph, with a center node representing the user being examined and secondary nodes representing that primary user's friends. Lines represent links between the primary user and the user's friends, while visualization features such as line thickness, line color, and text size make the type of relationship between users easy to identify. The system also includes a topics visualization module, which builds and displays a social network based on a topic or keyword entered by the application user. A demographic prediction module examines a user's social network to predict demographics of users.
Abstract:
A system and method are provided for deriving business intelligence (BI) data and exploring the derived data. The system may include a business intelligence engine and a business intelligence visualizer. The BI engine may be responsible for deriving or discovering fact summary data. The fact summary data may include aggregated or trend data in addition to the dimension or measure data. The BI engine may include components for determining fact summary data such as “What's Hot” and “What's Not Hot”. The components of the BI engine may include an algorithm for automatically generating “hotness scores” for members of dimensions or combinations of dimensions. The BI visualizer provides a chart node tree display for user exploration.
Abstract:
Computer-readable media, systems, and methods for sensitive webpage content detection are described. In embodiments, a multi-class classifier is developed and one or more webpages with webpage content are received. In various embodiments, the one or more webpages are analyzed with the multi-class classifier and, in various embodiments, a sensitivity level is predicted that is associated with the webpage content of the one or more webpages. In various other embodiments, the multi-class classifier includes one or more sensitivity categories.
Abstract:
This invention provides a system and method for predicting user demographic attributes for non-registered users and users with incomplete profiles. The invention uses demographic information from registered users and user search history logs to create a database of information that associates the users' search history habits with their demographic attributes. The invention creates a first database that associates users' search query history with their demographic attributes, and a second database that associates web pages that users have visited frequently with the users' demographic attributes. The invention can compare the searching and browsing habits of non-registered users and users with incomplete profiles to the searching and browsing habits of registered users. Through the comparison, the invention can use the corresponding demographic attributes of the registered users to predict the demographic attributes of the non-registered users and the registered users with incomplete profiles.
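One simple way to picture the comparison is a vote over shared search queries. The abstract does not state how habits are compared, so the approach below is a stand-in chosen for illustration: an index maps each query to the attribute values of the registered users who issued it, and a prediction is the value with the most votes. The names `build_query_index` and `predict_attribute` and the sample data are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, List, Optional, Tuple


def build_query_index(
    registered: Iterable[Tuple[List[str], str]]
) -> Dict[str, Counter]:
    """Map each search query to counts of the attribute values of users who issued it."""
    index: Dict[str, Counter] = defaultdict(Counter)
    for queries, attribute in registered:
        for query in set(queries):  # count each user once per query
            index[query][attribute] += 1
    return index


def predict_attribute(
    index: Dict[str, Counter], queries: List[str]
) -> Optional[str]:
    """Predict the attribute value that receives the most votes; None if no overlap."""
    votes: Counter = Counter()
    for query in set(queries):
        votes.update(index.get(query, Counter()))
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Hypothetical registered users: (search queries, age group).
registered_users = [
    (["knitting", "recipes"], "45-54"),
    (["skateboards", "sneakers"], "18-24"),
    (["sneakers", "concerts"], "18-24"),
]
query_index = build_query_index(registered_users)
predicted = predict_attribute(query_index, ["sneakers", "concerts"])
```

The same structure could be repeated for frequently visited web pages, corresponding to the second database in the abstract.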