Abstract:
Developing an ETL system for transforming data prior to loading the data into a data warehouse. An embodiment of the invention automatically generates configuration files from an input data flow diagram defining the ETL system. The configuration files or other metafiles control execution of the processes illustrated in the data flow diagram. The invention includes a notation for use in the data flow diagram.
Abstract:
A bridge circuit adapted to be associated with a PCI and a secondary bus circuits which bridge circuit includes circuitry for storing an indication that a particular PCI bus master has attempted an access of the secondary bus and has been forced to retry that access, circuitry for masking any retry until the bus is again available, and circuitry for providing an interval during which a retrying PCI bus master is guaranteed access to the secondary bus in favor of a bus master on the secondary bus after the bus is relinquished so that a sequence of retry operations causing a loss of bandwidth on the PCI bus is not generated.
Abstract:
Review and filtering of search results is facilitated by providing additional types of results beyond links to existing documents can be provided in addition to or in place of links to existing documents. These additional results can facilitate modifying a search request to filter out unwanted documents. The additional results, when in the form of content display objects and/or application display objects, can also provide the user with varying levels of information detail. In some preferred embodiments, an ontology based knowledge base can be leveraged in order to facilitate providing and filtering the results.
Abstract:
Advertisements are selected for presentation on search result pages and web pages based on phrases generated from lateral concepts and topics identified for the search result pages and web pages. A search query or an indication of a web page is received for which advertisements are to be provided. Lateral concepts and topics are identified based on the search query or content of the web page. The lateral concepts and topics are used as phrases for selecting advertisements from an advertisement inventory. Selected advertisements are provided for presentation on a search results page in response to a search query or on a web page initially identified.
Abstract:
Advertisements are selected for presentation on search result pages and web pages based on phrases generated from lateral concepts and topics identified for the search result pages and web pages. A search query or an indication of a web page is received for which advertisements are to be provided. Lateral concepts and topics are identified based on the search query or content of the web page. The lateral concepts and topics are used as phrases for selecting advertisements from an advertisement inventory. Selected advertisements are provided for presentation on a search results page in response to a search query or on a web page initially identified.
Abstract:
Systems, methods, and computer-storage media for generating and providing knowledge content to users utilizing a web architecture that integrates information across data silos through a common, flexible data storage schema, such as a star or snowflake schema, are provided. Data from a content graph, a user activity graph, a social graph, and temporal data as it relates to each of the content graph, the social graph and the user activity graph, is stored in a knowledge content database utilizing the star schema. In this way, data from each of these formerly disparate sources may be accessed from a common, extensible application platform utilizing ontologies and pivot table functionality, thus providing smarter, more comprehensive knowledge in response to received user queries.
Abstract:
A system that facilitates data processing includes a receiver component that receives an SQL query. A partitioning component partitions the SQL query into multiple tasks and provides the tasks to multiple cluster nodes for processing. The system enables very large amounts of data (e.g., multiple terabytes) to be quickly prepared for analytical processing, such as for use in connection with a search engine, an advertisement provision system, etc.
Abstract:
Transforming data prior to loading the data into a data warehouse. Software of the invention partitions data records received from a plurality of servers and performs sequential file management operations and identifier management operations on each of the partitions prior to loading the data records into the data warehouse. Data records transformed according to the invention are easier to load into the data warehouse and easier to manipulate after loading. The invention enables analysis and data mining of tens of terabytes of user level detail data and summary data.
Abstract:
A method and system are provided for selecting advertisements for presentation to a user in response to a user search query. The system may include a keyword server for parsing the user search query and an index server for receiving the parsed search query. The index server may include an index of advertising phrases and pre-filtering components for comparing index entries to the parsed user search query in order to discard non-matching index entries and locate matching entries. The pre-filtering components may include either a phrase length pre-filtering component or a word hash pre-filtering component. The system may additionally include a listing server for sorting through the matching entries located by the index server and further filtering the matching entries for retrieval and presentation to the user.
Abstract:
Systems and methods for determining the value of bids placed by content providers for placement positions on a page, e.g., a web page, rendered according to a given context, for instance, the search results listing for a particular query initiated on a search engine web site, are provided. Additionally, systems and methods are provided for determining placement of content items, e.g., advertisements and/or images, on a rendered page relative to other content items on the page based upon bid value.