Abstract:
Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverse document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as scores computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each row of the narrowed subset (each row that also contains that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.
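As a minimal sketch of the mechanism described above, the following Python assumes the inverted index is a dictionary mapping each term to a row of (document identifier, precomputed score) pairs sorted by identifier. The names `top_terms`, `find_doc`, and `retrieve` are hypothetical, and summing the per-term scores is an assumption; the abstract does not say how scores are combined.

```python
import math
from bisect import bisect_left


def top_terms(query_terms, index, num_docs, k=2):
    """Pick the k most discriminative query terms, ranked by IDF."""
    def idf(term):
        df = len(index[term])
        return math.log(num_docs / df) if df else 0.0

    present = [t for t in query_terms if t in index]
    return sorted(present, key=idf, reverse=True)[:k]


def find_doc(row, doc_id, start):
    """Binary-search a sorted row for doc_id, starting at position start.

    Returns the new pointer position and whether doc_id was found there.
    """
    pos = bisect_left(row, (doc_id,), start)
    found = pos < len(row) and row[pos][0] == doc_id
    return pos, found


def retrieve(query_terms, index, num_docs, k=2):
    """Return (doc_id, score) pairs found in every top-term row, best first."""
    terms = top_terms(query_terms, index, num_docs, k)
    if not terms:
        return []
    # Drive the search from the shortest row; the other rows each keep a pointer.
    rows = sorted((index[t] for t in terms), key=len)
    pointers = [0] * len(rows)
    results = []
    for doc_id, score in rows[0]:
        total = score
        for i in range(1, len(rows)):
            # Jump the pointer forward instead of rescanning the row.
            pointers[i], found = find_doc(rows[i], doc_id, pointers[i])
            if not found:
                break
            total += rows[i][pointers[i]][1]  # assumed: scores are summed
        else:
            results.append((doc_id, total))
    return sorted(results, key=lambda r: r[1], reverse=True)
```

Because every row is sorted by identifier and the driving row is visited in order, each pointer only ever moves forward, so no row is scanned more than once.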
Abstract:
In one embodiment of the invention, a method of generating a compressed configuration bitstream for a programmable logic device comprises encoding the most-prevalent data word within the configuration data of the bitstream into a codeword of a first type; encoding a set of more-prevalent data words within the configuration data into codewords of a second type; and including in the compressed bitstream at least some of the data words that are members of the set of more-prevalent data words. The included data words, when received by the programmable logic device, are adapted to be identified by the device as members of the set of more-prevalent data words. The included data words are stored for selection by the device when a codeword of the second type representing an included data word is received by the device.
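The encoding scheme above can be illustrated with a small Python sketch. The token names (`TABLE`, `T1`, `T2`, `RAW`), the table size, and the decision to pass the most-prevalent word to the decoder out of band are all assumptions made for illustration; the abstract does not specify the codeword formats.

```python
from collections import Counter


def compress(words, table_size=2):
    """Encode data words as a list of tokens.

    Returns (most_prevalent_word, stream). The most-prevalent word is
    assumed to be known to the decoder out of band.
    """
    if not words:
        return None, []
    ranked = [w for w, _ in Counter(words).most_common()]
    most = ranked[0]
    table = ranked[1:1 + table_size]  # the set of more-prevalent words
    # The more-prevalent words themselves are included in the stream,
    # so the decoder can store them for later selection.
    stream = [("TABLE", w) for w in table]
    for w in words:
        if w == most:
            stream.append(("T1",))                 # codeword of the first type
        elif w in table:
            stream.append(("T2", table.index(w)))  # codeword of the second type
        else:
            stream.append(("RAW", w))              # uncompressed data word
    return most, stream


def decompress(most, stream):
    """Rebuild the original data words from the token stream."""
    table, out = [], []
    for token in stream:
        kind = token[0]
        if kind == "TABLE":
            table.append(token[1])       # store the included data word
        elif kind == "T1":
            out.append(most)
        elif kind == "T2":
            out.append(table[token[1]])  # select a stored word by its index
        else:
            out.append(token[1])
    return out
```

A real bitstream would pack these tokens into variable-length bit fields; tuples are used here only to keep the round trip easy to follow.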
Abstract:
MicroRNAs (miRNAs) are a diverse and abundant class of ~22-nucleotide (nt) endogenous regulatory RNAs that play a variety of roles in animal cells by controlling gene expression at the posttranscriptional level. Increased miR-181a expression in mature T cells is shown to cause a marked increase in T cell activation and to augment T cell sensitivity to peptide antigens. Moreover, T cell blasts with higher miR-181a expression become reactive to antagonists. The effects of miR-181a on antigen discrimination are in part achieved by dampening the expression of multiple negative regulators in the T cell receptor (TCR) signaling pathway, including PTPN22 and the dual specificity phosphatases DUSP5 and DUSP6. This results in a reduction in the TCR signaling threshold, thus quantitatively and qualitatively enhancing T cell sensitivity to antigens.
Abstract:
Systems, methods, and computer storage media having computer-executable instructions embodied thereon that provide content items selected based on context are provided. Contextual indicators associated with a user are identified and utilized to determine one or more content items that the user is likely to desire to access at a particular point in time. Upon receiving an indication that the user desires to perform a context-aware search, the identified content items (or references thereto) are presented automatically to the user, that is, without the user having to input any search query terms. The indication that the user desires to perform a context-aware search may be received, for instance, upon receiving an indication that a selectable context-aware search button has been selected by the user. This single-button action is particularly useful for mobile computing devices, wherein alpha-numeric textual input is relatively difficult.
Abstract:
Described is a technology in which new words (including a phrase or set of Chinese characters) are mined from a query log. The new words may be added to (or otherwise supplement) an IME dictionary. A set of candidate queries may be selected from the log based upon market (e.g., the Chinese market) and/or by language. From this set, various filtering steps are performed to locate only new words that are frequently used. For example, only frequent queries are kept for further processing, which may include filtering out queries based on length (e.g., less than two or greater than eight Chinese characters), and/or filtering out queries based on too many stop-words in the query. Processing may also include filtering out a query that is a substring of a larger query, or vice versa. Also described is Pinyin-based clustering and filtering, and filtering out queries already handled in the dictionary.
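The filtering steps above might be sketched as follows. The function name `mine_new_words`, the thresholds, the treatment of stop-words as single characters, and the choice to drop the shorter query of a substring pair are assumptions made for illustration; Pinyin-based clustering is not shown.

```python
from collections import Counter


def mine_new_words(queries, dictionary, stop_words,
                   min_count=2, min_len=2, max_len=8, max_stop_ratio=0.5):
    """Return candidate new words mined from a list of logged queries."""
    kept = []
    for query, count in Counter(queries).items():
        if count < min_count:
            continue  # keep only frequent queries
        if not (min_len <= len(query) <= max_len):
            continue  # too short or too long
        stops = sum(1 for ch in query if ch in stop_words)
        if stops > max_stop_ratio * len(query):
            continue  # too many stop-words
        if query in dictionary:
            continue  # already handled in the dictionary
        kept.append(query)
    # Assumed policy: drop a query that is a substring of a larger kept query.
    return [q for q in kept
            if not any(q != other and q in other for other in kept)]
```

For instance, if both "微博" and "新浪微博" survive the earlier filters, only the longer "新浪微博" is returned under this policy.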
Abstract:
Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
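To make the notion of a query event concrete, the sketch below flags a time step as an event when the frequency exceeds the trailing-window mean by a multiple of the standard deviation, and then measures how often one query's events precede another's. The threshold rule, the window size, the standard-deviation floor, and the names `find_events` and `precedence_score` are assumptions; the abstract does not define "significant increase" or how causal precedence is measured.

```python
from statistics import mean, pstdev


def find_events(freqs, window=4, k=3.0, min_std=1.0):
    """Return time indices where the frequency rises significantly."""
    events = []
    for t in range(window, len(freqs)):
        past = freqs[t - window:t]
        # Floor the deviation so a flat history does not flag tiny changes.
        spread = max(pstdev(past), min_std)
        if freqs[t] > mean(past) + k * spread:
            events.append(t)
    return events


def precedence_score(events_a, events_b, max_lag=2):
    """Fraction of B's events that follow an event of A within max_lag steps."""
    if not events_b:
        return 0.0
    hits = sum(1 for b in events_b
               if any(0 < b - a <= max_lag for a in events_a))
    return hits / len(events_b)
```

A query A with a high precedence score for query B would be a candidate predictor when forecasting B, in the spirit of the causal relationships described above.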
Abstract:
A search system and method are provided. The method includes constructing a graph-based query that is indicative of a user's preference-levels for different features of a search item (a product, for example). The constructed graph-based query is executed by comparing the user's preference-levels for the different features of the product, which are graphically represented in the query, with information related to sentiments expressed by other users regarding the product. Information related to the sentiments expressed by other users regarding the product can include system-generated product performance graphs constructed from comments regarding the product obtained from the World Wide Web (or other network). Results returned and output upon execution of the graph-based query include system-generated product performance graphs that are similar to the user-submitted query.
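One simple way to picture the comparison step is to treat both the user's query graph and each product performance graph as a mapping from feature name to a level between 0 and 1, and to rank products by how closely the two agree. This representation, the mean-absolute-difference measure, and the names `profile_distance` and `search` are assumptions for illustration only; the abstract does not state how similarity is computed.

```python
def profile_distance(query, performance):
    """Mean absolute difference over the features both profiles share."""
    shared = [f for f in query if f in performance]
    if not shared:
        return float("inf")  # nothing to compare
    return sum(abs(query[f] - performance[f]) for f in shared) / len(shared)


def search(query, products):
    """Return product names ordered from most to least similar to the query."""
    scored = [(profile_distance(query, perf), name)
              for name, perf in products.items()]
    return [name for dist, name in sorted(scored) if dist != float("inf")]
```

Products that share no features with the query are left out of the results rather than ranked last.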
Abstract:
Described is a technology for disambiguating data corresponding to persons that are located from search results, so that different persons having the same name can be clearly distinguished. Name entity extraction locates words (terms) that are within a certain distance of persons' names in the search results. The terms are used in disambiguating search results that correspond to different persons having the same name, such as location information, organization information, career information, and/or partner information. In one example, each person is represented as a vector, and similarity among vectors is calculated based on weighting that corresponds to nearness of the terms to a person, and/or the types of terms. Based on the similarity data, the person vectors that represent the same person are then merged into one cluster, so that each cluster represents (to a high probability) only one distinct person.
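The vector-and-cluster example above could look roughly like the following sketch. The type weights in `TYPE_WEIGHT`, the 1/(1 + distance) nearness weighting, cosine similarity, the greedy merge, and the threshold are all assumptions; the abstract says only that weighting reflects nearness and term type.

```python
import math

# Assumed weights per term type; the actual values are not given in the source.
TYPE_WEIGHT = {"location": 1.0, "organization": 2.0,
               "career": 1.5, "partner": 2.0}


def person_vector(mentions):
    """Build a weighted term vector from (term, term_type, distance) tuples."""
    vec = {}
    for term, term_type, distance in mentions:
        # Terms nearer to the person's name receive more weight.
        weight = TYPE_WEIGHT.get(term_type, 1.0) / (1 + distance)
        vec[term] = vec.get(term, 0.0) + weight
    return vec


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster(vectors, threshold=0.5):
    """Greedily merge person vectors; returns lists of member indices."""
    clusters = []
    for i, vec in enumerate(vectors):
        for c in clusters:
            if cosine(c["vector"], vec) >= threshold:
                for term, w in vec.items():
                    c["vector"][term] = c["vector"].get(term, 0.0) + w
                c["members"].append(i)
                break
        else:
            clusters.append({"vector": dict(vec), "members": [i]})
    return [c["members"] for c in clusters]
```

Each returned list groups the search results that are judged to refer to the same person.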
Abstract:
A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets, each indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system, when presented with an input triplet comprising a user, a query, and a document, determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.
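The abstract does not say which smoothing technique is used. Purely as an illustration, the sketch below swaps in a simple linear interpolation between a user-specific click estimate and a query-level estimate, so that sparse per-user data backs off to what all users clicked. The names `build_model` and `rerank` and the weight `lam` are hypothetical.

```python
from collections import Counter


def build_model(triplets, lam=0.7):
    """Return prob(user, query, doc) estimated from (user, query, doc) clicks."""
    triplets = list(triplets)
    uqd = Counter(triplets)
    uq = Counter((u, q) for u, q, _ in triplets)
    qd = Counter((q, d) for _, q, d in triplets)
    qc = Counter(q for _, q, _ in triplets)

    def prob(user, query, doc):
        # User-specific estimate; zero when this user never issued the query.
        p_user = (uqd[(user, query, doc)] / uq[(user, query)]
                  if uq[(user, query)] else 0.0)
        # Query-level estimate pooled over all users.
        p_query = qd[(query, doc)] / qc[query] if qc[query] else 0.0
        # Assumed smoothing: linear interpolation of the two estimates.
        return lam * p_user + (1 - lam) * p_query

    return prob


def rerank(prob, user, query, docs):
    """Order docs by the smoothed probability for this user and query."""
    return sorted(docs, key=lambda d: prob(user, query, d), reverse=True)
```

With this kind of back-off, a user with no click history for a query still receives the ordering preferred by users overall, while a user with history sees their own preferences dominate.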