摘要:
A relevance system determines the relevance of a query term to a document based on spans within the document that contain the query term. The relevance system aggregates the relevance of the query terms into an overall relevance for the document. For each query term, the relevance system calculates a span relevance for each span that contains that query term. The relevance system then aggregates the span relevances for a query term into a query term relevance for that document. The relevance system may aggregate the query term relevances into a document relevance.
摘要:
A method and system for generating wrappers for hierarchically organized documents by jointly optimizing template detection and wrapper generation is provided. A wrapper generation system generates a wrapper for documents with similar templates by identifying a cluster of document trees and generating a wrapper tree for the cluster. A wrapper tree defines the wrapper for documents that match the template of the cluster. The wrapper generation system clusters document trees by generating a wrapper tree for the cluster based on an initial document tree. The wrapper generation system then repeatedly determines whether any other document tree matches or nearly matches the wrapper tree for the cluster and, if so, adds the document tree to the cluster and adjusts the wrapper tree as appropriate so that all the document trees, including the newly added one, match the wrapper tree.
摘要:
A method, apparatus and system for suppressing low frequency oscillation in a power system. The method comprises: determining a system transfer function of an interconnected power system section in which a variable frequency transformer (VFT) is located; determining a damping controller parameters according to the system transfer function; and suppressing low frequency oscillation of the power system by means of the VFT based on the damping controller parameter. The objects of the method, apparatus and system are definite: optimizing the damping controller parameter can be achieved by simply tracking and analyzing the response of the system to disturbance, without the need to understand the configuration and parameters of the system or solve complicated power system equations, which has a better effect in suppressing low frequency oscillation in the power system and is advantageous for improving the safety and stability level of the power grid.
摘要:
Computer-readable media, computer systems, and computing devices facilitate enhancing a web index with uniform resource locator (URL)/non-encoding character (NEC) word pairs to facilitate relevance ranking of search results provided in response to a search query that includes NEC words. URLs are received from web pages and substrings extracted therefrom. Additional elements are received from the web page, word-broken into sequences of NEC words, and the NEC words are converted into encoding-language representations which are matched against the URL substrings to identify candidate URL/NEC pairs for utilization in relevance ranking.
摘要:
Computer-readable media, computer systems, and computing devices facilitate enhancing a web index with uniform resource locator (URL)/non-encoding character (NEC) word pairs to facilitate relevance ranking of search results provided in response to a search query that includes NEC words. URLs are received from web pages and substrings extracted therefrom. Additional elements are received from the web page, word-broken into sequences of NEC words, and the NEC words are converted into encoding-language representations which are matched against the URL substrings to identify candidate URL/NEC pairs for utilization in relevance ranking.
摘要:
Techniques described herein allow for suggesting creation of tools for improving search engine performance. Specifically, these tools focus on producing more relevant search engine results via a URL-based query clustering method. These tools first extract tokens from Uniform Resource Locators associated to search queries. With these tokens, these tools form query clusters of common tokens. The resulting clusters can be used to help understand the similarities in user search queries via URL-based cluster queries to produce more relevant search results.
摘要:
An information extraction model is trained on format features identified within labeled training documents. Information from a document is extracted by assigning labels to units based on format features of the units within the document. A begin label and end label are identified and the information is extracted between the begin label and the end label. The extracted information can be used in various document processing tasks such as ranking.
摘要:
A method, apparatus and system for suppressing low frequency oscillation in a power system. The method includes determining a system transfer function of an interconnected power system section in which a variable frequency transformer (VFT) is located; determining a damping controller parameters according to the system transfer function; and suppressing low frequency oscillation of the power system by means of the VFT based on the damping controller parameter. The objects of the method, apparatus and system are definite: optimizing the damping controller parameter can be achieved by simply tracking and analyzing the response of the system to disturbance, without the need to understand the configuration and parameters of the system or solve complicated power system equations, which has a better effect in suppressing low frequency oscillation in the power system and is advantageous for improving the safety and stability level of the power grid.
摘要:
A method and a device for limiting secondary arc current of an extra-high voltage/ultra-high voltage double circuit line on the same tower. The method comprises the following steps: determining the type of a single-phase-to-ground fault when the extra-high voltage/ultra-high voltage double circuit line on the same tower has a single-phase-to-ground fault (S501); selecting a reactance value of a neutral grounding reactor according to the type of the single-phase-to-ground fault (S502); and switching the extra-high voltage/ultra-high voltage double circuit line on the same tower to the selected reactance value of the neutral grounding reactor (S503). Thus the reactance value of the neutral grounding reactor is not constant, but is changed along with the operating conditions of the power transmission line, that is, the reactance value of the neutral grounding reactor is controllable. In this way, when the operating conditions of the extra-high voltage/ultra-high voltage double circuit line on the same tower are different, a neutral grounding reactor with an optimal reactance value can be selected so as to be accessed to the power transmission line, thereby effectively limiting the secondary arc current caused by the single-phase-to-ground fault.
摘要:
Techniques described herein allow for suggesting creation of tools for improving search engine performance. Specifically, these tools focus on producing more relevant search engine results via a URL-based query clustering method. These tools first extract tokens from Uniform Resource Locators associated to search queries. With these tokens, these tools form query clusters of common tokens. The resulting clusters can be used to help understand the similarities in user search queries via URL-based cluster queries to produce more relevant search results.