摘要:
A system for identifying different language versions of the same structured format document (e.g., HTML web page) detects the language of the two documents and translates one or both into a preferred language if necessary, parses the two candidate documents and builds two hierarchical data structure based on the document. The data structures are used to compare the hierarchical structure of the two documents and also to access text portions in congruent positions in the two documents. A fuzzy measure of similarity of a set of text portions occupying congruent positions in the two documents is then obtained, to induce a measure of the similarity of the two documents which is compared to a fuzzy threshold.
摘要:
A system and method associates a label and description with a search query such that the query, label, and description can be stored in a shared query repository so that queries can be retrieved by multiple users for reuse. The shared query repository can be searched, so that an appropriate query can be located, retrieved, and then submitted for execution over a document database by a search engine. Retrieved queries can be combined with other retrieved queries or modified with new search terms, and the new combined search query can be used for a new search on the database. The database search system and method efficiently permits reuse of search queries and facilitates sharing of search strategies.
摘要:
A system and associated method that allow particular requests to be executed at some point in the future without specifying the exact time or necessarily a precise location. The execution time of the request is linked to the arrival of a person or object at, or near a geographic destination location. When a person, an object, or a group of persons or objects, arrives at the destination location, or comes close to it, the request to interact will be executed. The proximity threshold can be adjustable or programmable.
摘要:
A network repository service supplements the functions of a web server to enable an increase in the efficiency of web crawling. The repository service: (a) automatically maintains a file modification list that contains the names of files on the server that have been modified (i.e., added, deleted, or otherwise modified), together with the date and time of the file modification; and (b) provides a requesting crawler with the file modification list (or a portion of the list corresponding to a time period specified by the crawler). The repository service may also (c) limit or restrict access privileges of crawlers that do not request the file modification list prior to crawling, thereby protecting the server from overcrawling. The repository service enables a crawler to request the file modification list, and avoid unnecessarily recrawling files that have not been modified since its last visit, thereby preventing considerable waste of time, network bandwidth, server processing resources, and crawler processing resources. Using the file modification list, the crawler can remove all prior references to deleted files, and efficiently recrawl only those files that have been added or changed since the crawler last visited the web server.
摘要:
A master repository service maintains a directory of web servers and the most recent times that their web contents were modified, and provides this information to web crawlers to increase their efficiency. The master repository service receives web content update reports from a plurality of web servers, updates the directory to keep it current, and provides crawlers with web site modification information. The web site modification information preferably comprises identifiers for new web sites, “dead” web sites, and modified web sites. Each crawler is preferably provided only with web site modification information received since it last received information from the master repository service. The information allows web crawlers to know immediately about new web sites, and allows them to spend time visiting only those web sites that are new or that have changed their content.
摘要:
A computer system enables a user to conveniently fill-out, configure, and submit a structure of interrelated data fields, where the order and type of linking between the fields is user selected. A graphical user interface presents a field template having one or more data fields. The user may extend the electronic form by selecting an expand form field; in response to selection of the expand field, the user interface adds a second field template and a connective field to the display. This second template, like the first, includes one or more data fields. Using a connective field, the user identifies a logical relationship between the first field template and the second field template. For instance, the user may select from Boolean or other connective terms to construct a form having a complex format of interrelated fields. As each new field template is added with its corresponding connective field, the user interface also presents a nesting icon, allowing the user to establish a logical hierarchy between the various field templates.
摘要:
An information objects is defined that is representative of a real-world entity (e.g., a product or a service). The information object may be stored in a data store. The information object has an associated owner. A communication channel is associated with the information object that is configurable to route communications to a manager assigned to the information object. A party is enabled to obtain management of the information object for a time period. The communication channel is configured to route to the party requests that are made by interacting with the information object during the time period. A plurality of users is enabled to interact with the information object during the time period to input requests to the party over the communication channel.
摘要:
Techniques for providing information about “offline” content are provided. In one technique, content (e.g., televised or paper-printed content) is “tagged” with a service-associated icon and a keyword. A person seeing the icon in the content may submit the keyword to the service via his web browser. The service responsively submits search-limiting criteria, associated with the keyword, as query terms to a search engine. The search engine determines relevant web pages based on the query terms, dynamically generates search results and returns the search results to either the web browser or the service, which may dynamically generate and send to the web browser another web page containing the search results. Due to the automatic addition of the search-limiting criteria to the query terms, the set of web pages that the search engine determines to be relevant is narrower and more focused than the set otherwise would be.
摘要:
Systems and methods are provided for implementing searches using contextual information associated with a Web page (or other document) that a user is viewing when a query is entered. The page includes a contextual search interface that has an associated context vector representing content of the page. When the user submits a search query via the contextual search interface, the query and the context vector are both provided to the query processor and used in responding to the query.
摘要:
Systems and methods, including user interfaces, are provided for implementing searches using contextual information associated with a Web page (or other document) that a user is viewing when a query is entered. The page includes a contextual search interface that has an associated context vector representing content of the page. When the user submits a search query via the contextual search interface, the query and the context vector are both provided to the query processor and used in responding to the query.