Abstract:
A method and a device is described for de-duplicating a web page. The method includes: extracting at least one core sentence from a target web page; mapping each core sentence to a unique numeric value to form a first numeric value set; determining an intersection set of the first numeric value set and each second numeric value set, and the number of numeric values included in each intersection set, and determining a maximum number of numeric values included in each intersection set; and when a ratio of the maximum number to a total number of numeric values in the first numeric value set is greater than a set threshold, processing the target web page as a duplicate web page. In embodiments of the present invention, during web page de-duplication processing, accuracy can be improved, an anti-noise capability can be enhanced, and a calculating scale can be reduced.
Abstract:
The present disclosure provides a method, a device, and a system for packet processing, where the method includes: receiving a protocol packet from a downstream device, where the protocol packet carries uplink port information of the downstream device and the user VLAN of the downstream device; learning port types of the present device based on the uplink port information and classifying the user VLAN of the present device and the user VLAN of the downstream device based on the user VLAN of the downstream device; and forwarding the received packet based on configuration information stored on the present device, where the port type or types to which each VLAN type can be added are specified in the configuration information.
Abstract:
The method includes: acquiring a to-be-queried statement, where the to-be-queried statement is a natural language query statement; dividing the to-be-queried statement according to a preset word stock to obtain N words; determining, from a preset database, at least one candidate database entity of a first word, where the first word is any word in the N words, and separately annotating a label on each word in the N words to obtain annotation information corresponding to the to-be-queried statement; generating K query conditions according to the annotation information, where each query condition in the K query conditions includes a second word, an operator, and a third word; generating a query target according to the annotation information, where the query target includes a database entity of at least one word in the N words; and performing query according to the K query conditions and the query target to obtain a query result.
Abstract:
A service recommendation method includes, when a user of a terminal requests a first service from an intelligent assistant, selecting, according to a name of the first service and by using a pre-established service relationship model, a potential service with a degree of relevance to the first service that meets a preset condition from multiple services that the intelligent assistant can provide, where names of the multiple services and degrees of relevance of the multiple services to each other are recorded in the service relationship model; and recommending the potential service to the user.
Abstract:
A service recommendation method includes, when a user of a terminal requests a first service from an intelligent assistant, selecting, according to a name of the first service and by using a pre-established service relationship model, a potential service with a degree of relevance to the first service that meets a preset condition from multiple services that the intelligent assistant can provide, where names of the multiple services and degrees of relevance of the multiple services to each other are recorded in the service relationship model; and recommending the potential service to the user.
Abstract:
A method and a device is described for de-duplicating a web page. The method includes: extracting at least one core sentence from a target web page; mapping each core sentence to a unique numeric value to form a first numeric value set; determining an intersection set of the first numeric value set and each second numeric value set, and the number of numeric values included in each intersection set, and determining a maximum number of numeric values included in each intersection set; and when a ratio of the maximum number to a total number of numeric values in the first numeric value set is greater than a set threshold, processing the target web page as a duplicate web page. In embodiments of the present invention, during web page de-duplication processing, accuracy can be improved, an anti-noise capability can be enhanced, and a calculating scale can be reduced.
Abstract:
The present disclosure provides a method, a device, and a system for packet processing, where the method includes: receiving a protocol packet from a downstream device, where the protocol packet carries uplink port information of the downstream device and the user VLAN of the downstream device; learning port types of the present device based on the uplink port information and classifying the user VLAN of the present device and the user VLAN of the downstream device based on the user VLAN of the downstream device; and forwarding the received packet based on configuration information stored on the present device, where the port type or types to which each VLAN type can be added are specified in the configuration information.