Abstract:
Semantically ranking content in a website (110) with a computerized ranking device (105) includes: parsing content from the website (110) into multiple autonomous content blocks (415-1 to 415-17) with the computerized ranking device (105) and assigning an importance ranking with said computerized ranking device (105) to each of the content blocks (415-1 to 415-17) based on a degree to which a substance of the content block (415-1 to 415-17) is relevant to one of a plurality of predefined categories.
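The ranking step can be illustrated with a minimal sketch. It assumes that relevance is measured as the fraction of a block's words that appear in a category's keyword set. The abstract does not specify the relevance measure, so the scoring rule, the `rank_blocks` name, and the example categories are hypothetical.

```python
def rank_blocks(blocks, categories):
    """Rank content blocks by relevance to their best-matching category.

    Assumption: relevance is the share of a block's words found in a
    category's keyword set. The source does not define the measure.
    """
    ranked = []
    for block in blocks:
        words = block.lower().split()
        best_category, best_score = None, 0.0
        for name, keywords in categories.items():
            score = sum(w in keywords for w in words) / len(words) if words else 0.0
            if score > best_score:
                best_category, best_score = name, score
        ranked.append((block, best_category, best_score))
    # Highest relevance first; sorted() is stable, so ties keep page order.
    return sorted(ranked, key=lambda item: item[2], reverse=True)


# Hypothetical predefined categories and parsed content blocks.
categories = {
    "sports": {"match", "team", "score"},
    "finance": {"stock", "market", "bank"},
}
blocks = ["the team won the match", "contact us", "stock market news"]
ranking = rank_blocks(blocks, categories)
```

A block that matches no category keeps a score of zero and falls to the bottom of the ranking.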
Abstract:
A method and system for extracting Web content is disclosed. In one embodiment, Web content in a Web page is extracted by identifying paragraphs in the Web content based on line-break node determination. A range of text-body associated with the identified paragraphs is then identified using a maximum scoring subsequence. Further, the identified text-body is refined using a heuristic rule of substantially horizontal alignment. Furthermore, one or more titles and one or more images associated with the Web content are extracted. Moreover, the Web content, including the identified paragraphs, the one or more titles and the one or more images, is output.
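The maximum scoring subsequence step can be sketched as follows. The sketch assumes each paragraph already has a numeric score, positive for text-like paragraphs and negative for boilerplate. How the scores are computed is not stated in the abstract, and the function name is hypothetical. The search itself is the standard linear-time scan for the contiguous run with the highest total.

```python
def max_scoring_subsequence(scores):
    """Return (start, end, total) of the contiguous run with the highest total.

    `end` is exclusive. If no run has a positive total, (0, 0, 0) is
    returned, meaning no text body was found.
    """
    best_start, best_end, best_total = 0, 0, 0
    run_start, run_total = 0, 0
    for i, score in enumerate(scores):
        if run_total <= 0:
            # A non-positive prefix can only lower the total, so restart here.
            run_start, run_total = i, score
        else:
            run_total += score
        if run_total > best_total:
            best_start, best_end, best_total = run_start, i + 1, run_total
    return best_start, best_end, best_total


# Hypothetical per-paragraph scores for one page.
paragraph_scores = [-2, 3, 4, -1, 5, -6, 1]
start, end, total = max_scoring_subsequence(paragraph_scores)
```

With these scores the selected text body is paragraphs 1 through 4 (`start == 1`, `end == 5`). The low-scoring paragraph at index 3 is kept because the paragraphs around it outweigh it.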
Abstract:
A keyboard device includes at least one luminous key, at least one light-emitting element, a membrane switch circuit member, an opaque seal structure, and a transparent seal structure. The luminous key has a light-transmissible zone. The light-emitting element is electrically connected with the membrane switch circuit member, and disposed under the light-transmissible zone. A top surface of the light-emitting element is encapsulated by the transparent seal structure. The transparent seal structure is partially surrounded by the opaque seal structure. Consequently, the light beam from the light-emitting element is transmissible through the transparent seal structure, and directed to the light-transmissible zone.
Abstract:
Disclosed is a computer-implemented method of determining similarity between first and second elements of an electronic document. The method uses a computer to calculate a plurality of measures of similarity between the first and second elements in at least two representations of the electronic document. A computer program product and system implementing this method are also disclosed.
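As an illustration, the sketch below computes one similarity measure in each of two representations of a document. It assumes a textual representation (the element's words) and a structural representation (the element's path in the document tree). The abstract names neither the representations nor the measures, so both choices and all function names are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two token collections."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def path_similarity(p, q):
    """Length of the shared path prefix divided by the longer path length."""
    common = 0
    for x, y in zip(p, q):
        if x != y:
            break
        common += 1
    longest = max(len(p), len(q))
    return common / longest if longest else 1.0


def similarity_measures(e1, e2):
    """One measure per representation (assumed: text and tree path)."""
    return {
        "text": jaccard(e1["text"].lower().split(), e2["text"].lower().split()),
        "structure": path_similarity(e1["path"], e2["path"]),
    }


# Two hypothetical elements of the same document.
first = {"text": "Breaking news today", "path": ["html", "body", "div", "p"]}
second = {"text": "news today", "path": ["html", "body", "div", "span"]}
measures = similarity_measures(first, second)
```

The measures are kept separate here because the abstract does not say whether or how they are combined.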
Abstract:
A method for generating a panoramic image that enables a user to obtain panoramic photographs with a digital camera without the aid of a computer system or specialized lenses. A digital camera according to the present techniques captures a series of image frames as a user pans the digital camera through a panoramic image scene and combines the captured image frames while the image frames are being captured.
Abstract:
A method performed by a processing system is provided. The method comprises detecting an artifact in a first frame of a digital video using a plurality of edges identified in the first frame and replacing a region that encompasses the artifact in the first frame with a corresponding region from a second frame.
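The replacement step alone can be sketched as below. The sketch assumes frames are equally sized 2D lists of pixel values and that the artifact's bounding box is already known. Edge-based detection is not shown because the abstract gives no detail on it. The function name and the box convention (exclusive `bottom` and `right`) are hypothetical.

```python
def replace_region(first, second, top, left, bottom, right):
    """Return a copy of `first` whose rows top..bottom-1 and columns
    left..right-1 are taken from `second`.

    Assumes both frames have the same dimensions.
    """
    patched = [row[:] for row in first]  # leave the input frame untouched
    for r in range(top, bottom):
        patched[r][left:right] = second[r][left:right]
    return patched


# Hypothetical 3x3 frames: 9 marks an artifact in the first frame.
frame_a = [[1, 1, 1],
           [1, 9, 9],
           [1, 1, 1]]
frame_b = [[2, 2, 2],
           [2, 2, 2],
           [2, 2, 2]]
repaired = replace_region(frame_a, frame_b, top=1, left=1, bottom=2, right=3)
```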
Abstract:
A method for determining logical components of a portable document format (PDF) document is disclosed. The method includes separating the document into a plurality of layers. A PDF document is created for each of the plurality of layers. The method also includes determining a logical structure for each layer. The logical structures of the plurality of layers are combined to determine the logical components of the PDF document.
Abstract:
A system and method for enhancing digital images containing both text and pictorial content (“mixed document images”) utilizes an estimated illumination surface for a given digital image to correct the undesirable effect of non-uniform illumination. The estimated illumination surface is based on the luminance values of the edge pixels of the given image that are on the dark side of text edges. In an alternative embodiment, the luminance values of the edge pixels that are on the lighter side of the detected text edges are used to generate the estimated illumination surface. The estimated illumination surface is applied to the digital image to compensate for illumination variations in the image due to non-uniform illumination. In addition to non-uniform illumination correction, the system and method enhances the mixed document images by sharpening and/or darkening edges of text contained in the images.
Abstract:
A system and method for segmenting foreground and background regions on a digitized image uses a computer, having a processor and system memory, to segment the image into initial regions and identify background regions from the initial regions. A complete background surface of the image is estimated, and pixels of the image are rectified with the estimated background surface to normalize the image. Normalized pixels are compared with a threshold color to determine a final segmentation of background regions.
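The normalization and thresholding steps can be sketched as below. The sketch makes several assumptions: the image is grayscale rather than color, the background surface has already been estimated, rectification means dividing each pixel by the background value at the same position, and a normalized pixel is background when it reaches the threshold. The abstract does not specify any of these details, and the function name and default threshold are hypothetical.

```python
def segment_background(image, background, threshold=0.9):
    """Return a boolean mask that is True where a pixel is background.

    Assumptions: grayscale input, rectification by division, and a
    scalar threshold standing in for the threshold color.
    """
    mask = []
    for image_row, background_row in zip(image, background):
        mask_row = []
        for pixel, surface in zip(image_row, background_row):
            # Dividing by the local background level cancels uneven lighting.
            normalized = pixel / surface if surface > 0 else 0.0
            mask_row.append(normalized >= threshold)
        mask.append(mask_row)
    return mask


# Hypothetical 2x2 image whose second row is darker because of lighting.
image = [[200, 50],
         [100, 95]]
background = [[200, 200],
              [100, 100]]
mask = segment_background(image, background)
```

In this example the pixel with value 100 counts as background even though it is darker than the foreground pixel's neighbors in the first row, because it matches the local background level.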
Abstract:
A system and method are provided for extracting main content from a web page. Web page segmentation is performed on a web page to provide affinity-grouped segments. Descriptive features of at least one of the affinity-grouped segments are computed. At least one of the affinity-grouped segments is classified as a main body segment based on the computed descriptive features. Additional affinity-grouped segments are classified as to a document function based on the computed descriptive features. Classified affinity-grouped segments are assembled according to their classified document functions to provide the main content.
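The classify-then-assemble flow can be illustrated with a toy sketch. It assumes segmentation has already produced segments with their text and a count of words that sit inside links. The two features, the hand-written thresholds, the function labels, and the assembly order are all hypothetical. The abstract does not say which features or classifier are used.

```python
def describe(segment):
    """Compute two illustrative features for a segment."""
    word_count = len(segment["text"].split())
    link_density = segment["link_words"] / word_count if word_count else 0.0
    return {"word_count": word_count, "link_density": link_density}


def classify(segment):
    """Toy rules standing in for a trained classifier."""
    features = describe(segment)
    if features["link_density"] > 0.5:
        return "navigation"
    if features["word_count"] >= 20:
        return "main_body"
    if features["word_count"] > 0:
        return "title"
    return "other"


def assemble(segments, order=("title", "main_body")):
    """Join the text of segments whose function is listed in `order`."""
    parts = []
    for function in order:
        parts.extend(s["text"] for s in segments if classify(s) == function)
    return "\n".join(parts)


# Hypothetical segments from one page.
body_text = " ".join(["word"] * 25)
segments = [
    {"text": "Home About Contact", "link_words": 3},
    {"text": body_text, "link_words": 1},
    {"text": "Example headline", "link_words": 0},
]
main_content = assemble(segments)
```

The navigation segment is dropped, and the headline is placed before the body regardless of the order in which the segments were found.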