摘要:
In some implementations, text is extracted from a digital work and a plurality of noun phrases are identified. The noun phrases are checked against a network accessible resource, such as an online encyclopedia, that includes a plurality of interlinked article entries. The noun phrases that have corresponding entries in the network accessible resource are included in a set of candidate topics. The candidate topics are ranked based, at least in part, on the links to and from each of the entries corresponding to the candidate topics. Candidate topics below a ranking threshold are removed from the set of candidate topics. Further, term frequency information for each candidate topic in relation to the digital work is compared against term frequency information for the candidate topic in a large corpus of textual works to remove candidate topics within a frequency difference threshold.
摘要:
In some implementations, text is extracted from a digital work and proper nouns are identified in the text to generate a list of names. The list of names may be sorted so that names containing more information are positioned toward the beginning of the list. The list may be traversed to cluster names and alternate names into name sets that correspond to particular entities in the digital work. Non-unique names that appear in more than one name set may be disambiguated based on proximity to unique names in the same name sets to determine which occurrences of the non-unique names belong with which name sets. Furthermore, a representative name may be selected from among multiple names in a name set for use in representing an entity or object corresponding to the name set. In some examples, the representative name may be selected based on a fullness of the name.
摘要:
In some implementations, a device displays a user interface that provides supplemental information in connection with a digital work. For example, the supplemental information may include a listing of objects identified in the digital work. Further, a visual representation may be displayed with each listed object. The visual representation for each listed object may provide a representation of at least one location of at least one occurrence of the object in the digital work. The objects may be displayed according to a supplemental information view, a page view, a chapter view, a book view, a series view, a library view, or the like. Additionally, one or more object buttons may be displayed concurrently with the listing of objects. The object buttons may correspond to the types of objects displayed, and may be selected to limit the displayed objects to a particular type.
摘要:
In some implementations, a device displays supplemental information in connection with a digital work. The supplemental information may include a visual representation that represents one or more occurrences of an object in the digital work. The visual representation may include an area representative of an expanse of the digital work. At least one marking is located in the area in correlation to a location of an occurrence of the object in the digital work. In some examples, the visual representation may include a plurality of markings representing multiple occurrences of the object in the digital work, with a first or leftmost marking positioned in the area in proportion to a first occurrence of the object in the digital work. A second or rightmost marking may be positioned in the area in proportion to a final occurrence of the object in the digital work.
摘要:
In some implementations, a digital work provider provides a digital work and supplemental information related to the digital work for delivery to an electronic device. For example, the digital work provider may parse a digital work to identify objects in the digital work. The digital work provider may generate supplemental information for the digital work based on the objects. For example, the supplemental information may include an index identifying locations of occurrences of the objects identified in the digital work. The supplemental information may further include prestored content related to one or more of the objects. For instance, the digital work provider may obtain the prestored content from one or more authoritative network resources. The electronic device may display the supplemental information in response to a user selection of an object in the digital work during display of the digital work.
摘要:
In some implementations, a device displays supplemental information in connection with a digital work. The supplemental information may include a visual representation that represents one or more occurrences of an object in the digital work. The visual representation may include an area representative of an expanse of the digital work. At least one marking is located in the area in correlation to a location of an occurrence of the object in the digital work. In some examples, the visual representation may include a plurality of markings representing multiple occurrences of the object in the digital work, with a first or leftmost marking positioned in the area in proportion to a first occurrence of the object in the digital work. A second or rightmost marking may be positioned in the area in proportion to a final occurrence of the object in the digital work.
摘要:
A meaning of a term is determined using the contents of a corpus of books through use of metadata about the books within the corpus, terms in the same work which provide context, and so forth. Users may query to determine the meaning of a term. Users may also build vocabulary skills by testing as well. A changing meaning of a term over time may be determined and utilized as well. Searches are facilitated by the enhanced ability to determine meaning of the terms, particularly in context. Feedback from the searches may also be used to refine future searches.
摘要:
Named entity recognition is applied to identify text strings corresponding to character identities in a written work. The textual strings are grouped according to character identity and, from each group, a primary name is selected. A significance is calculated for each of the character identities. The character identities including the primary names are presented in a catalog based on the calculated significance. In some embodiments, character identity identification results are refined by allowing users to vote regarding the significance of the character identities and by granting more weight to the votes of users with a close relationship to the written work.
摘要:
A method is provided for presenting a written work. A character identity is recognized within a written work. Presentation information for the written work, such as a graphical scheme or an electronic voice, is determined based on the character identity. The presentation information is provided to a user computing device. The user computing device renders the written work or a portion thereof using the presentation information.