Abstract:
A multi-tenant system stores a hierarchy of machine-learned models, wherein each machine-learned model is configured to receive as input a set of search results and generate as output scores for ranking the set of search results. Each machine-learned model is associated with a set of dimensions. The system evaluates search query performance. Performance below a threshold causes a new model to be generated and added to the hierarchy of models. Upon execution of a new search query associated with the same set of dimensions as the newly created model, the new model is used to rank that search query's search results.
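One possible reading of the model hierarchy is sketched below. It is a minimal illustration, not the claimed system: the `ModelHierarchy` class, the dimension names (`tenant`, `locale`), the "most specific match wins" selection rule, and the `train_model` callback are all assumptions introduced for the example.

```python
class ModelHierarchy:
    """Hypothetical hierarchy of ranking models keyed by dimension sets."""

    def __init__(self, default_model):
        # The root model has no dimensions, so it matches every query.
        self.models = {frozenset(): default_model}

    def add(self, dimensions, model):
        self.models[frozenset(dimensions.items())] = model

    def select(self, query_dimensions):
        # Assumed rule: use the model whose dimensions are all present in
        # the query and which names the most dimensions.
        query = set(query_dimensions.items())
        matching = [dims for dims in self.models if dims <= query]
        return self.models[max(matching, key=len)]

    def rank(self, query_dimensions, results):
        # A model maps a list of results to a list of scores.
        scores = self.select(query_dimensions)(results)
        order = sorted(range(len(results)), key=lambda i: scores[i], reverse=True)
        return [results[i] for i in order]

    def evaluate(self, dimensions, performance, threshold, train_model):
        # Performance below the threshold triggers a new model for these dimensions.
        if performance < threshold:
            self.add(dimensions, train_model(dimensions))


def longest_first(results):
    return [len(r) for r in results]


def shortest_first(results):
    return [-len(r) for r in results]


hierarchy = ModelHierarchy(longest_first)
query_dims = {"tenant": "acme", "locale": "en"}
before = hierarchy.rank(query_dims, ["aa", "a", "aaa"])
hierarchy.evaluate({"tenant": "acme"}, performance=0.2, threshold=0.5,
                   train_model=lambda dims: shortest_first)
after = hierarchy.rank(query_dims, ["aa", "a", "aaa"])
other = hierarchy.rank({"tenant": "globex"}, ["aa", "a", "aaa"])
```

After the low-performing evaluation, queries that carry the `tenant=acme` dimension are ranked by the newly added model, while queries from other tenants still fall back to the root model.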
Abstract:
Embodiments herein provide a method for imputing missing data in a sensor data sequence based on semantics learning, where the semantics are defined by the constraints of the sensor data features. A candidate value for imputation is determined from the sensor data at corresponding time instances of the sensor data sequence, using learning based on the semantics of the features of the sensor data sequence with missing data. A nearest-neighbor search is applied to a similar response data sequence, using the data values that correspond to the time instant of the missing data in the sensor data sequence. If no similar response data sequence is available, imputation is performed based on the distribution pattern of the missing data.
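The flow described above might look roughly like the following sketch. Everything concrete here is an assumption made for illustration: the `impute` function, the mean absolute difference used to judge similarity, the `max_distance` cutoff, the `lower`/`upper` bounds standing in for the feature constraints, and the mean of observed values standing in for the distribution-based fallback.

```python
from statistics import mean


def impute(target, references, lower, upper, max_distance):
    """Fill None entries in `target` (hypothetical helper, see assumptions above)."""
    observed = [i for i, v in enumerate(target) if v is not None]
    if not observed:
        raise ValueError("need at least one observed value")

    def distance(ref):
        # Mean absolute difference over the instants where the target is observed.
        return mean(abs(ref[i] - target[i]) for i in observed)

    # Keep only reference sequences that respond similarly to the target.
    similar = [ref for ref in references if distance(ref) <= max_distance]
    nearest = min(similar, key=distance) if similar else None
    fallback = mean(target[i] for i in observed)

    result = []
    for t, value in enumerate(target):
        if value is None:
            # Candidate from the nearest similar sequence at the same instant,
            # otherwise from the distribution of the observed values.
            candidate = nearest[t] if nearest is not None else fallback
            # Enforce the assumed feature constraint (valid range).
            value = min(max(candidate, lower), upper)
        result.append(value)
    return result


sequence = [1.0, None, 3.0, None]
neighbours = [[1.1, 2.0, 3.1, 4.2], [5.0, 5.0, 5.0, 5.0]]
with_reference = impute(sequence, neighbours, lower=0.0, upper=4.0, max_distance=0.5)
without_reference = impute(sequence, [], lower=0.0, upper=4.0, max_distance=0.5)
```

In the first call the second gap takes the neighbour's value 4.2 and is clipped to the upper bound; in the second call no similar sequence exists, so both gaps receive the mean of the observed values.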
Abstract:
Systems and methods perform various optimizations of an LLCS algorithm for use in determining whether sequences in a set of input sequences are similar to a query sequence. The optimizations include filtering out sequences from the set of input sequences whose estimated similarity with the query sequence is below a threshold value. The remaining sequences can then be provided to an LLCS algorithm, and the output of the LLCS algorithm is used in a similarity function to determine the actual similarity of an input sequence with the query sequence.
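A minimal sketch of the filter-then-compute idea follows, reading LLCS as the length of the longest common subsequence (the abstract does not expand the acronym). The estimate used here, an upper bound from shared symbol counts, and the similarity function `2 * LLCS / (len(a) + len(b))` are assumptions chosen for illustration, not details taken from the source.

```python
from collections import Counter


def llcs_upper_bound(a, b):
    # A common subsequence cannot use a symbol more often than both inputs contain it.
    return sum((Counter(a) & Counter(b)).values())


def llcs(a, b):
    # Standard dynamic program for the longest common subsequence length,
    # keeping only one previous row.
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0]
        for j, y in enumerate(b, 1):
            cur.append(prev[j - 1] + 1 if x == y else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def similarity(length, a, b):
    # Assumed similarity function; 1.0 means identical sequences.
    return 2 * length / (len(a) + len(b)) if (a or b) else 1.0


def similar_sequences(query, candidates, threshold):
    results = []
    for seq in candidates:
        # Cheap filter: skip the quadratic LLCS when even the bound is too low.
        if similarity(llcs_upper_bound(query, seq), query, seq) < threshold:
            continue
        score = similarity(llcs(query, seq), query, seq)
        if score >= threshold:
            results.append((seq, score))
    return results


matches = similar_sequences("ABCBDAB", ["BDCABA", "XYZ", "ABCBDAB"], threshold=0.6)
```

Because the estimate never underestimates the true LLCS, the filter cannot discard a sequence that would have passed the exact check.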
Abstract:
Sorting an array consisting of a large number of elements. The present invention provides an apparatus for executing a multiway merging process which generates one output sequence from N input sequences on an array consisting of a large number of elements. The apparatus includes: an input sequence production unit configured to produce an input sequence by pairing, for each element or each sorted array of elements, a key taken from the element for use in comparisons during sorting with an index identifying the element; an execution unit configured to execute the multiway merging process on the N input sequences without rearranging the elements from which the input sequences were produced; and a generation unit configured to rearrange the elements constituting the input sequences according to the output sequence generated by the multiway merging process in the execution unit, so as to generate a sorted array of elements.
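The key-and-index pairing can be sketched in a few lines. This is an illustration under assumptions rather than the claimed apparatus: the function name `merge_sorted_runs` is hypothetical, the input is assumed to arrive as N already-sorted runs, and the standard-library `heapq.merge` stands in for the multiway merging process.

```python
import heapq


def merge_sorted_runs(runs, key):
    """Merge sorted runs by moving only (key, index) pairs during the merge."""
    # The elements themselves stay where they are in this flat array.
    elements = [elem for run in runs for elem in run]

    # Input sequence production: one sequence of (key, index) pairs per run.
    pair_runs = []
    offset = 0
    for run in runs:
        pair_runs.append([(key(elem), offset + j) for j, elem in enumerate(run)])
        offset += len(run)

    # Multiway merge over the lightweight pairs only. Indices are unique,
    # so ties on the key are broken by index and elements are never compared.
    merged_pairs = heapq.merge(*pair_runs)

    # Generation: rearrange the elements once, following the merged order.
    return [elements[index] for _, index in merged_pairs]


runs = [
    [("b", 2), ("d", 4)],
    [("a", 1), ("c", 3)],
    [("e", 5)],
]
merged = merge_sorted_runs(runs, key=lambda record: record[1])
```

Moving small pairs instead of whole records keeps the merge loop cheap when the elements are large, and each element is copied only once at the end.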
Abstract:
A documentation parser (20) identifies text in documentation (1) on computer processes and performs analysis (21-25) on the structure and content of the documentation to determine automatically how to invoke services. The system then compiles a process using the invoked services and runs tests (30) using the process to determine whether it functions as intended. This reduces the degree of human intervention (41) required to generate the complete program (50, 51).
Abstract:
A lowest common ancestor of a first data sequence and a second data sequence is determined. Based on the lowest common ancestor, symbols that differ between the first data sequence and the second data sequence are identified. A distance between the first data sequence and the second data sequence is determined based on the symbols.
Abstract:
An information processing apparatus includes a selection unit selecting at least a part of a text included in contents, an acquisition unit acquiring a processing result of natural language processing for the part of the text selected by the selection unit, a specifying unit specifying a predetermined part of the text based on the processing result acquired by the acquisition unit, a detection unit detecting a keyword from the predetermined part of the text based on the processing result acquired by the acquisition unit, a tag generation unit automatically generating a tag in accordance with the keyword detected by the detection unit, and an association unit associating the tag generated by the tag generation unit with the predetermined part of the text.
Abstract:
According to some embodiments, bin data may be input and, based on a portion of the bin data, an entry in a search table may be determined. An indication of whether the search is complete may then be read from the search table along with at least one of: (i) a base symbol value or (ii) information about a next node. If the search is not complete, the process may continue to determine entries in the search table based on the information about the next node and additional portions of the bin data. When the search is complete, a decoded symbol may be calculated based on the last base symbol value and a remaining portion of the bin data associated with an extra bin length read from the search table.
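The table walk described above can be sketched as follows. The table layout, the one-bit step size, the `Entry` fields, and the toy code in `TABLE` are all assumptions made up for this example; the abstract does not specify any of them.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    complete: bool       # True when the search ends at this entry
    base: int = 0        # base symbol value (meaningful when complete)
    extra_len: int = 0   # number of extra bins that follow (when complete)
    next_node: int = 0   # node to continue from (when not complete)


BITS_PER_STEP = 1  # assumed portion of bin data consumed per table lookup

# Hypothetical search table: node -> entries indexed by the bits just read.
TABLE = {
    0: [Entry(True, base=0), Entry(False, next_node=1)],
    1: [Entry(True, base=1, extra_len=1), Entry(True, base=3, extra_len=2)],
}


def decode_symbol(bits, pos=0):
    """Decode one symbol from a string of '0'/'1' bins; return (symbol, new_pos)."""
    node = 0
    while True:
        index = int(bits[pos:pos + BITS_PER_STEP], 2)
        pos += BITS_PER_STEP
        entry = TABLE[node][index]
        if entry.complete:
            break
        node = entry.next_node
    # The remaining bins, as many as the extra bin length, are added to the base.
    extra = int(bits[pos:pos + entry.extra_len], 2) if entry.extra_len else 0
    return entry.base + extra, pos + entry.extra_len


first = decode_symbol("0")
second = decode_symbol("101")
third = decode_symbol("1110")
```

With this toy table, the prefix `0` decodes directly to symbol 0, the prefix `10` selects base 1 plus one extra bin, and the prefix `11` selects base 3 plus two extra bins.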
Abstract:
The present invention provides a high-speed search method that decodes only the necessary portion of data compressed and encoded by the block-sorting lossless compression method, without decoding all of the encoded data. Pairs of a current sorting position number and a previous sorting position number are determined for the BW-transformed rows and the rows sorted in lexicographic order in the data compressed by the block-sorting lossless compression method. The data is decoded based on these pairs while being matched against the search character string, so only the data required for the search is decoded. The pairs of current sorting position number and previous sorting position number in the block-sorting lossless compression method are encoded directly.
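To illustrate searching without full decoding, the sketch below uses the standard backward search over a Burrows-Wheeler transform, which may differ from the claimed method. The position pairs are modelled as the usual last-to-first mapping (`lf[i]` gives the sorted row in which the last-column symbol of row `i` appears as the first symbol). The function names and the `\0` sentinel are assumptions for the example.

```python
def bwt(text, sentinel="\0"):
    # Last column of the sorted rotations of text + sentinel (naive construction).
    s = text + sentinel
    rotations = sorted(range(len(s)), key=lambda i: s[i:] + s[:i])
    return "".join(s[i - 1] for i in rotations)


def lf_mapping(last):
    # A stable sort of last-column positions yields the first-column order,
    # so equal symbols keep their relative rank in both columns.
    order = sorted(range(len(last)), key=lambda i: last[i])
    lf = [0] * len(last)
    for first_pos, last_pos in enumerate(order):
        lf[last_pos] = first_pos
    return lf


def count_occurrences(last, pattern):
    """Count matches of `pattern` by stepping backwards through position pairs."""
    if not pattern:
        return 0
    first = sorted(last)
    lf = lf_mapping(last)
    # Rows whose rotation starts with the final pattern symbol.
    rows = [i for i, c in enumerate(first) if c == pattern[-1]]
    for ch in reversed(pattern[:-1]):
        # Keep rows preceded by `ch` and move to the rotation one symbol earlier.
        rows = [lf[i] for i in rows if last[i] == ch]
    return len(rows)


encoded = bwt("banana")
hits = count_occurrences(encoded, "ana")
```

Only as many backward steps as the pattern has symbols are taken for each candidate row, so the original text is never reconstructed in full.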