-
公开(公告)号:US10572511B2
公开(公告)日:2020-02-25
申请号:US15171168
申请日:2016-06-02
发明人: Arlen Anderson , Kamil Trojan
IPC分类号: G06F16/28 , G06F16/20 , G06F16/33 , G06F16/2453
摘要: Received data records, each including one or more values in one or more fields, are processed to identify a matched data cluster. The processing includes: for selected data records, generating a query from one or more values; identifying one or more candidate data records from the received data records using the query; determining whether or not the selected data record satisfies a cluster membership criterion for at least one candidate data cluster of one or more existing data clusters containing the candidate records; and selecting the matched data cluster from among one or more candidate data clusters based at least in part on a growth criterion for the candidate data clusters, or initializing the matched data cluster with the selected data record if the selected data record does not satisfy a cluster membership criterion for any of the existing data clusters or based on a result of the growth criterion.
-
公开(公告)号:US10558493B2
公开(公告)日:2020-02-11
申请号:US16175133
申请日:2018-10-30
摘要: A method for processing state update requests in a distributed data processing system with a number of processing nodes includes maintaining a number of counters including a working counter indicating a current time interval, a replication counter indicating a time interval for which all requests associated with that time interval are replicated at multiple processing nodes of the number of processing nodes, and a persistence counter indicating a time interval of the number of time intervals for which all requests associated with that time interval are stored in persistent storage. The counters are used to manage processing of the state update requests.
-
公开(公告)号:US10528395B2
公开(公告)日:2020-01-07
申请号:US15873095
申请日:2018-01-17
发明人: Tim Wakeling , Mark Buxbaum , Mark Staknis
摘要: Managing task execution includes: receiving a specification of a plurality of tasks to be performed by respective functional modules; processing a flow of input data using a dataflow graph that includes nodes representing data processing components connected by links representing flows of data between data processing components; in response to at least one flow of data provided by at least one data processing component, generating a flow of messages; and in response to each of the messages in the flow of messages, performing an iteration of a set of one or more tasks using one or more corresponding functional modules.
-
公开(公告)号:US10423395B2
公开(公告)日:2019-09-24
申请号:US16042205
申请日:2018-07-23
发明人: Craig W. Stanfill , Richard Shapiro
摘要: A received graph-based program specification includes: a plurality of components, each corresponding to at least one operation; and a plurality of directed links each connecting an upstream component to a downstream component. Processed code is generated representing one or more groups of operations by: identifying a possible level of concurrency in a first group of operations based at least in part on a topology of the graph, such that multiple operations in the first group are not prevented by the topology of the graph from executing concurrently; analyzing at least some of the operations in the first group to determine runtime characteristics associated with the analyzed operations; and generating processed code for executing the operations, where the processed code enforces a reduced level of concurrency in the first group, lower than the identified possible level of concurrency, based at least in part on the determined runtime characteristics.
-
公开(公告)号:US10210236B2
公开(公告)日:2019-02-19
申请号:US14949391
申请日:2015-11-23
发明人: Roy Procops , David Trahan
摘要: Among other things, we describe a technique for storing data of a data cube in one or more flat files. We also describe a technique for processing a query to access data of a data cube. These techniques can be implemented in a number of ways, including as a method, system, and/or computer program product stored on a computer readable storage device. One of the techniques includes receiving a set of data records having at least two dimensions, generating a set of grouped data records ordered by cardinality, and generating and storing at least one flat file containing the set of grouped data records, wherein a particular data record of the grouped data records includes a primary key that can be used to identify data of the particular data record in response to a request.
-
公开(公告)号:US10191924B2
公开(公告)日:2019-01-29
申请号:US14886541
申请日:2015-10-19
发明人: Scott Studer , Amit Weisman , David Phillimore
摘要: A computing system processes data units using one of at least two different modes of applying a rule. In a first mode, data units are received in a particular order and are processed including writing an updated value to at least one state variable based on a result of applying the rule to the data unit. In a second mode, a selection of particular data units is processed including determining a first set of data units including an ordered subset of data units that occur before the particular data unit from the number of data units, prior to applying the rule to the particular data unit, updating at least one state variable to a state that would result from processing the first set of data units in the first mode, and applying the rule to the particular data unit including reading the updated value of the state variable.
-
公开(公告)号:US10191863B2
公开(公告)日:2019-01-29
申请号:US14658440
申请日:2015-03-16
发明人: Jed Roberts , Craig W. Stanfill , Scott Studer
摘要: One or more mappings each define a correspondence between input attributes of an input entity and output attributes of an output entity, where the input out output entities each include one or more key attributes identified as part of a unique key. Computing result information, displayed in a user interface, includes: processing instances of a first input entity to generate instances of a first output entity; determining one or more mapped input attributes of the first input entity that correspond to each of the key attributes of the first output entity; generating the instances of the first output entity based on the determined one or more mapped input attributes; computing a total number of instances of the first input entity that were processed; and computing a total number of instances of the first output entity that were generated.
-
公开(公告)号:US09984482B2
公开(公告)日:2018-05-29
申请号:US13974537
申请日:2013-08-23
发明人: Erik Bator , Ilya Rozenberg , Daniell Stevens , Dan Teven , Fredric M. White
IPC分类号: G06T11/20
CPC分类号: G06T11/206 , G06T2200/24
摘要: In a general aspect, a method includes displaying a first component of a graph on a user interface, including displaying at least one port of the first component; responsive to a user interaction with the displayed first component, displaying a representation of one or more ports of the first component that is distinct from the display of the first component; and enabling the user to generate a connection between the representation of a particular one of the ports of the first component and a port of a second component of the graph.
-
公开(公告)号:US09984059B2
公开(公告)日:2018-05-29
申请号:US14495951
申请日:2014-09-25
发明人: Scott Studer , David Phillimore , Amit Weisman
IPC分类号: G06F17/00 , G06F17/24 , G06F17/30 , G06F3/0482 , G06F3/0484 , G06F3/0489
CPC分类号: G06F17/246 , G06F3/0482 , G06F3/04842 , G06F3/04895 , G06F17/243 , G06F17/245 , G06F17/30507
摘要: Specifying rules for processing data included in fields of elements of a dataset includes rendering user interface elements associated with a respective condition. The user interface elements include: first subsets of user interface elements, at least some of which are associated with an input value derived from at least one field, and second subsets of user interface elements, each configured to receive user input associated with a respective condition. Conditions are applied to at least a first element of the dataset based on user input received from at least some of the user interface elements, in response to receiving user input for a first user interface element associated with a first field. Instructions are generated for applying one or more selected conditions associated with fewer than all of the user interface elements, the selected conditions including at least a condition associated with the first user interface element.
-
公开(公告)号:US09886319B2
公开(公告)日:2018-02-06
申请号:US12704998
申请日:2010-02-12
申请人: Tim Wakeling , Mark Buxbaum , Mark Staknis
发明人: Tim Wakeling , Mark Buxbaum , Mark Staknis
CPC分类号: G06F9/5038 , G06F2209/506
摘要: Managing task execution includes: receiving a specification of a plurality of tasks to be performed by respective functional modules; processing a flow of input data using a dataflow graph that includes nodes representing data processing components connected by links representing flows of data between data processing components; in response to at least one flow of data provided by at least one data processing component, generating a flow of messages; and in response to each of the messages in the flow of messages, performing an iteration of a set of one or more tasks using one or more corresponding functional modules.
-
-
-
-
-
-
-
-
-