Abstract:
A method and system of de-identification of a record (100) are provided. The method includes creating a vector of identification field values (201) of a record (100), searching unstructured data (205) of the record (100) for each identification field value of the vector (201), and de-identifying the identification field values (230) of the record (100). The step of creating a vector of identification field values (201) extracts the values from one or more structured portions (101) of the record (100). An action (202) is defined for each identification field to de-identify the identification field. The method may include defining a mapping (203) of unstructured portions (111, 112, 113, 114) of the record (100), and extracting the unstructured portions (111, 112, 113, 114) of the record (100), wherein the steps of searching and de-identifying are carried out on the extracted unstructured portions (205).
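The flow described above can be illustrated with a minimal sketch. It is not the patented implementation. The record layout (a dict with `structured` and `unstructured` sections), the function name `deidentify`, and the use of case-insensitive literal matching are all assumptions made for illustration.

```python
import re
from typing import Callable, Dict, List

# Hypothetical type: an action maps an identifying value to its replacement.
Action = Callable[[str], str]

def deidentify(record: dict, actions: Dict[str, Action],
               unstructured_map: List[str]) -> dict:
    """Return a de-identified copy of `record` (assumed layout, see lead-in)."""
    structured = record["structured"]
    # Step 1: build the vector of identification field values from the
    # structured portion, one entry per field that has a defined action.
    vector = {f: str(structured[f]) for f in actions if structured.get(f)}

    out = {"structured": dict(structured),
           "unstructured": dict(record["unstructured"])}

    # Step 2: apply each field's action to the structured field itself.
    replacements = {f: actions[f](v) for f, v in vector.items()}
    for f, repl in replacements.items():
        out["structured"][f] = repl

    # Step 3: search only the mapped unstructured portions for every value
    # in the vector, longest value first so substrings do not clobber
    # longer matches.
    ordered = sorted(vector.items(), key=lambda kv: len(kv[1]), reverse=True)
    for key in unstructured_map:
        text = out["unstructured"].get(key, "")
        for f, value in ordered:
            # A function replacement avoids backslash-escape surprises.
            text = re.sub(re.escape(value),
                          lambda m, r=replacements[f]: r,
                          text, flags=re.IGNORECASE)
        out["unstructured"][key] = text
    return out
```

Taking the search terms from the record's own structured fields means the free-text search looks for values known to belong to that record, rather than relying on generic patterns for names or identifiers.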
Abstract:
A system, method and program product for managing reference data. A data access program receives data retrieval policies and retrieves reference data from remote sources in accordance with those policies. A research application assists in generating a conclusion based on the retrieved reference data. A local data system stores the reference data retrieved by the data access program and associates the conclusion with that reference data. In response to retrieval of updates to the reference data, the local data system records that the conclusion is based on stale reference data, and can notify an entity responsible for the conclusion of this. Optionally, the local data system can notify the research application to process the updated reference data to assist in generating a new conclusion or validating the first conclusion, as the case may be.
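The staleness tracking described above can be sketched as follows. This is an illustrative sketch only. The class names `Conclusion` and `LocalDataSystem`, the use of version numbers to detect updates, and the in-memory notification list are all hypothetical and do not come from the abstract.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple

@dataclass
class Conclusion:
    name: str
    owner: str            # entity responsible for the conclusion
    sources: Set[str]     # keys of the reference data it was based on
    stale: bool = False

class LocalDataSystem:
    """Stores reference data and links conclusions to it (assumed design)."""

    def __init__(self) -> None:
        self.reference: Dict[str, Tuple[int, str]] = {}   # key -> (version, payload)
        self.conclusions: List[Conclusion] = []
        self.notifications: List[Tuple[str, str]] = []    # (owner, message)

    def record_conclusion(self, conclusion: Conclusion) -> None:
        self.conclusions.append(conclusion)

    def store(self, key: str, version: int, payload: str) -> None:
        previous = self.reference.get(key)
        self.reference[key] = (version, payload)
        # An update to already-stored data invalidates dependent conclusions.
        if previous is not None and previous[0] != version:
            self._mark_stale(key)

    def _mark_stale(self, key: str) -> None:
        for c in self.conclusions:
            if key in c.sources and not c.stale:
                c.stale = True
                self.notifications.append(
                    (c.owner, f"{c.name} is based on stale reference data: {key}"))
```

A fuller version could also call back into the research application so that the conclusion is regenerated or revalidated against the updated data, which the abstract describes as optional.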
Abstract:
A distributed computing system provides a data management system in communication with a software management system and enables software applications to move to the data to be processed in a distributed computing environment. The software management system stores a plurality of computer-executable software applications and is in communication with a user system for receiving a selection of one of the software applications. The software management system generates an identifier for the selected software application and provides it to the data management system, which then obtains the selected software application from the software management system based on the identifier. The obtained software application is then executed with data from a data storage system using resources from a resource system. The resource system includes a plurality of resources and a management mechanism for managing assignment of the resources to support execution of the software application.
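The identifier handoff described above can be sketched in a few lines. This is a toy, single-process illustration. All class and method names are hypothetical, applications are modelled as plain Python callables, and the identifier is assumed to be a random UUID.

```python
import uuid
from typing import Any, Callable, Dict, List

class SoftwareManagementSystem:
    """Stores applications and issues identifiers for selected ones."""
    def __init__(self, apps: Dict[str, Callable[[Any], Any]]) -> None:
        self._apps = dict(apps)
        self._selected: Dict[str, str] = {}   # identifier -> application name

    def select(self, name: str) -> str:
        if name not in self._apps:
            raise KeyError(name)
        ident = uuid.uuid4().hex
        self._selected[ident] = name
        return ident

    def fetch(self, ident: str) -> Callable[[Any], Any]:
        return self._apps[self._selected[ident]]

class ResourceSystem:
    """Manages assignment of a fixed pool of resources."""
    def __init__(self, resources: List[str]) -> None:
        self._free = list(resources)

    def acquire(self) -> str:
        if not self._free:
            raise RuntimeError("no free resources")
        return self._free.pop()

    def release(self, resource: str) -> None:
        self._free.append(resource)

class DataManagementSystem:
    """Pulls the application to where the data lives and runs it there."""
    def __init__(self, sms: SoftwareManagementSystem,
                 storage: Dict[str, Any], resources: ResourceSystem) -> None:
        self.sms, self.storage, self.resources = sms, storage, resources

    def run(self, ident: str, dataset: str) -> Any:
        app = self.sms.fetch(ident)           # obtain application by identifier
        resource = self.resources.acquire()   # assigned for this execution
        try:
            return app(self.storage[dataset])
        finally:
            self.resources.release(resource)
```

The point of the handoff is that only the identifier crosses from the user side to the data side. The data management system then fetches the application itself and runs it next to the data.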
Abstract:
A system, method and program product for managing data for researchers. A research data server receives and manages experimental data, research data and results from the researchers, and operates with a virtual storage device to maintain them. A reference data access server receives and manages external reference data relating to the research and operates with the virtual storage device to maintain the external reference data. Computational resources allow researchers to capture, process and analyze experimental data to obtain results. A research data network connects the virtual storage device, the research data server, the reference data access server and the computational resources to allow transfer of data between them. Security management services authenticate the researchers and authorize their access to the system.