FOA Title: 
Request for Information (RFI): Input on Development of a NIH Data Catalog
Grant Type: 
Primary IC: 
Release Date: 
Jun 06 2013
Expiration Date: 
AC Source: 
Request Information RFI): Input Development a NIH Data Catalog Notice Number: NOT-HG-13-011 Key Dates Release Date: June 6, 2013 Response Date: June 25, 2013 Issued National Human Genome Research Institute NHGRI) Purpose Request Information RFI) to solicit comments ideas the development implementation an NIH Data Catalog part the overall Big Data Knowledge BD2K) Initiative. Background Biomedical research becoming data-intensive researchers generating using increasingly large, complex, diverse datasets. era lsquo;Big Data’ biomedical research taxes ability many researchers release, locate, analyze, interact these data associated software due the lack tools, accessibility, training.  response these new challenges biomedical research, in response the recommendations the Data Informatics Working Group DIWG) the Advisory Committee the NIH Director(, NIH launched trans-NIH Big Data Knowledge BD2K) Initiative. long-term goal the BD2K Initiative to support advances data science, quantitative sciences, policy, training are needed the effective of Big Data biomedical research.  term ldquo;biomedical” used here the broadest sense include biological, biomedical, behavioral, social, environmental, clinical studies relate understanding health disease).  term lsquo;Big Data’ refers datasets are increasingly larger, complex, which exceed abilities currently used approaches manage analyze.  ldquo;Big Data” also meant capture opportunities address challenges facing biomedical researchers accessing, managing, analyzing integrating large datasets diverse data types.  Such data types include imaging, phenotypic, molecular including omics), clinical, environmental, behavioral, many types biological biomedical data.  ldquo;Big Data” also includes data generated other purposes e.g. social media, search histories, cell phone data) they repurposed applied address health research questions.  Biomedical Big Data primarily emanate three sources: 1) small number groups produce very large amounts data, usually part projects specifically funded produce important resources use the research community large, large collections electronic health records; 2) individual investigators produce large datasets their own project, which might broadly useful the research community at-large; 3) even greater number investigators each produce small datasets whose value be amplified aggregating integrating with data. of DIWG recommendations to promote data sharing through establishment central federated Data Catalogs. Among issues raised how establish minimal relevant metadata facilitate data sharing, broad adoption standards enhance data retrieval, well data citation adoption the catalog the broader biomedical community. BD2K now considering development a biomedical Data Catalog make biomedical research data findable citable, PubMed does scientific publications.  Such Data Catalog make easier researchers find, share, cite data, well the publications grants they associated with. Data Catalog distinct a data repository, would help data such repositories easily findable citable a consistent manner. addition supplying core, minimal metadata ensure valid data reference, is envisioned a Data Catalog include links to location the data, the NIH Reporter record the grant supported research, relevant publications within PubMed journals, possibly associated software algorithms. NIH BD2K Working Group charged exploring concept a Data Catalog determined it be important query broad mix Data Catalog designers, stakeholders, potential users their experiences advice the NIH it considers development a Data Catalog. order better appreciate issues need be addressed the possible solutions could lead implementation a Data Catalog, NIH thus seeks input the broader research communities.  Establishing such Data Catalog also part NIH’s response the White House Office Science Technology Policy February 2013 memorandum, ldquo;Increasing Access the Results Federally Funded Scientific Research.” Information Requested maximize impact this potentially valuable community resource facilitate use scientists a broad range expertise, seek input a proposal develop Biomedical Data Catalog.  comments include are limited the following categories: area expertise interest a Data Catalog. may include, biomedical researcher, informatics professional, library sciences expert, publisher, professional society, participation another stakeholder community. critical barriers, opportunities, incentives making data easily discoverable citable, the possible impact a Data Catalog. Possible Data Catalog linkage existing data repositories ensure data within repository findable how ensure such linkages remain to date accurate. your research field no existing repositories store data, comments include a Data Catalog might usefully link to data where such data might located. the lack a data repository might affect data discoverability, usability, citability. useful level granularity a Data Catalog entry.  instance, Data Catalog entry correspond all data a publication, only particular data type within given study, individual dataset a single experiment. Any potential requirements Data Catalog registration data NIH-funded supported investigators. Whether Data Catalog entry benefits a scientific abstract describes data, including potential uses the rationale its creation. feasibility the development a Data Catalog potentially support future uses. appropriate metrics use create successful Data Catalog. Submitting Response responses must submitted via email June 25, 2013.  Please include Notice number NOT-HG--13-011 the subject line. Response this RFI voluntary. Responders free address any all the categories listed above. submitted information be reviewed the NIH staff. request for information planning purposes only should be construed a solicitation as obligation the part the Federal Government. NIH does intend make any awards based responses this RFI to otherwise pay the preparation any information submitted for Government's of such information. NIH use information submitted response this RFI its discretion will provide comments any responder’s submission. However, responses the RFI be reflected future funding opportunity announcements. information provided be analyzed may appear reports. Respondents advised the Government under obligation acknowledge receipt the information received provide feedback respondents respect any information submitted.  proprietary, classified, confidential, sensitive information should included your response. Government reserves right use any non-proprietary technical information any resultant solicitation(s). Inquiries Please direct inquiries to: Jennie Larkin, Ph.D. National Heart Lung Blood institute National Institutes Health 6701 Rockledge Dr. Rockledge II, room 8200 Bethesda, MD 20892-7940 Telephone: 301) 435-0513 Email: