July Data Sharing and Reuse Seminar

Friday, July 14, 2023

Dr. Elizabeth DuPre will present Open Science Enables New Neuroscience: A call for community-driven methods development at the monthly Data Sharing and Reuse Seminar on July 14, 2023, at 12 p.m. EDT.

About the Seminar

In this talk, I will argue for the unique role of community-driven software in modern computational science. In particular, that community-driven methods development provides necessary structure for creating robust and reliable scientific inferences--and that open data availability is critical in supporting these efforts. To advance this argument, I will consider two case studies from my own work on methods for analyzing neuroimaging data. I will conclude with current opportunities and challenges in this space and provide suggestions for how researchers can support community-driven software in their own work.

About the Speaker

Elizabeth DuPre, Ph.D. is a Wu Tsai interdisciplinary postdoctoral research fellow at Stanford University, working between Professor Russ Poldrack and Professor Scott Linderman. As a psychologist and computational neuroscientist, her work focuses on modeling individual brain activity across a range of cognitive states—and assessing the generalizability of these individualized models—by extending statistical methods for human neuroimaging data analysis. Through her work, Dr DuPre has taken an active role developing tools in the open-source Python ecosystem, with a focus on improving the reproducibility of analysis workflows.

About the Seminar Series

The seminar is open to the public and registration is required each month. Individuals who need interpreting services and/or other reasonable accommodations to participate in this event should contact Danielle Johnikin (link sends e-mail) at 301-670-4990. Requests should be made at least five days in advance of the event.

The National Institutes of Health (NIH) Office of Data Science Strategy hosts this seminar series to highlight exemplars of data sharing and reuse on the second Friday of each month at noon ET. The monthly series highlights researchers who have taken existing data and found clever ways to reuse the data or generate new findings. A different NIH institute or center will also share its data science activities each month.

The Generalist Repository Ecosystem Initiative (GREI): First Year Momentum Leads to Exciting Future Plans

Wednesday, May 31, 2023

During the first year of the Generalist Repository Ecosystem Initiative (GREI), the effort has made noteworthy progress in fostering collaboration across the NIH generalist repository landscape. The GREI team has delivered on not only technical capabilities but on community outreach and engagement with a training webinar series, a community workshop, and conference presentations.

The GREI program brings together seven generalist repository awardees (Dataverse, Dryad, Figshare, Mendeley Data, Open Science Framework, Vivli, and Zenodo) to work together in a “coopetition” (competition and cooperation) model of collaboration to reduce the barriers to NIH data sharing, discovery, and reuse. The coopetition effort has organized into functional working groups focused on use cases, metadata and search, metrics, and community engagement with the goals of enhancing interoperability across generalist repositories and supporting the data needs of research communities.

In order to engage the community in GREI’s first year, the repositories held a workshop in January 2023 to examine generalist repositories as a key part of the evolving NIH data sharing landscape from the perspective of researchers, academic, and NIH communities; to offer training about use cases and best practices for data sharing and discovery in generalist repositories; and to gather community feedback to inform future GREI work.

The workshop came at an important time for data scientists. The 2023 NIH Data Management and Sharing Policy (DMSP) had just gone into effect, making it timely for researchers and those supporting them at institutions to consider data sharing practices and resources. Researchers indicated a strong need for more resources specific to the DMSP such as templates for data sharing, guidance on selecting the most appropriate repository for specific data types, and checklists for data sharing workflows and best practices. Workshop attendees also requested that the GREI team focus on developing new ways to give credit for open data sharing and reuse as well as making data discoverable without duplication, including through cataloging and connecting data.

“I’m quite proud of the progress we’ve made in GREI’s first year,” said Ishwar Chandramouliswaran, ODSS team lead for FAIR Data & Resources, who was instrumental in standing up and leading GREI. “We hope to take the momentum from year one and push GREI to new levels in the next year to not only lower barriers for sharing and reuse of NIH funded data, but more importantly to incentivize researchers to understand the value of sharing and data as a scholarly output in of itself.”

Indeed, GREI is engaged in some important work in their second year. They recently published the first iteration of use cases for sharing data and searching for data in each of the GREI repositories – a catalog of use cases that will grow over time. The repositories are currently collaborating to determine a common core metadata schema based on the DataCite schema that each repository will adopt, enhancing interoperability and discoverability of datasets across repositories. This schema is expected to be published in summer 2023 and open for public comment and iteration, as well as for adoption beyond generalist repositories. Similarly, common metadata will also support the collection of enhanced and common metrics of data impact across these repositories, another GREI goal to support tracking the impact of NIH funded research data. Lastly, they are engaging with key audiences including data librarians, academic institutions, and specific biomedical research communities that have reached out such as neuroscience to provide training and outreach via webinars, conferences presentations, and other resources, and to gather community feedback on their data sharing needs for generalist repositories. Community members can engage with GREI via contactgrei@googlegroups.com to ask questions, offer suggestions, and learn about upcoming events and new resources.

I am so happy to see programs like GREI flourishing under the sponsorship of our office. GREI is an excellent example of a program that touches many corners of the biomedical data science world. From bringing together the research community in workshops and trainings to creating the framework for the sharing and reuse of data, GREI is pushing for a more unified NIH. The more we work together like this, the more strides we can make in pushing data science forward and the more patients’ lives we can improve.

Postdoctoral Position in Radiobiology of Glioblastoma

Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD

A postdoctoral position is available to investigate the mechanisms mediating the radioresponse of glioblastoma grown in vitro and in vivo as orthotopic xenografts.  These studies are ultimately aimed at identifying and testing targets for glioblastoma radiosensitization. Numerous druggable targets are available to study.

May Data Sharing and Reuse Seminar

Friday, May 12, 2023

Dr. Avi Ma’ayan will present "The Diabetes Data and Hypothesis Hub (D2H2) and the Playbook Partnership Workflow Builder (PPWB): Bioinformatics Tools for Hypothesis Generation via Data Integration” at the monthly Data Sharing and Reuse Seminar on May 12, 2023, at 12 p.m. EDT.

About the Seminar

This seminar will discuss two new interactive bioinformatics software tools that integrate knowledge from many NIH funded and other key resources for hypothesis generation. The Diabetes Data and Hypothesis Hub (D2H2) is a platform that integrates tools and datasets related to diabetes research into one integrative platform. The Playbook Partnership Workflow Builder (PPWB) is a system that enables novice users to created bioinformatics workflows by exploring a network of distributed but connect microservices. These two new tools will be demonstrated through a variety of use cases with applications that prioritize personalized drugs and targets for kidney disease, diabetes, and pan-cancer.

About the Speaker

Dr. Ma’ayan is a Mount Sinai Endowed Professor in Bioinformatics, Professor in the Department of Pharmacological Sciences, Director of the Mount Sinai Center for Bioinformatics, and a faculty member of the Department of Artificial Intelligence and Human Health at the Icahn School of Medicine at Mount Sinai in New York City. The Ma'ayan Laboratory applies computational methods to study the complexity of regulatory networks in mammalian cells. His research team develops software systems to help experimental biologists form novel hypotheses from high-throughput data with a focus on drug and target discovery through machine learning and other statistical methods.

About the Seminar Series

The seminar is open to the public and registration is required each month. Individuals who need interpreting services and/or other reasonable accommodations to participate in this event should contact Danielle Johnikin at 301-670-4990. Requests should be made at least five days in advance of the event.

The National Institutes of Health (NIH) Office of Data Science Strategy hosts this seminar series to highlight exemplars of data sharing and reuse on the second Friday of each month at noon ET. The monthly series highlights researchers who have taken existing data and found clever ways to reuse the data or generate new findings. A different NIH institute or center will also share its data science activities each month.