June Data Sharing and Reuse Seminar

Friday, June 9, 2023

Dr. Nara Sobreira will present "PhenoDB, a phenotypic and genotypic data sharing tool" at the monthly Data Sharing and Reuse Seminar on June 9, 2023, at 12 p.m. EST.

About the Seminar

This seminar will discuss the use of phenotypic and genotypic data sharing tools to facilitate the discovery of novel disease-causing genes and variants.

About the Speaker

Dr. Sobreira, an associate professor at the McKusick-Nathans Department of Genetic Medicine at Johns Hopkins University, will present PhenoDB. Her research focuses on the identification of the genetic etiology of rare Mendelian diseases. To facilitate this work, she has development public genetic databases and genetic analytical tools that are highly valuable and widely used to promote data sharing, disease gene identification and facilitate collaborations. She participated in the development of PhenoDB and developed the PhenoDB analysis module that is in use around the world. Dr. Sobreira is one of the creators of GeneMatcher, the most widely used data sharing platform of rare Mendelian diseases. In addition, she has created VariantMatcher, for the sharing of gene variant information.

Dr. Sobreira received her medical degree from the Pernambuco University’s School of Medicine in 2003.  She completed a clinical genetics residency in Sao Paulo – Brazil (UNIFESP) before joining the Human Genetics graduate program at Johns Hopkins (2007 to 2012).  This was followed by a one-year postdoc and a two-year clinical genetics fellowship also at Johns Hopkins School of Medicine. 

About the Seminar Series

The seminar is open to the public and registration is required each month. Individuals who need interpreting services and/or other reasonable accommodations to participate in this event should contact Danielle Johnikin (link sends e-mail) at 301-670-4990. Requests should be made at least five days in advance of the event.

The National Institutes of Health (NIH) Office of Data Science Strategy hosts this seminar series to highlight exemplars of data sharing and reuse on the second Friday of each month at noon ET. The monthly series highlights researchers who have taken existing data and found clever ways to reuse the data or generate new findings. A different NIH institute or center will also share its data science activities each month.

July Data Sharing and Reuse Seminar

Friday, July 14, 2023

Dr. Elizabeth DuPre will present Open Science Enables New Neuroscience: A call for community-driven methods development at the monthly Data Sharing and Reuse Seminar on July 14, 2023, at 12 p.m. EDT.

About the Seminar

In this talk, I will argue for the unique role of community-driven software in modern computational science. In particular, that community-driven methods development provides necessary structure for creating robust and reliable scientific inferences--and that open data availability is critical in supporting these efforts. To advance this argument, I will consider two case studies from my own work on methods for analyzing neuroimaging data. I will conclude with current opportunities and challenges in this space and provide suggestions for how researchers can support community-driven software in their own work.

About the Speaker

Elizabeth DuPre, Ph.D. is a Wu Tsai interdisciplinary postdoctoral research fellow at Stanford University, working between Professor Russ Poldrack and Professor Scott Linderman. As a psychologist and computational neuroscientist, her work focuses on modeling individual brain activity across a range of cognitive states—and assessing the generalizability of these individualized models—by extending statistical methods for human neuroimaging data analysis. Through her work, Dr DuPre has taken an active role developing tools in the open-source Python ecosystem, with a focus on improving the reproducibility of analysis workflows.

About the Seminar Series

The seminar is open to the public and registration is required each month. Individuals who need interpreting services and/or other reasonable accommodations to participate in this event should contact Danielle Johnikin (link sends e-mail) at 301-670-4990. Requests should be made at least five days in advance of the event.

The National Institutes of Health (NIH) Office of Data Science Strategy hosts this seminar series to highlight exemplars of data sharing and reuse on the second Friday of each month at noon ET. The monthly series highlights researchers who have taken existing data and found clever ways to reuse the data or generate new findings. A different NIH institute or center will also share its data science activities each month.

The Generalist Repository Ecosystem Initiative (GREI): First Year Momentum Leads to Exciting Future Plans

Wednesday, May 31, 2023

During the first year of the Generalist Repository Ecosystem Initiative (GREI), the effort has made noteworthy progress in fostering collaboration across the NIH generalist repository landscape. The GREI team has delivered on not only technical capabilities but on community outreach and engagement with a training webinar series, a community workshop, and conference presentations.

The GREI program brings together seven generalist repository awardees (Dataverse, Dryad, Figshare, Mendeley Data, Open Science Framework, Vivli, and Zenodo) to work together in a “coopetition” (competition and cooperation) model of collaboration to reduce the barriers to NIH data sharing, discovery, and reuse. The coopetition effort has organized into functional working groups focused on use cases, metadata and search, metrics, and community engagement with the goals of enhancing interoperability across generalist repositories and supporting the data needs of research communities.

In order to engage the community in GREI’s first year, the repositories held a workshop in January 2023 to examine generalist repositories as a key part of the evolving NIH data sharing landscape from the perspective of researchers, academic, and NIH communities; to offer training about use cases and best practices for data sharing and discovery in generalist repositories; and to gather community feedback to inform future GREI work.

The workshop came at an important time for data scientists. The 2023 NIH Data Management and Sharing Policy (DMSP) had just gone into effect, making it timely for researchers and those supporting them at institutions to consider data sharing practices and resources. Researchers indicated a strong need for more resources specific to the DMSP such as templates for data sharing, guidance on selecting the most appropriate repository for specific data types, and checklists for data sharing workflows and best practices. Workshop attendees also requested that the GREI team focus on developing new ways to give credit for open data sharing and reuse as well as making data discoverable without duplication, including through cataloging and connecting data.

“I’m quite proud of the progress we’ve made in GREI’s first year,” said Ishwar Chandramouliswaran, ODSS team lead for FAIR Data & Resources, who was instrumental in standing up and leading GREI. “We hope to take the momentum from year one and push GREI to new levels in the next year to not only lower barriers for sharing and reuse of NIH funded data, but more importantly to incentivize researchers to understand the value of sharing and data as a scholarly output in of itself.”

Indeed, GREI is engaged in some important work in their second year. They recently published the first iteration of use cases for sharing data and searching for data in each of the GREI repositories – a catalog of use cases that will grow over time. The repositories are currently collaborating to determine a common core metadata schema based on the DataCite schema that each repository will adopt, enhancing interoperability and discoverability of datasets across repositories. This schema is expected to be published in summer 2023 and open for public comment and iteration, as well as for adoption beyond generalist repositories. Similarly, common metadata will also support the collection of enhanced and common metrics of data impact across these repositories, another GREI goal to support tracking the impact of NIH funded research data. Lastly, they are engaging with key audiences including data librarians, academic institutions, and specific biomedical research communities that have reached out such as neuroscience to provide training and outreach via webinars, conferences presentations, and other resources, and to gather community feedback on their data sharing needs for generalist repositories. Community members can engage with GREI via contactgrei@googlegroups.com to ask questions, offer suggestions, and learn about upcoming events and new resources.

I am so happy to see programs like GREI flourishing under the sponsorship of our office. GREI is an excellent example of a program that touches many corners of the biomedical data science world. From bringing together the research community in workshops and trainings to creating the framework for the sharing and reuse of data, GREI is pushing for a more unified NIH. The more we work together like this, the more strides we can make in pushing data science forward and the more patients’ lives we can improve.

Postdoctoral Position in Radiobiology of Glioblastoma

Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD

A postdoctoral position is available to investigate the mechanisms mediating the radioresponse of glioblastoma grown in vitro and in vivo as orthotopic xenografts.  These studies are ultimately aimed at identifying and testing targets for glioblastoma radiosensitization. Numerous druggable targets are available to study.