OHSU Library Collaborates with DMICE to Make Biomedical Big Data Science Open Educational Resources Available

For the last couple years, faculty from the OHSU Department of Medical Informatics & Clinical Epidemiology (DMICE) and Library have been developing open educational resources (OERs) in the area of Biomedical Big Data Science. Funded by a grant from the National Institutes of Health (NIH) Big Data to Knowledge (BD2K) Program, OERs have been produced that can be downloaded, used, and repurposed for a variety of educational audiences by both learners and educators.

Development of the OERs is an ongoing process, but we have reached the point where a critical mass of the content is being made available for use and to obtain feedback. The OERs are intended to be flexible and customizable and we encourage others to use or repurpose these materials for training, workshops and professional development or for dissemination to instructors in various fields. They can be used as “out of the box” courses for students, or as materials for educators to use in courses, training programs, and other learning activities. We ultimately aim to create 32 modules on the following topics:

  1. Biomedical Big Data Science
  2. Introduction to Big Data in Biology and Medicine
  3. Ethical Issues in Use of Big Data
  4. Clinical Standards Related to Big Data
  5. Basic Research Data Standards
  6. Public Health and Big Data
  7. Team Science
  8. Secondary Use (Reuse) of Clinical Data
  9. Publication and Peer Review
  10. Information Retrieval
  11. Version Control and Identifiers
  12. Data Annotation and Curation
  13. Data Tools and Landscape
  14. Ontologies 101
  15. Data Metadata and Provenance
  16. Semantic Data Interoperability
  17. Choice of Algorithms and Algorithm Dynamics
  18. Visualization and Interpretation
  19. Replication, Validation and the Spectrum of Reproducibility
  20. Regulatory Issues in Big Data for Genomics and Health Semantic Web Data
  21. Hosting Data Dissemination and Data Stewardship Workshops
  22. Guidelines for Reporting, Publications, and Data Sharing
  23. Terminology of Biomedical, Clinical, and Translational Research
  24. Computing Concepts for Big Data
  25. Data Modeling
  26. Semantic Web Data
  27. Context-based Selection of Data
  28. Translating the Question
  29. Implications of Provenance and Pre-processing
  30. Data Tells a Story
  31. Statistical Significance, P-hacking and Multiple-testing
  32. Displaying Confidence and Uncertainty

At the present time, 20 of the above modules are available for download and use. We are encouraging their use and seeking feedback from those who make use of them. The feedback will be used to improve the available modules and guide development of those not yet released.

We have also been developing mappings to research competencies in other areas, such as for the NIH Clinical and Translational Science Award (CTSA) consortium research competency requirements and the Medical Library Association professional competencies for health sciences librarians. To this end, we have been able to link these materials to existing efforts, and provide training opportunities for learners and educators working in these areas. We ultimately aim to complete this mapping across all of the BD2K training offerings, to align with other groups, avoid redundancy and to ensure we are meeting the needs of these various groups.

This project is actually one of several projects that have been funded by grants to develop and provide education in biomedical informatics and data science. The other projects include:

We hope that all of these materials are useful for many audiences and look forward to feedback enabling their improvement.

For more information, please contact Nicole Vasilevsky.