Skip to content

ImagingDataCommons/IDC-Tutorials

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Welcome!

This repository contains tutorial materials (for the most part, as Python notebooks) that are developed to help you learn about NCI Imaging Data Commons and utilize it in your work.

If this is the first time you hear about IDC, here are some highlights about what it has to offer:

  • >60 TB: IDC contains radiology, brightfield (H&E) and fluorescence slide microscopy images, along with image-derived data (annotations, segmentations, quantitative measurements) and accompanying clinical data

  • free: all of the data in IDC is publicly available: no registration, no access requests

  • commercial-friendly: >95% of the data in IDC is covered by the permissive CC-BY license, which allows commercial reuse (small subset of data is covered by the CC-NC license); each file in IDC is tagged with the license to make it easier for you to understand and follow the rules

  • cloud-based: all of the data in IDC is available from both Google and AWS public buckets: fast and free to download, no out-of-cloud egress fees

  • harmonized: all of the images and image-derived data in IDC is harmonized into standard DICOM representation

The tutorial notebooks are located in the notebooks, and are organized in the following folders.

"Getting Started" notebooks are intended to introduce the users to IDC. We believe those notebooks are the best place to start using IDC. In this notebook series you will learn:

  • how IDC data is organized
  • how to search IDC data
  • how to download data from IDC
  • how to use various visualization tools with IDC data
  • how to properly acknowledge data contributors and stay compliant with the usage license

Notebooks in this folder focus on topics that will require understanding of the basics, and aim to address more narrow use cases of IDC usage. Such topics include:

  • how to search clinical data accompanying IDC images and how to combine imaging and clinical metadata in your searches
  • how to use AWS-specific components for working with IDC data
  • how to deploy open source OHIF and Slim viewers using free Google Cloud resources

This folders contains notebooks that demonstrate the usage of the data in the specific IDC collections. The notebooks in this folder will always have the prefix of the collection_id they correspond to, for easier navigation.

This folder is dedicated to the notebooks focused on the digital pathology (pathomics) applications. The use of DICOM standard is relatively new in digital pathology, and this field is being actively developed, thus a dedicated folder for this.

Demonstrations/examples of analyses of images from IDC.

Here you will find an archive of the notebooks that were used in tutorials, which at times may demonstrate experimental features. By design, the notebooks presented at specific events may not be updated after the event, and are stored in this folder for archival purposes.

IDC is an actively evolving resource. As we develop new and improved capabilities, we improve our recommended usage practices, and may deprecate notebooks that are no longer maintained and may no longer work. You will find thse in the deprecated folder.

testing

This directory is used for the maintenance of the repository to support testing of the actively supported notebooks.

Support

If you have any questions about the notebooks in this repository, please open a discussion thread in IDC user forum, or open the issue in this repository.