The exponential growth of biological datasets has opened many exciting avenues for computational biologists. A lot of data is publicly available, including gene expression data (both microarray and RNA-seq, through ArrayExpress, GEO, SRA and ENA), but other types of data often are not. Many efforts are currently under way to make these other kinds of data (such as genotype, metabolite or proteomics data) available to researchers as well, using controlled-access repositories such as dbGaP and EGA. Additionally, international efforts such as the Biobanking and BioMolecular resources Research Infrastructure (BBMRI) and the distributed infrastructure for life-science information (ELIXIR) aim to connect different biobanks, enabling research on even larger datasets. This workshop aims to give an overview of the many exciting datasets that currently exist, how to get access to them, and what scientific insight can be derived from such data.
