The AIRR Community has established the AIRR Data Commons (ADC) [Christley et al.], a network of geographically distributed AIRR compliant repositories that adhere to the AIRR Standards. The AIRR Community and the ADC adheres to the FAIR principles of data sharing (Findable, Accessible, Interoperable, and Reproducible). The AIRR Data Commons web API is a web based query API that makes AIRR-seq studies and their associated annotated sequence data in the ADC findable and accessible (the FA in FAIR). Because the ADC API utilizes the MiAIRR Standards and AIRR file formats, the ADC also promotes and facilitates interoperability and data reuse (the IR in FAIR), thereby supporting both reproducibility and meta-analysis.
The AIRR Data Commons has grown from just under 400 million sequence annotations in late 2018 to its current size of nine distributed repositories with 88 studies, 9700 sample repertoires, and 5.2 billion sequence annotations available for data exploration and download. Of the nine distributed community repositories, there are four in Canada (the iReceptor Public Archive (IPA), iReceptor COVID-19, Roche and Type 1 Diabetes repositories managed by iReceptor), one in the US (managed by VDJServer), one VDJBase repository at Bar Ilan University in Israel, one in Germany at DKFZ, and another in Muenster, one in South Africa at NICD, and another at Sorbonne Universite in France.
The AIRR Data Commons can be searched interactively using a web user interface, the iReceptor Gateway [Corrie et al. ].