The AIRR Community has established the AIRR Data Commons (ADC) [Christley et al.], a network of geographically distributed AIRR compliant repositories that adhere to the AIRR Standards. The AIRR Community and the ADC adheres to the FAIR principles of data sharing (Findable, Accessible, Interoperable, and Reproducible). The AIRR Data Commons web API is a web based query API that makes AIRR-seq studies and their associated annotated sequence data in the ADC findable and accessible (the FA in FAIR). Because the ADC API utilizes the MiAIRR Standards and AIRR file formats, the ADC also promotes and facilitates interoperability and data reuse (the IR in FAIR), thereby supporting both reproducibility and meta-analysis.
The AIRR Data Commons has grown from just under 400 million sequence annotations in late 2018 to its current size of seven distributed repositories with 85 studies, 7200 sample repertoires, and 5.4 billion sequence annotations available for data exploration and download. Of the seven distributed community repositories, there are two in Canada (the iReceptor Public Archive (IPA) and iReceptor COVID-19 repositories managed by iReceptor), one in the US (managed by VDJServer), one VDJBase repository at Bar Ilan University in Israel, one in Germany at DKFZ, and another in Muenster, one in South Africa at NICD, and another at Sorbonne Universite in France.