The AIRR Community has established the AIRR Data Commons (ADC) [Christley et al.], a network of geographically distributed AIRR compliant repositories that adhere to the AIRR Standards. The AIRR Community and the ADC adheres to the FAIR principles of data sharing (Findable, Accessible, Interoperable, and Reproducible). The AIRR Data Commons web API is a web based query API that makes AIRR-seq studies and their associated annotated sequence data in the ADC findable and accessible (the FA in FAIR). Because the ADC API utilizes the MiAIRR Standards and AIRR file formats, the ADC also promotes and facilitates interoperability and data reuse (the IR in FAIR), thereby supporting both reproducibility and meta-analysis.
The AIRR Data Commons has grown from just under 400 million sequence annotations in late 2018 to its current size of five distributed repositories with over 60 studies, 6000 repertoires, and 4 billion sequence annotations available for data exploration and download. Of the five distributed repositories, there are two community repositories in Canada (the iReceptor Public Archive (IPA) and iReceptor COVID-19 repositories managed by iReceptor), one community repository in the US (managed by VDJServer) and one research group-specific repository VDJBase repository at Bar Ilan University in Israel and another at Sorbonne Universite in France.