Our goal is to make rigorous analysis of AIRR-seq data easy.
The purpose of our working group is to encourage practices that enable software tools to work, and to work with one another. A key first priority is to assemble data sets that people can use to test and compare the functionality of various programs. To this end, we have defined summary statistics that can be used to characterize simulated data sets and compare them with real-world data sets. The result is a software tool, sumrep, which is described in a recent paper. We will use this tool to assist in the collection of simulated and real-world data sets for testing and benchmarking.
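As a rough illustration of what comparing data sets by a summary statistic can look like, the sketch below computes the CDR3 length distribution of two small repertoires and scores their difference with the Jensen-Shannon divergence. It is not sumrep's interface: the function names `length_distribution` and `js_divergence` are hypothetical, the choice of statistic and divergence is an assumption made for the example, and the sequences are invented.

```python
import math
from collections import Counter


def length_distribution(sequences):
    """Return the relative frequency of each sequence length."""
    counts = Counter(len(s) for s in sequences)
    total = sum(counts.values())
    return {length: n / total for length, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two discrete distributions.

    p and q map outcomes to probabilities; missing outcomes count as zero.
    """
    divergence = 0.0
    for x in set(p) | set(q):
        px, qx = p.get(x, 0.0), q.get(x, 0.0)
        mx = 0.5 * (px + qx)  # mixture distribution at x
        if px > 0:
            divergence += 0.5 * px * math.log2(px / mx)
        if qx > 0:
            divergence += 0.5 * qx * math.log2(qx / mx)
    return divergence


# Toy CDR3 amino-acid sequences, invented for illustration only.
observed = ["CARDYW", "CARGGYW", "CARDLGYW", "CARW"]
simulated = ["CARDYW", "CARGSYW", "CARDLGYW", "CARDW"]

score = js_divergence(length_distribution(observed), length_distribution(simulated))
print(f"JS divergence of CDR3 length distributions: {score:.3f}")
```

With base-2 logarithms the score lies between 0 (identical distributions) and 1 (no overlap), so a simulated data set that mimics a real one well on this statistic should score close to 0.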
We have also defined a standard for AIRR-seq software tools. Over the next year we will promote it by providing community support and publicity to compliant tools, as a way of encouraging interoperability and adoption of AIRR standard protocols. We have also started work, in collaboration with the Germline Database Working Group, on an initiative to assess the biological credibility of an AIRR-seq repertoire and to identify common technical errors that can occur during its preparation. Such errors can be hard to spot from read quality annotations and other technical measures commonly available today.
Plans for 2021 include:
- Complete Benchmarking Framework
- Identify Test Datasets for Annotation Tools
- Build a tool for assessing repertoire credibility
Co-Leaders: William Lees and Chaim Schramm
Members: Teyfik Agac, Lmar Babrak, Bryan Briney, Christian Busse, Brian Corrie, Patrice Duroux, Alex Horst, William Lees (Co-lead), Susanna Marquez, Patrick Meier, Enkelejda Miho, Eriberto Natali, Mats Ohlin, Chaim Schramm (Co-lead), Adrian Shepherd, Erand Smakaj, Inimary Toby, Deniz Tosoni, and Jian Ye.