Quantification of total T-cell receptor diversity by flow cytometry and spectratyping
© Ciupe et al.; licensee BioMed Central Ltd. 2013
Received: 19 April 2013
Accepted: 26 July 2013
Published: 6 August 2013
T-cell receptor diversity correlates with immune competency and is of particular interest in patients undergoing immune reconstitution. Spectratyping generates data about T-cell receptor CDR3 length distribution for each BV gene but is technically complex. Flow cytometry can also be used to generate data about T-cell receptor BV gene usage, but its utility has not been compared to or tested in combination with spectratyping.
Using flow cytometry and spectratype data, we have defined a divergence metric that quantifies the deviation from normal of T-cell receptor repertoire. We have shown that the sample size is a sensitive parameter in the predicted flow divergence values, but not in the spectratype divergence values. We have derived two ways to correct for the measurement bias using mathematical and statistical approaches and have predicted a lower bound in the number of lymphocytes needed when using the divergence as a substitute for diversity.
Using both flow cytometry and spectratyping of T-cells, we have defined the divergence measure as an indirect measure of T-cell receptor diversity. We have shown the dependence of the divergence measure on the sample size before it can be used to make predictions regarding the diversity of the T-cell receptor repertoire.
The immune system’s ability to fight a large array of foreign particles is facilitated by the diversity of the T-cell receptor (TCR) repertoire . This diversity is generated during thymocyte development by a process of somatic recombination. Inside the thymus, the constant (C) and variable (V) domains of the α and β chains of the TCR are assembled via random genetic rearrangements of the variable (V), diversity (D) and joining (J) gene segments . Additional diversity is added through imprecise joining of the V and J regions along with random nucleotide additions and deletions at the V(D)J junctions [2, 3]. Consequently, most of the variability lies in the third complementary determining region (CDR3) which is encoded by the V(D)J junction and comes in contact with the antigenic peptide on the surface of peptide/major histocompatibility complex (pMHC) molecules [4, 5]. While the total number of lymphocytes in the blood can be directly measured, assessment of the diversity of the TCR repertoire requires more complex and indirect assays in a research setting. Such assays include flow cytometry, spectratyping and nucleotide sequencing.
Different T-cell clones use different V gene families in the rearrangement of their β chains. Through the use of commercially available monoclonal antibodies (named TCR V β), one can use standard flow cytometry on whole blood samples to determine the percentage of CD4 T-cells that use a given TCR BV family in subjects or controls. Measures of heterogeneity of TCR BV family usage in these CD4 T-cells can be used as a substitute for TCR repertoire diversity . Flow cytometry is not only faster, cheaper, and technically simpler to use; the data reflects real population percentages.
Spectratyping uses messenger RNA (mRNA) from T-cells to amplify, by PCR, the complementary DNA (cDNA) across the CDR3 region. This generates information about the heterogeneity of the relative frequencies of different CDR3 length products within a functional TCR BV family. Because different T-cell clones have different sequences or lengths of CDR3, analysis of the CDR3 length distributions can be used to determine the overall TCR repertoire diversity [7–11]. Spectratyping has the advantage of providing a finer level of resolution than just analyzing BV gene family expression on the T-cells of flow cytometry. Although spectratyping provides the total number of CDR3 sizes and their pattern of distribution, the investigator cannot determine the frequency of cells used by a particular BV family. Amplifications of variations from a background distribution of each individual BV family may lead to over-representation of immunodominant clonotypes and therefore yield results that are not representative of the contribution of those cells in the entire T-cell repertoire.
TCR diversity can also be assessed by nucleotide sequencing of DNA CDR3 regions, but this is labor-intensive and generates an even lower level of resolution of the whole T-cell repertoire compared to spectratyping .
This paper focuses on the role of flow cytometry in measuring T-cell population diversity and compares it to T-cell population diversity as given by spectratyping. Traditionally, spectratyping data is quantified using a wide range of methods from visual [13, 14] to quantitative scoring [15–17]. Our group previously described the use of a likelihood method for measuring deviation from a normal TCR repertoire [9, 11]. For each observed CDR3 length distribution by spectratyping, we compute the Kullback-Leibler divergences between the patient CDR3 length distribution and a known reference distribution [9, 11]. We have modified the Kullback-Leibler divergence to measure the deviation of T-cell receptor diversity from normal. This was done by accounting for both the TCR BV family usage as measured by flow cytometry and by comparing the utility of this method to CDR3 length distribution as measured by spectratyping .
Estimator bias is a concern when using this method of divergence scoring. In particular, it is desirable to determine how much deviation in the computation of the divergence occurs when the initial number of lymphocytes used in generating the data is varied. We have addressed this question in the context of divergence measures generated individually by flow cytometry and spectratyping. The results are especially useful when using the techniques for limited numbers of cells.
We used the Kullback-Leibler divergence to quantify similarities between different frequency distributions in the T-cell repertoire diversity when measured by either flow cytometry or spectratyping. We started with two assumptions: 1) the reference distribution corresponds to a polyclonal TCR repertoire and 2) in individual subjects, a positive divergence determines the deviation from the normal TCR repertoire. The flow divergence, D f , is the distance between the individual and the perfectly sampled reference control distributions of all TCR BV family usage measured by flow cytometry. The spectratype divergence, D s , is the distance between the individual and the perfectly sampled reference control distributions of the CDR3 lengths of each TCR BV family and averaged over all TCR BV families as measured by spectratyping (see section Kullback-Leibler divergence and ).
where i = f,s for flow cytometry and spectratyping, respectively. L f is the number of BV families used in the flow cytometry assay (in our case 18) and L s is the number of CDR3 lengths used in the spectratype assay (in our case 14).
Therefore, only the number of measured events, n, and the dimension of the measured space, L i are needed to correct the divergence measures. We used this formula to assess the performance of D f and D s measures in an athymic DiGeorge subject (Figure 1) during a period of limited numbers of peripheral blood T-cells as the patient underwent immune reconstitution following thymus transplantation.
Flow cytometry results
Average CD4 T-cell sample size, measured flow divergence D f , and corrected flow divergence D f , corr in a DiGeorge subject
Average CD4 nr
in gate (n)
D f value
Df, corr value
Summary of T-cell sample size and the corresponding flow divergence values D f
Average CD4 T-cell nr
in gate n
divergence D f
where, y(n) is the observed D f and n is the number of CD4 T-cells in the sample. The intercept α is the true divergence, Df,corr, and the slope C quantifies the rate at which the diversity is dependent on the sample size. In equation (1), slope C corresponds to the (L f - 1)/2 value, which for an assay that uses 18 BV families, reduces to 8.5. The errors, ε, are independent and normally distributed.
Parameter values and confidence intervals for model (2)
where α i are the corrected divergence values for the patient i, with i = 1,...,8. The rate at which the diversity is dependent on the sample size, C, is considered constant among the subjects. The errors for each of the subjects, ε i , are independent and normally distributed.
Parameter values and confidence intervals for model (3)
From our estimates C = 7.705 and α = 0.19 (median 0.12). This implies the sample size, n, must be larger than 364 (median 577) cells for an accurate Df,corr estimate. In our case, we gated the flow cytometry on CD4 T-cells, so more than 364 CD4 T-cells, or events, must be captured in the flow analysis.
CD3 T-cell sample size, measured spectratype divergence D s , and corrected spectratype divergence D s,,corr in a DiGeorge subject
Days after transplant
CD3 T-cells n 0
Measured D s value
Corrected Ds, corr value
The corrected Ds,corr is found by subtracting (L s - 1)/2n, where n = n0/π, from the measured divergence at each time point, where L s = 14 (Table 5). The measured and corrected divergences as a function of 1/n0 are plotted in Figure 1(b). We note that there is no correction in the measured spectratype divergence, D s , since the number n0 of CD3 T-cells that we are starting with is always high.
By combining the individual contributions of flow and spectratype divergence, we defined the total divergence, D (see section ‘Kullback-Leibler divergence’). D measures the divergence of the individual from the perfectly sampled reference control and accounts for differences in distributions of CDR3 lengths within each TCR BV family by spectratyping as well as differences in distributions of overall TCR BV families by flow cytometry. Corrections in the flow and spectratype divergences are sufficient to ensure that the total divergence is independent of the sample size.
The data used in our study came from flow cytometry and spectratype assays in both DiGeorge subjects after thymus transplantation and healthy adult volunteers. This study presents significant information regarding the utility of flow cytometry, as well as spectratyping, to assess the diversity of the antigen receptor repertoire. Importantly, these data identify a bias in measurement errors which must be corrected. The paper presents the relationships between the number of gated events in the flow cytometry assay, as well as the number of CD3 T-cells in the spectratype assay, and the information-theory measures, D f and D s , used as surrogates of TCR diversity.
We addressed a critical issue of estimator bias. Starting with the assumption that such a bias exists, we have derived ways to account for the error in the measured divergences. We show that D f and D s can be corrected by substracting a number inversely proportional to the sample size.
Correlation coefficient and p-values as given by a Pearson comparison test, between the inverse average number of CD4 T-cell used in flow cytometry assays and the flow divergence
Our study allows us to predict a lower bound for the number of CD4 T-cells needed in the flow cytometry gated events. We have shown that at least 364 CD4 T-cells have to be counted as gated events for a 90% confidence in the D f measures. With fewer gated events, the D f measurement cannot be used as a substitute for diversity. This is particularly important to keep in mind when assessing patients with limited numbers of T-cells, such as those undergoing immune reconstitution following thymus, stem cell or bone marrow transplantation. Each of these is a clinical situation in which the development of the T-cell repertoire correlates to immune competency. Thus, these data provide a quantitative basis by which T-cell repertoire diversity can be assessed by flow cytometry.
Correlation coefficient and p-values as given by a Pearson comparison test, between the inverse total number of CD3 T-cell used in spectratype assays and the spectratye divergence
The total divergence actively incorporates the flow divergence. Correction in the flow divergence, D f , guarantees independence of the total divergence, D, from the sample size.
In conclusion, sample size is a sensitive parameter in the predicted flow divergence values, but not in the spectratype divergence values. Although using flow cytometry to assess T-cell repertoire diversity is a valuable tool, one must have sufficient cells, or events, in the flow cytometry gate before using either the flow or the total divergence as a prediction for the TCR repertoire diversity.
List of TCR BV families and antibodies used in the flow cytometry assay
V β 1
V β 2
V β 3
V β 4
V β 5.1
V β 5.3
V β 5.2
V β 7.1
V β 7.2
V β 8.1 & V β 8.2
V β 9
V β 11
V β 12
V β 13.2
V β 13.6
V β 14
V β 16
V β 17
V β 18
V β 20
V β 22
V β 23
List of TCR VB families and antibodies excluded from the flow cytometry studies
V β 13.1 & 13.4 & 13.6
TRBV6-5 & 6-6 & 6-9
V β 21.3
Subjects were enrolled in protocols that were approved by the Duke University Health System Institutional Review Board and were reviewed by the Food and Drug Administration under an Investigational New Drug application. All subjects were children. The parent(s) of each subject provided written informed consent.
Mean % of CD4 T-cells that use a TCR BV family as predicted by the flow cytometry assay
Mean % of CD4 T-cells
V β 1
V β 2
V β 3
V β 4
V β 5.1
V β 5.3
V β 5.2
V β 7.1
V β 7.2
V β 8.1 & V β 8.2
V β 9
V β 11
V β 12
V β 13.2
V β 13.6
V β 14
V β 16
V β 17
V β 18
V β 20
V β 22
V β 23
Flow Kullback-Leibler divergence
The flow Kullback-Leibler divergence is a measure of the distance between the two frequency distributions or, equivalently, it is the inefficiency of assuming that the distribution of BV family usage is p i , i = 1,...,n F , when the true frequency usage is P i ,i = 1,...,n F .
Spectratype Kullback-Leibler divergence
Total Kullback-Leibler divergence
Sampling bias - theoretical derivation
The distribution of BV family usage (CDR3 length within a BV family) of a perfectly sampled reference control can be described by a L f (L s )-dimensional multinomial distribution with the parameter vector P, where p i is the relative numbers of T-cells that use the BV family (CDR3 length) i. The distribution of the actual, but not yet observed, BV family (CDR3 length) usage in individual patient/controls are subsamples q of the ideal distribution, where q i are the relative numbers of T-cells that use the BV family (CDR3 length) i. The distance between these two distributions is given by the parameter d-1, with a large d accounting for a closer similarity between P and q. Finally, the observed distribution of BV family usage (CDR3 length), p, are samples of n measured events for every individual patient/control, where p i are the relative numbers of T-cells that use the BV family (CDR3 length) i. Here L f (L s ) is the dimension of the measured space, i.e. the number of BV families used in the flow cytometry assay, in our case 18 (the number of CDR3 lengths used in spectratyping assay, in our case 14).
is the Kullback-Leibler divergence between p and q.
which relaxes the concern of variability due to sampling error.
This work was supported by National Institute of Health grants R01 AI 54843, R01 AI 47040, M03 RR60 (Duke General Clinical Research Center, National Center for Research Resources, National Institute of Health), and Office of Orphan Products Development, Food and Drug Administration, grant FD-R-002606. MLM and TBK are members of the Duke Comprehensive Cancer Center. We acknowledge the technical assistance of Marilyn Alexieff, Jie Li, Chia-San Hsieh, Jennifer Lonon and Julie E. Smith, the clinical research assistance of Stephanie Gupton and Alice Jackson, and the regulatory affairs assistance of Elizabeth McCarthy and Michele Cox are appreciated as is the clinical care by the faculty and fellows of the Duke Pediatric Allergy and Immunology Division. We acknowledge the collaboration of surgeons James Jaggers, Andrew Lodge, Henry Rice, Micheal Skinner, and Jeffrey Hoehner. We appreciate the assistance of Drs. Michael Cook and Scott Langdon in the Duke Comprehensive Cancer Center flow cytometry and sequencing facilities.
- Nikolich-Zugich J, Slifka M, Messaoudi I: The many important facets of T-cell repertoire diversity. Nat Rev Immunol. 2004, 4: 123-10.1038/nri1292.View ArticlePubMedGoogle Scholar
- Davis M, Bjorkman P: T-cell antigen receptor genes and T- cell recognition. Nature. 1988, 334: 395-401. 10.1038/334395a0.View ArticlePubMedGoogle Scholar
- Alt F, Oltz E, Young F, Gorman J, Taccioli J, Chen J: VDJ recombination. Immunol Today. 1992, 13: 306-314. 10.1016/0167-5699(92)90043-7.View ArticlePubMedGoogle Scholar
- Garcia K, Degano M, Stanfield R, Brunmark A, Jackson M, Peterson P, Teyton L, Wilson I: An alphabeta T cell receptor structure at 2.5 A and its orientation in the TCR-MHC complex. Science. 1996, 274: 209-10.1126/science.274.5285.209.View ArticlePubMedGoogle Scholar
- Davis M, Boniface J, Reich Z, Lyons D, Hampl J, Arden B, Chien Y: Ligand recognition by αβ T-cell receptors. Annu Rev Immunol. 1998, 16: 523-544. 10.1146/annurev.immunol.16.1.523.View ArticlePubMedGoogle Scholar
- Markert ML, Alexieff MJ, Li J, Sarzotti M, Ozaki DA, Devlin BH, Sempowski GD, Hale LP, Buckley R, Rice HE, Mahaffey SM, Skinner MA: Postnatal thymus transplantation with immunosuppression as treatment for DiGeorge syndrome. Blood. 2004, 104: 2574-2581. 10.1182/blood-2003-08-2984.View ArticlePubMedGoogle Scholar
- Cochet M, Pannetier C, Regnault A, Darche S, Leclerc C, Kourilsky P: Molecular detection and in vivo analysis of the specific T cell response to a protein antigen. Eur J Immunol. 1992, 22 (10): 2639-2647. 10.1002/eji.1830221025.View ArticlePubMedGoogle Scholar
- Gorski J, Yassai M, Zhu X, Kissela B, Keever C, Flomberg N: Circulating T cell repertoire complexity in normal individuals and bone marrow recipients analyzed by CDR3 size spectratyping. Correlation with immune status. J Immunol. 1994, 152: 5109-5119.PubMedGoogle Scholar
- Kepler T, He M, Tomfohr J, Devlin B, Sarzotti M, Markert M: Statistical analysis of antigen receptor spectratype data. Bioinformatics. 2005, 21 (16): 3394-3400. 10.1093/bioinformatics/bti539.View ArticlePubMedGoogle Scholar
- Pannetier C, Even J, Kourilsky P: T-cell repertoire diversity and clonal expansions in normal and clinical samples. Immunol Today. 1995, 16: 176-10.1016/0167-5699(95)80117-0.View ArticlePubMedGoogle Scholar
- Ciupe S, Markert M, Devlin B, Kepler T: The dynamics of T-cell receptor repertoire diversity following thymus transplantation in Digeorge anomaly. PLoS Comp Biol. 2009, 5: 1-13.View ArticleGoogle Scholar
- The Immunoscope Approach for the Analysis of T Cell Repertoires. Edited by: Oksenberg JR. The Antigen T Cell Receptor: Selected Protocols and Applications, 1998, Chapman and Hall,New YorkGoogle Scholar
- Ferrand C, Robinet E, Contassot E, Certoux J, Lim A, Hervé P, Tiberghien P: Retrovirus-mediated gene transfer in primary T lymphocytes: influence of the transduction/selection process and of ex vivo expansion on the T cell receptor beta chain hypervariable region repertoire. Hum Gene Ther. 2000, 11: 1151-1164. 10.1089/10430340050015202.View ArticlePubMedGoogle Scholar
- Kook H, Risitano A, Zeng W, Wlodarski M, Lottemann C, Nakamura R, Barrett J, Young N, Maciejewski J: Changes in T-cell receptor VB repertoire in aplastic anemia: effects of different immunosuppressive regimens. Blood. 2002, 99: 3668-3675. 10.1182/blood.V99.10.3668.View ArticlePubMedGoogle Scholar
- Bomberger C, Singh-Jairam M, Rodey G, Guerriero A, Yeager A, Fleming W, Holland HK, Waller E: Lymphoid reconstitution after autologous PBSC transplantation with FACS-sorted CD34+ hematopoietic progenitors. Blood. 1998, 91: 2588-2600.PubMedGoogle Scholar
- Peggs K, Verfuerth S, D’Sa S, Yong K, Mackinnon S: Assesing diversity: immune reconstitution and T-cell receptor BV spectratype analysis following stem cell transplantation. J Haematol. 2003, 120: 154-165. 10.1046/j.1365-2141.2003.04036.x.View ArticleGoogle Scholar
- Wu C, Chillemi A, Alyea E, Orsini E, Neuberg D, Soiffer R, Ritz J: Reconstitution of T-cell receptor repertoire diversity following T-cell depleted allogeneic bone marrow transplantation is related to hematopoietic chimerism. Blood. 2000, 95: 352-359.PubMedGoogle Scholar
- Press W, Teukolsky S, Vetterling W, Flannery B: Numerical Recipes with Source Code CD-ROM: The Art of Scientific Computing, 3rd edition. 2007, Cambridge University Press, New York, NYGoogle Scholar
- Markert M, Sarzotti M, Ozaki D, Sempowski G, Rhein M, Hale L, Deist FL, Alexieff M, Li J, Hauser E, Haynes B, Rice H, Skinner M, Mahaffey S, Jaggers J, Stein L, Mill M: Thymic transplantation in complete DiGeorge syndrome: immunologic and safety evaluations in 12 patients. Blood. 2003, 102: 1121-1130. 10.1182/blood-2002-08-2545.View ArticlePubMedGoogle Scholar
- Erdélyi A: Asymptotic Expansions. 1956, New York: Dover PublicationsGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.