Validation of biomarkers to predict response to immunotherapy in cancer: Volume I — pre-analytical and analytical validation
Journal for ImmunoTherapy of Cancer volume 4, Article number: 76 (2016)
Immunotherapies have emerged as one of the most promising approaches to treat patients with cancer. Recently, there have been many clinical successes using checkpoint receptor blockade, including T cell inhibitory receptors such as cytotoxic T-lymphocyte-associated antigen 4 (CTLA-4) and programmed cell death-1 (PD-1). Despite demonstrated successes in a variety of malignancies, responses only typically occur in a minority of patients in any given histology. Additionally, treatment is associated with inflammatory toxicity and high cost. Therefore, determining which patients would derive clinical benefit from immunotherapy is a compelling clinical question.
Although numerous candidate biomarkers have been described, there are currently three FDA-approved assays based on PD-1 ligand expression (PD-L1) that have been clinically validated to identify patients who are more likely to benefit from a single-agent anti-PD-1/PD-L1 therapy. Because of the complexity of the immune response and tumor biology, it is unlikely that a single biomarker will be sufficient to predict clinical outcomes in response to immune-targeted therapy. Rather, the integration of multiple tumor and immune response parameters, such as protein expression, genomics, and transcriptomics, may be necessary for accurate prediction of clinical benefit. Before a candidate biomarker and/or new technology can be used in a clinical setting, several steps are necessary to demonstrate its clinical validity. Although regulatory guidelines provide general roadmaps for the validation process, their applicability to biomarkers in the cancer immunotherapy field is somewhat limited. Thus, Working Group 1 (WG1) of the Society for Immunotherapy of Cancer (SITC) Immune Biomarkers Task Force convened to address this need. In this two volume series, we discuss pre-analytical and analytical (Volume I) as well as clinical and regulatory (Volume II) aspects of the validation process as applied to predictive biomarkers for cancer immunotherapy. To illustrate the requirements for validation, we discuss examples of biomarker assays that have shown preliminary evidence of an association with clinical benefit from immunotherapeutic interventions. The scope includes only those assays and technologies that have established a certain level of validation for clinical use (fit-for-purpose). Recommendations to meet challenges and strategies to guide the choice of analytical and clinical validation design for specific assays are also provided.
Increased understanding of cellular and molecular tumor immunology over the past two decades has enabled the identification of new ways to manipulate the immune response against cancer to counteract immunosuppressive mechanisms that evolve during tumor progression. Monoclonal antibodies (mAbs) to the cytotoxic T-lymphocyte-associated antigen 4 (CTLA-4) and programmed cell death-1 (PD-1) protein, two T cell-inhibitory checkpoint receptors with independent mechanisms of action, have demonstrated improvement in overall survival in advanced melanoma patients [1–3]. Significant clinical benefit (including durable tumor responses and extension of progression-free and overall survival) has also been shown in tumor types as diverse as non-small cell lung cancer (NSCLC), renal cell carcinoma (RCC), bladder cancer, and Hodgkin’s disease [4–12].
Despite demonstrated successes, responses to immunotherapy interventions only occur in a minority of patients. Attempts are being made to improve the activity of immunotherapies with novel combinatorial strategies and with biomarker optimization. A wave of recent clinical trial results has highlighted the potential for combination therapies that include these immunomodulating agents [13–17]. A wide range of biomarkers and assays is required to guide cancer therapy for several reasons: i) a variety of immunotherapy agents with different mechanisms of action including immunotherapies that target activating and inhibitory T cell receptors (e.g., CTLA-4 and PD-1), adoptive T cell therapies that include tissue infiltrating lymphocytes (TILs), chimeric antigen receptors (CARs), and T cell receptor (TCR) modified T cells ; ii) tumor heterogeneity including changes in antigenic profiles over time and location for an individual patient; and iii) a variety of immune-suppressive mechanisms that are active in the tumor microenvironment (TME). Optimizing biomarkers for immunotherapy could help to properly select patients for treatment, identify rational combination therapies, and define progression and resistance. In addition, biomarkers may help define the mechanism of action for different agents and help with dose selection as well as sequencing of drug combinations. Although most of immune therapies engage T cells and the assessment of cell-mediated cytotoxicity is integral for the selection of biomarkers of response to immunotherapy, the cancer immune response is a multi-step process involving interactions between the tumor and microenvironment including multiple cell subsets and soluble mediators functioning at different times and at different anatomical sites (tumor, lymph nodes, and blood) as well as the tumor stroma and vasculature. Thus, profiling of the tumor-immune interface with multiparametric technologies that encompass the dimensionality and complexity of this interaction are likely to be needed to monitor and stratify cancer patients for individual therapeutic requirements.
A number of candidate biomarkers and platforms with the potential to be developed into assays to predict response to immunotherapy have been identified in research studies. Platforms based on multiplexed transcriptome analysis, protein expression, and genomic variability are discussed in SITC Immune Biomarker Task Force reports (Additional file 1). The availability of these platforms and novel technologies should facilitate the integration of the molecular features of the tumor and the host factors for the development of multiplex profiles to guide personalized treatment in the future.
The focus of this review is to discuss the requirements for advancing a biomarker assay through the validation process to its clinical application. The validation of such assays should ultimately qualify them for use in clinical decision making. Specific examples of the assays already in use such as immunohistochemistry (IHC) based PD-L1 assays or soon be approved for use in clinical laboratories are discussed to illustrate the requirements for analytical validation (Table 1). Prototypes of these assays have been shown in research and small clinical studies to be potentially useful as patient enrichment tools. Although analytical validation data for each specific platform are available, none of these have been clinically validated yet as a predictive biomarker, except for PD-L1, which will be discussed below.
According to the position paper by Lee and colleagues , the biomarker assay validation process can be separated into several continuous steps: assessment of basic assay performance (analytical validation); characterization of the performance of the assay with regard to its intended use (clinical validation); and validation in clinical trials that ensures that the assay performs robustly according to predefined specifications (fit-for-purpose) and facilitates the establishment of definitive acceptance criteria for clinical use (validation of clinical utility). The fit-for-purpose approach for biomarker development and validation addresses the assay validation that should be tailored to meet the intended purpose of the biomarker. The fit-for-purpose method validation is an umbrella term that is used to describe distinct stages of the validation process.
Analytical validation defines how accurately and reliably the test measures the analyte(s) of interest in the patient specimen. Analytical validity is defined as the assay’s ability to accurately and reliably measure the analyte of interest in the clinical laboratory and in specimens representative of the population of interest. Analytical validity refers to the three different phases of assay development: pre-analytical, analytical, and post-analytical phase.
Clinical validation should demonstrate how robustly and reliably the test results correlate with the clinical outcome of interest. Practically, clinical validity implies that the cancer biomarker assay separates a population into two or more distinct groups with different biological characteristics or clinical outcomes.
Clinical utility is defined as an assay’s ability to significantly improve clinical outcomes, i.e., does the use of the biomarker result in patient benefit or add value to patient management decision making compared with current practices.
Pre-analytical and analytical assay validation steps are discussed in Volume I, while Volume II is focused on the clinical validation and validation of clinical utility of the assays as well as regulatory considerations.
Specific examples of relevant assays are discussed in detail in the following section and are summarized in Table 1. The scope of the paper includes only those assays that have established a certain level of validation for clinical use as biomarkers predictive of response to immunotherapy. Multiple biomarkers and platforms that require standardized assays and are lacking even initial clinical validation demonstrating its clinical utility (fit-for-purpose) are not the focus of this publication.
1. Flow cytometry
Phenotypic analysis of T cells can provide information regarding their activation status using assays based on multiplex flow cytometry examining a panel of lymphocyte markers. A baseline signature of frequencies of myeloid-derived suppressor cells (MDSCs) and regulatory T cells (Tregs), and high absolute eosinophil counts (AEC) has been recently shown to be associated with favorable outcome in patients with melanoma receiving ipilimumab . Interestingly, higher baseline frequencies of circulating CD4 + CD25 + FoxP3+ Tregs were associated with improved overall survival (OS) in this patient population . Tregs represent direct target cells of ipilimumab due to constitutive expression of CTLA-4 by those cells which might be one of the reasons that patients with higher levels of circulating Tregs are more likely to benefit from anti-CTLA-4 antibodies. In order to be implemented in routine clinical settings, this biomarker signature needs to be analytically and clinically validated (including a panel of markers required for the analysis and enumeration of MDSCs and Tregs) .
2. Enzyme-linked ImmunoSpot (ELISpot)
Enzyme-Linked ImmunoSpot (ELISpot) is a highly quantitative assay for monitoring the secretion of cytokines and cytotoxic mediators (e.g., perforin, granzyme B). It can measure a wide range of cellular responses and is capable of assessing critical immune-related activity of antigen-specific T cell stimulation. The most common analytes investigated today are cytokines (interferon (IFN)ɣ, interleukin (IL)-2, IL-5, IL-10, IL-17, granzyme B, tumor necrosis factor (TNF), and granulocyte-macrophage colony-stimulating factor (GM-CSF)). Other factors can also be evaluated with this platform, such as chemokines (e.g., CXCL8, CCL4). The IFNγ ELISpot assay has been used extensively for monitoring immune responses in the development of vaccines for the prevention and treatment of infectious diseases; however, there is also a body of literature demonstrating the correlation of the clinical outcome of cancer patients in immunotherapeutic trials with ELISpot results [21, 22].
Specifically, clinical trials have shown a significant correlation of antigen-specific ELISpot responses with patient survival after administration of a melanoma antigen-specific peptide based vaccine in advanced-stage patients . The magnitude of antigen-specific IFNγ-secreting cells, as measured by ELISpot, showed correlation towards survival after the administration of a prostate-specific antigen vaccine in prostate cancer patients, as well as a human epidermal growth factor receptor 2 (HER2/neu) specific vaccine in breast cancer patients [24–26]. Compared to the IFNγ ELISpot assay, the granzyme B ELISpot may be a more direct measure of cytotoxic cell activity because it measures one of the primary effector molecules of cell-mediated cytotoxicity. Cytotoxic activity of CD8+ T cells measured by granzyme B release after stimulation with MUC antigen was found to be predictive for the survival of pancreatic cancer patients independently of type of therapy (chemoradioimmunotherapy or 5FU-based chemotherapy) . Considering that the tumor infiltration is a reflection of a pre-existing immunity and is predictive of response to anti-checkpoint immunotherapy (discussed below), it appears logical to assume that functional assessment of cytotoxic activity of CD8+ T cells following stimulation with specific tumor associated antigen(s) by ELISpot may also be predictive of response to immunotherapy.
3. Single cell network profiling (SCNP)
Single Cell Network Profiling (SCNP) is a unique proteomic approach that quantifies functional immune signaling capacity, simultaneously across multiple immune cell subsets. One of the major advantages of this technology in the context of tumor immunotherapy is the ability to monitor cellular functional capacity without physical cell isolation. This enables the detection and monitoring of immune signaling and communication within the complex and interlocked immune system. The data generated are highly dimensional, including functional information across many signaling pathways at one time, with resolution down to rare immune cell subsets. This enables the generation of predictive and prognostic information in heterogeneous disease states. Clinical validation of the technology has been established in non-M3 AML, with classifiers for the prediction of response to frontline standard induction therapy in the elderly and pediatric populations [28, 29].
PD-L1 level measurement
There is increasing evidence to support the hypothesis that a pre-existent adaptive anti-tumor immune response in the TME correlates with clinical benefit to checkpoint blockade with anti-CTLA-4 or anti-PD-1/PD-L1 inhibitors [30, 31]. Recently, three IHC assays to measure PD-L1 expression have been approved by the U.S. Food and Drug Administration (FDA). One is a companion diagnostic assay to identify advanced NSCLC patients that may be treated with pembrolizumab . The second assay was approved as a complementary diagnostic to inform on risk-benefit for patients with non-squamous NSCLC and melanoma patients treated with nivolumab . The third and most recently approved assay is also a complementary diagnostic that was approved for patients with metastatic urothelial cancer considering treatment with the anti-PD-L1 therapy atezolizumab .
Although PD-L1 appears to enrich for response to anti-PD-1/L1 therapy in some disease settings, it has low Negative Predictive Value (NPV), which is of concern in life-threatening diseases such as the end-stage cancer setting, and low Positive Predictive Value (PPV). Adding to the complexity of applying PD-L1 IHC assay in clinical practice is that there are numerous separate diagnostic assays in development, and each might be tied to a different therapeutic agent. Existing tests for PD-L1 detection that have not been FDA approved will require analytical and clinical validation and it is unclear whether the assays will be interchangeable. Consequently, testing the same sample with different PD-L1 assays may yield different results even when used in accordance with the manufacturer’s instructions. The discrepancy in PD-L1 staining using different assays including negative results may be due in part to cellular, spatial, and temporal heterogeneity in PD-L1 expression, which is a dynamic marker of response to T cell activation and it is up-regulated on tumor cells by IFNγ. In addition, differences in antibody usage, various algorithms for scoring as well as cut-off values contribute to the challenge of data interpretation in the clinical setting for this marker.
T cell infiltrate
There are indications that an “inflamed” signature in tumors (i.e., the presence of T cell infiltrates) may be associated with improved clinical outcome in response to checkpoint inhibitors as compared with a “noninflamed” phenotype observed in tumors lacking a T cell infiltrate. In addition, significant correlation between the presence of tumor infiltrating lymphocytes (TILs) and the PD-L1 expression in the TME has been described .
Pre-treatment samples from melanoma patients who benefited from anti-PD-1 treatment showed a significantly higher density of CD8+ cells at both the invasive margin and the tumor center compared with the group of patients who experienced progression under the same treatment. However, the best predictive parameter for the probability of clinical response to PD-1 blocking therapy was high density of CD8+ T cells at the invasive tumor margin. The next best predictors were CD8+ cells in the tumor center, tumor and invasive margin PD-1 expression, and tumor and invasive margin PD-L1 expression . Classification of tumors into four groups on the basis of their PD-L1 status and presence or absence of TILs has the potential to identify pathways that should be targeted to elicit the best response for each tumor type . Furthermore, clinical responses to checkpoint blockade therapy were found to be associated with T helper type 1 (Th1) gene expression and elevated expression of IFNγ as well as IFNγ-inducible genes [36–38]. Suppressive Tregs and MDSCs may also have a role in negatively affecting the activity of anti-PD-L1-blockade in various tumors [39–41].
The pattern of expression of PD-L1 and tumor inflammation can also differ in tumor subtypes. For example, PD-1/PD-L1 receptors are differentially expressed in molecular subtypes of breast cancer (triple negative breast cancer (TNBC) vs. non-TNBC and colon cancer (CRC) (microsatellite-high (MSI-H) vs. microsatellite stable (MSS) cases). These subsets of immunogenic tumors (e.g., MSI-H CRC) attract TILs, which produce IFNγ that up-regulates PD-L1 on tumor cells and demonstrate characteristic of an inflamed phenotype, such as prominent tumor lymphocytic infiltrate and macrophages located at the invasive front of the tumor. In contrast, most non-inflamed tumors at baseline show a lack of PD-L1 by either tumor cells or tumor infiltrating immune cells. Thus, the presence of T cells and PD-1/PD-L1 can provide an indication for potential benefit of immunotherapy in aggressive subtypes of breast and colon cancers for which no targeted therapy is currently available [42, 43].
It has been previously shown that quantifying the densities of two lymphocyte populations—cytotoxic CD8+ T cells and memory T cells expressing CD45RO+ antigen, CD3+ and CD8+ T cells, or CD3+ and memory CD45RO+ T cells (CD3/CD45RO, CD3/CD8 or CD8/CD45RO)—both in the tumor core and in the invasive margin of tumors, termed “Immunoscore,” could predict survival of early-stage colorectal cancer patients [44, 45]. The prognostic value of the “Immunoscore” is currently undergoing clinical validation as an international effort (NCT01688232). Considering the importance of T cell infiltrate for cancer prognosis, the immune profiling may potentially serve as a predictive biomarker for certain type of immune manipulation, if it can be clinically validated.
Overall, these data suggest that pre-existing adaptive immunity as measured at the tumor level by CD8 T cell infiltration and their spatial distribution as well as PD-L1 expression may be required to predict clinical response to anti-PD-L1 inhibitors. In addition, the presence of Tregs, MDSC, or other T cell inhibitory molecules (such as LAG-3, TIM-3, and IDO) needs to also be characterized to provide a complete view of the interaction between cancer and immune system at the level of the individual patient.
5. Genomic landscape
Recent advances in next generation sequencing (NGS) technologies allow for rapid sequencing of large segments of an individual’s DNA including whole exomes (WES) and entire genome (WGS). NGS technologies utilize high-throughput approaches of clonally amplified or single molecule templates, which are then sequenced in a massively parallel fashion. NGS allows for the identification of a large panel of somatic mutations, i.e., mutational load across different types of cancer. Overall, patients who had tumors bearing a high frequency of somatic mutations like melanoma, NSCLC, and MSI-H colorectal cancer were significantly more likely to achieve clinical benefit from checkpoint blockade including CTLA-4 and PD-1 inhibitors [46–50]. The increased mutation load may activate adaptive immunity and attract CD8+ cell infiltrates, which results in the inflamed tumor phenotype. This suggests that genomic analysis to assess total mutational load could be incorporated in the treatment decision making process to determine who will benefit from immune-therapeutic approaches.
Improvements in computer algorithms to predict neoepitopes from exome sequences that are presented with MHC class I and II as potential targets to T cell receptors will allow further evaluation of the clinical relevance of somatic mutations. These neo-epitopes may aid in the identification of biomarkers to predict overall survival in tumors such as primary lung adenocarcinomas in response to immunotherapy . Putative immunogenic 9– and 10–amino acid neoantigens with affinity for HLA class I molecules using patient-specific nonsynonymous mutations based on HLA types were significantly associated with clinical benefit in some studies . However, the correlation between neoantigen load and clinical benefit diminished when increasingly stringent thresholds for affinity of binding were applied and recurrent neoantigens did not reveal any shared features or features exclusive to responders . These data suggest that clinical relevance of the neoantigens might depend on the proper antigen processing and neoepitope affinity as well as HLA expression, which is frequently aberrant in tumors. Better algorithms might be also needed to assess the immunoprotective properties of mutation derived neoepitopes.
Recent clinical trial data also demonstrated the utility of microsatellite instability (MSI) status as a predictive marker for response to PD-1 blockade in CRC patients treated with a checkpoint inhibitor pembrolizumab . Mismatch repair (MMR) deficiency occurs in a small fraction of CRC as well as cancers of the uterus, stomach, biliary tract, pancreas, ovary, prostate, and small intestine. Tumors with genetic defects in the MMR pathway are known to harbor hundreds to thousands of somatic mutations, especially in regions of repetitive DNA known as microsatellites, which result from deficient MMR machinery. Moreover, MMR–deficient tumors display prominent immune infiltration and Th1-T cells associated cytokine-rich environment as well as immune checkpoint receptors including PD-1 (and its ligand PD-L1), CTLA-4 and LAG-3, a finding consistent with a pre-existent immune response [42, 53–57].
WES of tumor samples followed by extensive bioinformatic analysis to identify immunogenic epitopes is not yet practical for routine diagnostic use. MSI testing, in contrast, is routinely performed in most diagnostic laboratories through the evaluation of selected microsatellite sequences or through an IHC based approach. Therefore, MSI testing has the potential to be an immediately useful approach to predict clinical benefit to PD-1/PD-L1 pathway inhibitors in patients with MMR deficient tumors.
Immunosequencing is a multiplex PCR-based method that amplifies rearranged TCR complementarity determining region (CDR) 3 sequences for a given TCR locus and exploits the capacity of high-throughput sequencing (HTS) technology to enumerate and quantify hundreds of thousands of TCR CDR3 chains simultaneously. Multiple V, D, and J gene segments exist in the germline genome. Initial receptor diversity is generated by recombination of V, D, and J segments, and additional non-templated diversity is introduced at the junctions by insertion of random nucleotides (N). The immunosequencing assay uses a multiplex PCR with forward primers in each V segment and reverse primers in each J segment. The TCR repertoire from circulating peripheral blood mononuclear cells has been profiled prior to and following administration of an anti-CTLA-4 blocking antibody . In response to the administration of the anti-CTLA-4 monoclonal antibody, there was a marked increase in both the “richness” (number of unique TCRβ sequences) of circulating T cells and the diversity of the T cell population. Interestingly, this increase appeared to be generalized, with no particular clone or subgroup of clones demonstrating a significantly greater increase than others. This observation suggests that clones that have been sequestered or “kept at bay” are somehow released by this therapeutic intervention. Of note, the degree of systemic toxicity associated with this form of therapy also correlated with increases in the richness and diversity metrics, suggesting that some of the clones being kept at bay are those that are capable of conferring more generalized inflammatory or autoimmune responsiveness. Biopsies of skin lesions from patients with metastatic melanoma were obtained and subjected to TCRβ immunosequencing analysis before treatment with anti-PD1 blocking monoclonal antibody [6, 59]. Patients whose tumors had the highest number of T cells and the more clonal T cell repertoire were most likely to respond to this therapy. Conversely, all of those patients whose total T cell number and clonality measure fell below the median for each of these parameters had progressive disease. Moreover, biopsies obtained more than 3 weeks following the initiation of the anti-PD-1 therapy showed that patients whose tumors showed significant expansion of pre-existing T cell clones in response to the therapy were most likely to have demonstrated a clinical response.
7. Multiplexed-gene expression profiling
While the focus of the approaches discussed earlier has been on tumor or immune cells, other technologies assessing predictive biomarkers in the immune-oncology space are focusing on the interaction of tumor cells with the TME including immune cells. Gene expression analysis of RNA levels incorporates a large amount of data that can have prognostic and predictive relevance and can be used to characterize both tumor and immune cells.
The nCounter Dx Analysis system (NanoString Laboratories, Inc.) uses gene-specific probe pairs that hybridize directly with the mRNA in solution eliminating any enzymatic reactions, and does not require RNA amplification that might introduce bias in the results. The nCounter Dx Analysis System assay simultaneously measures the expression levels of up to 800 target genes and a specific panel of immune response genes is also available. The instrument, reagents and software have received 510(k) clearance from the FDA for use with the Prosigna Breast Cancer Prognostic Gene Signature Assay .
Considering that there are clinically validated, multi-gene expression prognostic tests currently used in the clinical setting (such as OncoTypeDX, Prosigna, and Mammaprint, the latter two cleared by FDA through the 510(k) process), the probability of gene expression signatures to be developed as markers predicting response to immunotherapy is significant. In this regard, recent data showed that measuring immune-related biomarkers, including T cell specific, antigen presentation–related, and IFNγ signaling–related genes, may allow for improved selection of patients likely to respond to anti–PD-1 therapy with pembrolizumab consistent with the hypothesis that clinical responses to PD-1 blockade occur in patients with a preexisting interferon-mediated adaptive immune response [61, 62].
Pre-analytical and analytical validation
Although assays for immune-oncology are subject to the same analytical validation requirements as other bio-analytic assays, there are some basic differences that may impact the analytical validation process. Table 2 highlights the differences between single analyte bioassays (measuring a single protein or metabolite) vs. assays measuring immune response. Although immune response assays can be singular, most biomarkers will require multiparameter tests that depend on an increased number of controls, complex scoring algorithms, high-throughput performance data analysis, and results output. In addition, in the US, when a predictive marker will be used to direct patient enrollment or for patient stratification in clinical trials, the assay will need to be performed in a Clinical Laboratory Improvement Amendments (CLIA) laboratory. CLIA labs follow Clinical and Laboratory Standards Institute (CLSI) guidelines for determination of standard assay parameters such as precision, accuracy, limit of detection, specificity, and reference range. A typical analytical validation plan involves several steps in which the assay must be optimized for multiple parameters:
Sample-related (pre-analytic parameters)
Assay-related (analytical parameters)
Data-related (post-analytical parameters)
An important step in biomarker validation is the evaluation of pre-analytical factors that may affect assay performance due to specimen-related variability as outlined below (Fig. 1). For immunotherapies, there may be a need to monitor ex vivo immune responses in phenotypical or functional assays, which require high-quality samples to ensure reliable analytic output. To ensure that optimal pre-analytic processing regimens are followed, standard operating procedures (SOPs) for controlling specific biomarker development steps are essential. To create the best practice metrics, blood collection and storage media optimization protocols are often developed in conjunction with other pre-analytical parameters. General guidance on pre-analytical quality indicators and their harmonization, including analytical stability and laboratory quality control (QC) have been published .
To improve standardization of specimens, the US National Cancer Institute (NCI) has published best practice guidelines for biospecimen collections . In addition, specific guidelines for the analytical requirements of biomarkers have been set up [65, 66].
1. Whole blood and specific immune cell subsets assays
Pre-analytical processing of samples for diagnostic assays including those used for single cell immune response assays, such as ELISpot, flow cytometric analysis, and SCNP, includes patient-related factors such as tissue-ischemia time, pretreatment with drugs, dynamic nature of the analyte, and sample heterogeneity. Analyte stability can be affected by the sample collection process including anticoagulants used for blood draws, freezing/thawing, time between collection and testing, and storage conditions before processing. Guidance documents related to the handling of peripheral blood mononuclear cells (PBMC) has been published previously by the Immunology of Diabetes Society that contains recommendations and references addressing the various pre-analytical steps that need to be considered . Additional guidelines regarding isolation and preservation of PBMC for functional analysis are also available [67–70]. A highly relevant issue for immune-based assays is the avoidance of contamination with granulocytes  that are potent suppressors of T cell function in in vitro assays [72, 73]. Processing of fresh whole blood or PBMCs is not always practical in large clinical trials. Thus, cryopreservation of PBMCs is an alternative for the purpose of batching samples over time and for banking samples for future use. However, it can decrease cell viability and function and decrease yield. Therefore, it requires standardization between sites and infrastructure commitment to decrease the variability.
The optimal anticoagulants chosen to preserve blood samples are highly dependent on the type of target analyte (e.g., nucleic acid or protein), the specific blood cell type of interest (e.g., T cells, B cells, or NK cells), and the specific assay platform. As an example, a study addressing this issue for a gene expression profiling assay resulted in recommendations for Na2EDTA over formaldehyde as an RNA stability additive  whereas others have found that to preserve cell surface antigen integrity for flow cytometry, sodium heparin was optimal . Special collection tubes, chip-based devices, or media additives for preservation of particular cell subsets are increasingly being deployed to achieve better compatibility with multicenter based late stage clinical trials especially for “liquid biopsy” (circulating tumor cells , cell-free DNA , and exosomes ). These specialized tubes can be prohibitively costly when used in an exploratory banking setting. Thus, in trials testing undefined and exploratory biomarkers, blood cells, serum, and/or plasma may be banked under generalized conditions that may or may not be optimal for a particular analyte and platform.
Blood cell components
Immunotherapies targeting specific components of the immune system, e.g., innate, adaptive, memory, naïve cells, and Tregs, can affect both target cells as well as other cells across the immune system. Most of the therapies currently in development engage CD8+ cytotoxic cells, and assessment of cell-mediated cytotoxicity is an important measure to predict immunotherapy response. These therapies might, however, require the development and validation of assays to interrogate other cell subsets for which assays have not been routinely generated, including immune cell subsets such as B cells , monocytes/macrophages , MDSCs , natural killer (NK) cells , T helper cells, and other T cell subtypes (Tregs, naïve, and memory T cells) .
Different cell subsets require specific pre-analytical protocols, to preserve their cell type-specific functional qualities. To ensure delivery of meaningful results, concurrent assessment of integrity of multiple cell subsets during pre-analytical validation for an optimal combination of parameters (storage, collection, and processing) is highly recommended.
Flow cytometry allows for characterization of many subsets of cells, including rare subsets in a complex mixture such as blood. Flow cytometry can be used to assess not only expression of cell-surface proteins, but also that of intracellular phosphoproteins, cytokines, transcription factors, and functional readouts. The accurate measurements of variation in the human immune system requires precise and standardized assays to distinguish true biological changes from technical artifacts . Because flow cytometry remains highly variable with regard to sample handling, reagents, instruments set up, and data analysis the Human Immunology Project has been proposed for global standardization of flow-cytometry immunophenotyping. In addition, a repository of immunological data for data mining for biomarkers will be part of the project .
The ELISpot platform enables analysis of T, B, NK cells as well as of monocytes at the single cell level, though is mainly restricted to the functional aspect of cell analysis. For this platform, PBMC or TILs need to be isolated within a strict time frame to avoid granulocyte contamination and related suppression of functionality [84, 85]. Excellent guidance is provided in the latest CLSI document for the performance of single cell immune assays . Apoptotic cell contamination should be kept to a minimum . Overnight resting of previously frozen samples prior to the assay has been shown to remove apoptotic cells and restore functionality [87, 88].
Multiparametric technology platforms, such as SCNP, enable simultaneous analysis of the functional capacity of multiple and rare immune cell subsets without the need for cell subset isolation or novel sample processing procedures. Samples are drawn into standard sodium-heparin coated tubes, and where necessary, PBMCs are prepared using standard Ficoll separation and cryopreservation procedures for viable sample preparation and storage . Cell-subset identification is performed by in silico “isolation” of subsets that are identified by fluorochrome-conjugated antibodies recognizing phenotypic markers.
Plasma and serum
Circulating free proteins, chemokine, and cytokine levels can be measured using either plasma or serum samples. Circulating free DNA (cfDNA) in plasma is gaining significance as a monitoring tool for tumor progression and therapy response.
Because major differences exist in the protein profile of plasma and serum, it is important that once chosen as the primary sample type either serum or plasma is consistently used during the entire course of the validation of a blood biomarker test, unless these fluids have been shown to be interchangeable . Common variables to pay attention include: i) the nonlinear dilution pattern of majority of soluble cytokines, ii) preferential distribution behavior of different analyte levels in plasma, and iii) non-specific background that can affect signal reproducibility via inhibitory or stimulatory mechanisms. When no one matrix covers every target of interest, thorough validation is highly recommended to define the best matrices to obtain optimal performance, especially under multiplexed setups . For example, IL-6 was found to be significantly less represented in serum than in plasma, while the level of CXCL8 was found higher in serum than in plasma . For individual circulating proteins, chemokine and cytokine, quantitative immunoassays, such as singleplex enzyme-linked immunosorbent assay (ELISA), are frequently used. Multiplex platforms like Luminex or Meso Scale (MSD) technologies are commonly used for quantitation of groups of analytes.
For assay development using biofluids, including cfDNA or miRNA, background effects on the assay readout such as hemolysis should be assessed. The preference is for plasma because the clotting reaction for serum preparation not only alters the proteomic composition of the sample, but also contains DNA from leukocytes and thus is less suitable for tumor specific cfDNA analysis. It is feasible to use samples taken for routine hematology measurements, but lithium heparin tubes should be avoided as lithium is a PCR inhibitor [93, 94]. Consensus SOPs for the collection, processing, handling, and storage of serum and plasma samples for biomarker discovery and validation are available .
2. Tissue-based assays
Tissue based biomarkers can be measured on freshly frozen (FF) tumor samples or formalin fixed paraffin embedded (FFPE) tissue. FFPE tissue blocks are often available as archival materials as part of bio-banked samples for conventional IHC, which is the most widely used platform for biomarker assessment in diagnostic surgical pathology and for retrospective research. However, damage to the protein and nucleic acid frequently occurs through the fixation, embedding, and prolonged storage of FFPE samples.
IHC is a multi-step process that requires standardized conditions for tissue collection, fixation and processing, preparation of the IHC slide, and interpretation of the staining results. IHC based assays remain important tests as companion diagnostics (CDx) to assess antigen expression on diagnostic or surgical specimens for selecting patients and predicting patient-response to specific targeted therapies (e.g., HER2 expression for Herceptin), and more recently PD-L1 measurement as a CDx for pembrolizumab treatment of NSCLC patients. Published guidelines for measuring established biomarkers such as estrogen receptor, progesterone receptor, and HER2 are available [96, 97]. Of particular importance is the consideration of tissue collection and shipping of paraffin slides, which is a major challenge for multi-institution studies where central processing and banking is performed . General guidelines, including analyte stability and laboratory quality control, for performing analysis of tissue-based molecular biomarkers have been published .
Time is a critical factor throughout the biospecimen collection and processing period, especially for proteins that are highly labile. Minimizing the pre-analytic variability for IHC-based analysis needs to address tissue removal from the patient. It is generally accepted that 2 h of ischemia does not significantly alter the protein, DNA or RNA conformation, or preservation of microscopic features. To preserve antigenicity of PD-L1 in IHC assays, it is recommended to store slide-mounted tissue sections in the dark at 2-8 °C. In addition, staining within 6 months of sectioning is recommended for reliable interpretation of PD-L1 expression due to the instability of the antigen .
Time to fixation and the fixation period are also critical factors affecting the quality of both RNA and protein, especially phosphoproteins that are notoriously unstable depending on the time of fixation, duration of fixation, and the type of fixative . Published guidelines for optimal protein staining include fixation in 10 % neutral buffered formalin (NBF) for 24 h, dehydration in several changes of xylene and ethanol for 1.5-15 h, and embedding in paraffin for 0.5–4.5 h . For PD-L1 detection, fixation time for 12–72 h in 10 % NBF is recommended, as fixation times of ≤3 h may result in variable PD-L1 detection . The specific conditions, however, may vary from protein to protein due to the biochemical nature of the protein.
Embedding can have a great impact on pre-analytical and analytical variability especially when the presence of tumor immune-infiltrate is required to be integrated in the context of specific location in the tissue specimen, e.g., invasive tumor margin. Association of TILs (e.g., CD3, CD8) at the invasive margin in melanoma has been shown to correlate with response to PD-1 pathway inhibitors [35, 36]. T cell-infiltrate location (invasive margin and/or tumor center) has been previously identified as an important consideration in the “Immunoscore” algorithm for prognosis in CRC and a variety of other tumors . Standardization and consensus guidelines for TILs assessment in breast cancer to foster their integration into future clinical trials and diagnostic practice has also been published .
Antigen retrieval conditions also depend on the nature of the antigen and should be carefully controlled (e.g., the pH of the retrieval solution for PD-L1 must be 6.1 ± 0.2, as a pH below 5.9 may give erroneous results). Specific conditions, however, will vary due to the biochemical nature of the antigen, membrane vs. cytoplasmic or nuclear localization as well as variability of expression of the specific antigen in different histologies. To control pre-analytical requirements of the assay’s performance, running the test on a series of in-house tissues with known IHC performance characteristics representing known positive and negative tissues is recommended (reference samples).
Although IHC for a single marker remains a standard method in pathology laboratories, tumor stratification, in particular in immune-oncology, will likely require quantitative and multiple marker approaches to accurately define the multi-dimensional interactions between cancer and the immune system, which are relevant for clinical decision making. A standardized methodology for evaluating PD-L1 expression and TILs might be required as a prerequisite for integrating these parameters in standard histopathological practice as well as in clinical trials. Quantitative and multiplexed IHC and immunofluorescence-based platforms have been discussed in detail in publications resulting from other Biomarker Task Force activities (Additional file 1) .
Next Generation Sequencing (NGS)-based tests for tumor mutation analysis, similar to other complex molecular diagnostics, should demonstrate adequate analytical and clinical performance . They should follow SOPs that specifically address materials and procedures including patient’s sample type, method of DNA extraction as well as technical metrics for DNA quantification and quality, which can negatively impact sensitivity and reproducibility of the assay .
For somatic mutation detection using NGS assays, an important pre-analytical consideration is the collection and storage of quality controlled samples. Various standardized preservation methods have been developed for DNA  in various sample types including FFPE, FF tissues, and fine-needle biopsies [108, 109]. Nucleic acids, in particular DNA, are more stable than proteins and are therefore less sensitive to variation in sample processing, although formalin fixation has been shown to reduce DNA and RNA solubility and induce a high frequency of sequence alterations . An important factor is determining the minimal amount of FFPE material required for a NGS clinical assay. Usually a minimum of 80 % tumor content in the extracted material from FFPE tumor samples is required, but samples with as low as 10 % of tumor content have been used in research studies [105, 111].
Tumor enrichment using macro-dissection is helpful to quantitatively assess somatic variant allele frequency and copy number values (CNV). It also increases sensitivity and reproducibility of the data. Whole tumor section should be considered when assessing contribution of the tumor stroma, which could be important for quantitation of components of TME including immune system components such as TILs.
The quantity of DNA needed as input for an assay can vary depending on the analyte and assay platform. FFPE tumor DNA from clinical samples presents a challenge for mutation testing specifically when the DNA input from mutated cells is low, the DNA can be damaged, and C > T artifacts in DNA from the fixation and embedding process frequently occur. Amplification steps can be used before sequencing (i.e., library creation), but this process is associated with an increased risk of errors. Quantification of DNA and RNA can be performed by spectrophotometry, fluorimetry, or by PCR. Yet, absorbance does not reflect integrity of DNA since it does not measure fragmentation or degradation resulting from tissue processing. These limitations can be overcome by utilizing novel qPCR type approaches for input material optimization .
Immunosequencing of TCRβ for T cell clonality used a multiplex PCR and is routinely performed on genomic DNA extracted from FFPE samples. The size of the amplicon for TCRβ analysis is generally compatible with the level of degradation of DNA caused by the fixation process. Further refinements of the immunosequencing assay to make it even more robust on DNA extracted from FFPE samples are currently under development .
Gene expression-based tests
The preparation of intact and pure mRNA is one of the key factors in mRNA gene quantification. Extraction of nucleic acids and particularly RNA is very sensitive to nucleases. Thus, nuclease-free conditions should be implemented to control variability in steps such as sample collection, tissue fixation, and FFPE blocks handling including sectioning. For the extraction of nucleic acids from the FFPE tumor tissue, a method for the simultaneous isolation of high-quality DNA, RNA, and microRNA as well as protein from the same sample has been developed [114, 115].
To measure quality, the RNA Integrity Number (RIN) obtained from RNA electropherogram traces (e.g., Bioanalyzer traces) has been used traditionally as measures of FFPE RNA. However, RIN values from degraded FFPE fragments samples are not a sensitive measure of RNA quality and are not reliable predictors for successful library preparation. Illumina developed the DV200 metric to access FFPE RNA quality by accurately measuring the percentage of RNA >200 nucleotides. DV200 > 30 % of RNA samples ensures that degraded RNA fragments meet the requirements for efficient target capture and is a reliable predictor of library preparation .
Gene expression analysis using RNAseq, microarrays, or qPCR platforms on RNA prepared from FFPE tissues has been notoriously challenging due to poor quality RNA and the chemical modification of the nucleic acids. Furthermore, assessment of RNA degradation indicates that the degree of RNA fragmentation and the sensitivity to fragmentation depend on the specific transcript. Therefore, selecting a proper internal control gene from listed housekeeping genes for normalization is very critical for successful gene expression analysis using RNAseq analysis. However, other platforms, such as the Nanostring nCounter System, which have been optimized for RNA prepared from FFPE samples do not suffer from the same limitations. Specifically, NanoString probe code-set design and detection method appear to be able to accommodate the fragmented nature of FFPE tissue RNA better than most of the other currently available technologies.
Recent clearance by FDA under 510(k) regulation of the NanoString’s Prosigna (PAM50) gene signature panel showed that when using macro-dissected FFPE tissue slides as the starting sample, the reproducibility was quite high. The analytic validation of a gene expression prognostic signature has been recently published . The analytic studies described in the publication resulted in the optimal tissue and optimal RNA specifications required for acceptance of clinical samples in the marketed assay (i.e., tumor surface area in H&E stained slides >4 mm2/slide, tumor cellularity required (>10 %), and need for non-tumor tissue macro-dissection). These data suggest that gene expression profiling upon application of suitable controls and standard procedure can achieve a fit-for-purpose assay for successful clinical application .
3. Reagent qualification and stability
One of the crucial steps in the analytic validation of any assay is the qualification of the specific reagents, unique to each test. Chemical compounds can decompose under freeze and thaw cycles, and both short and long term storage conditions can affect cell processing and DNA/RNA extraction. The stability of the stock solutions, of the analyte and the internal standard should be evaluated at assay specific conditions. Conditions used in reagent stability testing should reflect situations likely to be encountered during actual sample handling, storage, and analysis.
As part of the qualification process of assay reagents, stability testing of critical reagents, such as primary antibodies, enzymes, and recombinant cytokines, should be performed to define stability windows and sample expiration dates. Clear directions in prequalification criteria for large-batch stored materials are highly recommended (e.g., a viability cut-off to qualify control donor PBMC used as an in-study quality control). For functional cell-based assays, such as ELISpot and SCNP that require cell-preconditioning, specific validated SOPs ensuring reproducibility are necessary [88, 118].
For example, in order to qualify reagents for SCPN, each antibody-fluorochrome conjugate is titrated independently against 3 qualified control samples to select the optimal titer in the relevant buffer conditions following reagent qualification SOPs. Cocktails comprising all components are then generated following SOPs and before incorporation in the assay are qualified for performance using SOP qualified control samples (cell lines and/or banked control PBMC from healthy donors). Modulators (e.g., cytokines, drugs, anti-TCR, or anti-BCR) are formulated and qualified for assay incorporation using standard samples as for the assay cocktails, testing for both positive and negative signaling (e.g., anti-TCR stimulation should induce signaling in the T cells but not B cells within the well).
Analytical validation involves confirming that the assay used for the biomarker measurement has established: i) Accuracy, ii) Precision, iii) Analytical sensitivity, iv) Analytical specificity, v) Reportable range of test results for the test system, vi) Reference intervals (normal values) with controls and calibrators, vii) Harmonized analytical performance if the assay is to be performed in multiple laboratories, and viii) Establishment of appropriate quality control measures. The requirements for analytical validation as well as their definitions are summarized in full in Table 3.
Analytic repeatability and reproducibility is a requirement for the implementation of all diagnostic tests and is particularly critical for predictive assays given the implications of misclassifications of patients for treatment. Use of positive and negative controls and standardized SOPs are required to assure reproducibility. Guidelines for the number of replicates needed to validate the performance of molecular diagnostic assays, as well as such considerations as the linearity of assay response, dynamic range, limits of detection, analyte stability within the intended matrix, and intra- and inter-laboratory coefficient of variability have been provided [19, 119].
Precision refers to closeness of agreement between a series of measurements and evaluates random error that may be identified as within-run, between-run within-day, between-day, or within-laboratory. Precision is quantitatively expressed in terms of the standard deviation (SD), variance, or coefficient of variation (CV) of a series of measurements. Precision is often a function of the analyte concentration, with small concentrations resulting in poorer precision (i.e., larger SD, variance, and CV) than high concentrations. Precision should be assessed at the medical decision points of relevance to the intended clinical application of the tumor biomarker. Precision is determined by reproducibility and repeatability of the assay which allow quantitative determination of the closeness of agreement among measurements. The reproducibility is generally measured by the % CV, which is defined as the standard deviation divided by the mean of the assay result expressed as a percent .
The FDA and European Medicines Agency (EMA) acceptance criteria for biological assays typically define the required between-run and within-run precision as CV of 10 or 15 % for quality control samples and 20 % for lower limit of quantification (LLOQ) samples [120, 121]. However, the new CLSI guidelines for single cell-based functional assays suggests larger CV acceptance (up to 30 %) and requires more repetitions (6 to 10 replicates) in assay validation to reflect the high degree of heterogeneity of the majority of live cell-based immune assays (including intracellular cytokine staining, HLA-peptide multimer assay, ELISpot, and cell proliferation assays) . It is important to note that the ultimate CV acceptance can only be evaluated in the clinical context in which the test is used (e.g., for a patient stratification assay variability around the test cutoff together with the distribution in the target patient population of the test results will need to be considered).
Depending on the particular category, an assay can require a distinct type of analytic validation. Definite quantitative assays make use of calibrators and a regression model to calculate absolute quantitative values for unknown samples. The reference standard must be well defined and should be a representative of the biomarker. This type of assay can be accurate and precise. In relative-quantitative assays, reference calibrators can be used; however, because standards are not fully representative of the biomarker, assay precision can be validated, while the accuracy of the assay can only be estimated.
Precision for single cell immune assays, e.g., ELISpot (including intra- and inter-assay variability as well as reproducibility) is a particularly critical validation parameter. Inherent variability of these assays should be adequately addressed as they are frequently used in the clinic to longitudinally monitor changes in immune parameters in response to an immune intervention (such a vaccine administration). Precision data are essential to render results of measurements at different time points that are comparable in a meaningful way i.e., an increase in the magnitude of measured responses after vaccination/treatment has to significantly differ from the determined variability. Precision testing includes replicate measurements of the same conditions in one experiment (repeatability) and repetition of the assay with the same samples on different days by all assay operators involved in a study (intra-assay precision) and in all participating laboratories (reproducibility), if applicable.
A rather challenging task with these assays is to determine accuracy, i.e., the closeness of agreement of the measured value and the true value. This is particularly true for ELISpot as well as for other single cell functional assays, due to the lack of a gold standard/test that is able to provide an exact measurement of antigen-specific cells in a given sample. Obtaining data on how accurate a laboratory performance is in relation to a specific assay can be achieved via participation in large proficiency panels that provide relative accuracy for a laboratory in comparison to other laboratories testing the same sample(s) in the same assay. An international ELISpot Proficiency Panel for IFNɣ is conducted on a yearly basis and is open for participation to any laboratory independent of affiliation or research background .
Efforts to harmonize classic single-cell immune monitoring assays have included the identification of critical assay steps, and guidelines for harmonized assay conduct have been made available (ELISpot [123–125], multimer staining [126, 127], intracellular cytokine staining [128–130] and Immunoscore [131, 132]). These efforts have been shown to dramatically reduce the variability among laboratories and provide a basis for the comparison of immune assay results obtained at different sites, or even across trials .
For SCNP, captured data include quantification of cell subset frequencies and specific intracellular read outs for each of the cell subsets in both the basal (unmodulated) and modulated state. In addition, various aspects of modulated signaling in each cell subset and/or signaling inhibition by in vitro drug exposure are captured by metrics that are computed by comparing data for cells subject to different conditions. In this manner, the degree of evoked signal, for example, is established by comparing data obtained in the modulated well for a specific donor sample with the data obtained from the same sample in the adjacent unmodulated well. The “Fold” metric is applied to measure magnitude of the responsiveness of a signal in a specific cell population relative to the unmodulated reference. The proportion of a cell population that is responsive to modulation is measured by the Uu (rank based metric based on Mann–Whitney U statistic) metric. Similarly, inhibited signaling is captured using both magnitude and population-based metrics .
Reproducibility of semi-quantitative assays such as IHC is a unique problem in that it is difficult to measure variation between assay results. For IHC assays, results are usually expressed as low, medium, or high or on a scale of 1 to 3. For such assays, reproducibility is generally measured in terms of the kappa (ĸ) statistic and percent agreement among different observers . Although there is no generally accepted value of ĸ that indicates the level of agreement, it has been suggested that ĸ <0.4 represents poor, 0.4–0.6 moderate, 0.6–0.8 significant, and 0.8 very good agreement: total agreement is indicated by a value of 1.0 .
A semi-quantitative assays do not use calibration standards but has a continuous response that is expressed in terms of a characteristic of the test sample. Precision can be validated but not accuracy. The ideal level of agreement or concordance in such assays is unclear, although a level of agreement of 85 % is considered to be acceptable. Inter-observer reproducibility might represent a major challenge to the reliable assessment of the IHC results in addition to tissue-processing.
2. Multiparametric assays
Validation and maintaining reproducibility of multiparametric assays is much more challenging considering the number of analytic variables associated with high content assays (such as NanoString, flow cytometry, SCNP, mutational load, and TCR sequencing). The capacity of high-throughput platforms, such as nCounter Dx Analysis System (NanoString) or flow cytometry based analysis SCNP enable multi-dimensional analysis of the immune system. Instead of detecting a single or limited number of molecular targets, assays are able to detect tens to hundreds of distinct molecular features simultaneously .
SCNP enables the simultaneous analysis of the functional capacity of multiple immune cell subsets in the same well. Controls for assay performance, reagents, and multiplexing are therefore required to validate reproducibility and precision . Multiplexed reagent “cocktails” are generated comprising 8 or more fluorochrome-conjugated antibodies that recognize both cell surface and intracellular phenotyping molecules (e.g., CD3, CD4, CD56, and FoxP3) and intracellular readouts of activity (e.g., p-Akt, and p-ERK) following sample modulation with selected stimuli. The use of pre-formatted lyophilized-reagent plates (Lyoplates, BD Biosciences) can help to decrease staining variability compared with using individual liquid reagents in multiple studies in immunophenotyping  as well as functional assays .
To control for multiplexing, each assay should be run with a well-characterized control for assay performance included in the top row of every plate (healthy control donor PBMCs or cell line). In addition, rainbow control particles included in the final column of each plate should be included to control for cytometer performance and enable normalization within and across plates. The control samples (typically healthy donor PBMCs) are typically from leukapheresed whole blood in which multiple vials of the same donor preparation are available and are qualified for use following a standard signaling panel defined by SOPs. Control donor bridging across assays is also performed where appropriate. When cell lines are used, batch preparations are made to cover multiple assay runs and are qualified following SOPs.
For NGS, assay performance characteristics include: accuracy (degree of agreement between the nucleic acid sequences derived from the assay and reference sequence); precision (the degree to which repeated sequence analyses give the same results); repeatability (within-run precision); reproducibility (between-run precision); and sensitivity (the likelihood that the assay will detect the targeted sequence variations, if present). Sensitivity also includes the probability that the assay will not detect a sequence variation when none is present. Two different NGS platforms using different chemistries for amplification based systems coupled to massively parallel sequencing are commonly used for NGS applications (Illumina TruSeq and Ion Torrent AmpliSeq). Each platform has specific parameters relevant to the laboratory and test requirements including instrument size, instrument cost, run time, read length, and cost per sample [116, 139, 140].
For WES and WGS, the focus of validation is on developing metrics that define a high-quality exome/genome, such as the average coverage across the exome/genome and the percentage of bases that meet a set minimum coverage threshold. The minimum acceptable level of the concordance of single nucleotide polymorphisms (SNPs) identified as compared with the reference should be established). Minimum coverage threshold necessary to determine variants relevant for the diagnostics need to be also established experimentally as low coverage increases the risk of missing low-level variants. Even after the macro-dissection step, patient tumor samples are still contaminated with normal cells derived from surrounding tissue or from reactive infiltrate, which may skew the representation of mutant alleles. The American College of Medical Genetics (ACMG) has developed clinical laboratory standards for NGS , which specifically address the unique challenges of WES/WGS .
The TCR immunosequencing assay is a Laboratory Developed Test (LDT) that has been CLIA and CAP certified. Data presented at the time of these certifications supported the following assay parameters: analytic accuracy, sensitivity, lower level of detection (LOD), lower limit of quantification (LLOQ), specificity (including interfering factors), linear reportable range, and precision.
Two methods have been analytically validated to determine MSI phenotype in colon cancer, yet neither is FDA approved/cleared. PCR analysis with a panel of mononucleotide markers (BAT-25, BAT-26, MONO-27, NRhwe21, and NR-24) and IHC based analysis of the MMR proteins (MLH1, MSH2, MSH6, and PMS2) have been proposed. Both tests show high reproducibility; however, IHC-based test, unlike PCR, has disadvantages such as dependence on antibody panels and challenges of analytical performance evaluation of the IHC based assay. CAP provides a detailed summary on several clinically important issues, such as the number and types of markers used, methods used to perform the assay, and definition of MSI-H and MSI-L phenotypes. This information is valuable to clinical laboratories that are currently offering this test as well as to those that are planning to launch this test for predicting response to anti-PD-1 inhibitors [142, 143].
3. Reference materials for immune assays
For efficient assay development, particular care must be given to establish the conditions that allow validation of the assay to meet required sensitivity and specificity by usage of well-defined standards. Inclusion of appropriate control materials to ensure that assays are working accurately and reproducibly is a key to the success of any assay. Each experiment must include controls that reflect both the analytical and post-analytical processing to assess artefactual findings leading to misinterpretation of experimental results. Ideally, consistent reference materials should be used across all stages of analytical validation. Table 4 provides a list of recommended standard materials as reliable controls for specific immune assays. There are two different types of reference materials depending on the purpose of application: i) validation references and ii) quality control references.
Reference materials are used in assay validation to estimate intra- and inter-run accuracy/precision and stability. Quality control reference materials are used during in-study sample analysis to accept or reject assay runs. For both types of reference materials, low (undetectable, <LOD) and high (maximum working concentration) reference levels can be established as negative and positive controls, respectively. The same biological sample can serve multiple purposes (e.g., as validation reference and quality control reference). However, a validation reference, by its nature, is used to show assay parallelism with patient samples, behaving with similar performance measurements (i.e., specificity, precision, and sensitivity), while the quality control references are used to test acceptance criteria.
Because of the lack of well-characterized and well-regulated “reference standard materials” (typically authorized by US Pharmacopeial Convention (USP) and National Institute of Standards & Technology (NIST) or other international agencies such as National Institute for Biological Standards and Controls (NIBSC), World Health Organization (WHO), etc.) for quantitative measures of immune analytes, reference materials often in the forms of biological samples are used to assess relative accuracy of an assay performance (cell lines and tissue specimens). To better reflect the complexity of immune cell-based assays, synthetic reference materials or “home-brew” references are created by preparing mixtures of known analyte(s) (e.g., recombinant proteins) at known concentrations.
Unlike quantitative assays in which the result is a continuous number expressed using an approved or certified reference standard, semi-quantitative assays, such as immune response assays, rarely have reference standards and are expressed in relation to a baseline characteristic of a sample. These assays generally lack calibrators but may have standards for the different categorical values that are usually not certified by a regulatory body.
For blood-based assays, the reference samples may include cell lines or control PBMC donor samples that are prepared and cryopreserved following SOPs to ensure standardized preparation. These controls are qualified for use following SOPs that define both the test and the required output data parameters for inclusion in the assay. For example, in SCNP a defined range of signaling across pre-specified nodes is used to qualify a sample for use as a control. The use of PBMC from leukapheresed whole blood enables the generation of large batches of control donor PBMC that can cross multiple assay runs. For T cell assays, specific TCR-engineered T cells can be obtained and used as performance control . “Bridging” samples are used to enable the transfer of one control donor to another over time and multiple assay runs in instances where one donor sample would be exhausted.
For IHC, cores containing positive and negative protein expressing or genetically modified cell lines that are extensively characterized using molecular assays, IHC, Western blot and fluorescent in situ hybridization (FISH) or well characterized tissue specimens are recommended to be included on the same slide. For example, human tonsil tissue is recommended for PD-L1 IHC as strong positive staining should be detected in portions of the crypt epithelium and weak to moderate staining of the follicular macrophages in the germinal centers. Negative staining should be observed in endothelium, fibroblasts, and surface epithelium . Cultured cell lines could represent an alternative source of material for quality control that are homogenous, uniform in quality, and can be processed and embedded in paraffin. Culture cell lines can be used as a control for the validity of the staining, but should not be used for interpretation of patients’ data . Efforts using validation of RNA levels for accurate PD-L1 detection is also ongoing .
Although relative quantitative assays constitute the great majority of immune response assays so far, RNA or DNA-based methods, such as NGS, TCR sequencing or gene expression profiling methods that may become predictive for response to immunotherapy, if validated, are highly quantitative due to availability of synthetic reference materials. Generally, major sequencing reagent providers have a set of standards that serve to control instrument performance in addition to standards for technical performance of the assay in order to conserve reads for clinical samples in a run.
The NIST recommended HapMap NA12878 control is used for standardization of platform performance when the data are compared with the well-curated, publically available data from different consortia, e.g., Genome in a Bottle (GIAB) Consortium for NA12878, which has extensively quality-controlled reference standard materials for analytical validation of NGS platforms, including DNA standard reference materials with high accuracy for whole genome sequences .
In the case of FFPE tissue-based tests for somatic mutations, control DNA samples available from companies, such as Horizon Dx or Acrometrix (Thermo Fisher, Inc.), provide controls with a clear readout of variant calls at defined positions that greatly aid in the development of somatic mutation assays. Use of controls that match anticipated specimens (such as FFPE controls) in addition to high quality, non-formalin fixed cellular HapMap control materials like NA12878 is particularly useful for establishing background error for formalin-fixation caused deamination based errors, e.g., high background of C/T variant calls and other fixation based artifacts as well as calculation of index calling efficiency with pipelines being utilized .
The immunosequencing assay makes use of independently chemically synthesized templates for every possible V and J combination for any locus for which the assay is developed . These templates provide a known set and frequency of rearranged sequences that allow for control of PCR-bias. They serve as internal controls for every reaction that is run. They can be distributed by a third party regulatory concern for use in laboratory proficiency testing.
Examples of synthetic reference materials also include synthetic vectors serving as reference to control amplification bias for DNA, and cDNA-based NGS, “alien” sequences (sequences of nucleotides which do not exist in humans) as negative controls for the nCounter platform .
The post-analytical phase of biomarker evaluations involves data interpretation of the assay results. Dichotomous variables are relatively straightforward to incorporate into calculations of data sensitivity and specificity. However, most variables in measurement of immune response are continuous, resulting in variability with respect to analytical performance criteria and clinical relevance of the assay, e.g., cutoff points for clinical decision making. Essentially, a cutoff for classifying a sample as positive or negative needs to be determined empirically by correlating results with clinical outcomes in a clinical trial exploring efficacy of a drug as discussed in Volume II.
Flow cytometry-based data interpretation considers many different aspects such as pre-defined gating and clustering strategies, choice of appropriate data transformation for data visualization, inclusion and exclusion criteria, and so on, as shown by numerous published harmonization efforts [129, 149–151]. The minimal reporting guidelines for biological and biomedical investigations (MIBBI) project include a series of reporting frameworks (http://mibbi.sourceforge.net/foundry.shtml) to guide scientific publishing and data reporting to specific web sites where independent analysis is possible. There are several “minimal information” sub-projects under assay or platform-specific focus groups. Flow cytometry (MIFlowCyt)  and T cell assays (MIATA) , NK cell assays (MIANKA), and FISH assay (MISFISHIE)  are those most relevant to immune status monitoring. These initiatives provide useful suggestions for scientific data reporting and may help researchers to determine the degree of laboratory details captured.
Immunohistochemical methods are notoriously nonlinear, and scoring systems are generally vulnerable to heterogeneity in intensity extent and topography of staining. Because of a lack of universal methods, scoring systems for IHC are usually based on characteristics of overall staining intensity using a scale of 0 to 3+ and subcellular localization . The main pitfalls of PD-L1 as a predictive biomarker may be related to both the variability in expression due to tumor heterogeneity as well as IHC assay variability due to different antibody clones, staining platforms, scoring systems, and clinical sampling points. These factors increase the uncertainty for using PD-L1 expression as a patient selection biomarker. Together, these challenges may contribute to the low NPV and PPV of PD-L1 as a predictive marker of clinical benefit to anti-PD-1/PD-L1 blockade.
There are numerous drugs in development targeting the PD-1/PD-L1 pathway; the practice has been to independently develop anti-PD-L1 IHC CDx for individual agents. The different PD-L1 IHC diagnostic kits and assays vary in different percentages of positive cells, scoring systems, and cutoff values (from 1 to 50 %), cells scored (tumor cells and/or infiltrating immune cells), and in the subcellular localization of staining (membrane vs. cytoplasmic). If each therapeutic was approved in conjunction with a specific CDx, this may present a challenge for testing and decision making in the clinic. Examples of tumor samples with different percentage of tumor cells staining for PD-L1 are shown in Fig. 2. PD-L1 immunostaining with a percentage of tumor cell staining of 50 % or higher was associated with significantly longer progression-free survival and overall survival than a lower than 50 % percent of stained cells in a KEYNOTE 001 trial with pembrolizumab in NSCLC. If each therapeutic was approved in conjunction with a specific CDx, this may present a challenge for testing and decision making in the clinic.
Thus, the FDA, the American Association for Cancer Research (AACR),and American Society of Clinical Oncology (ASCO) convened a workshop titled “Complexities in Personalized Medicine: Harmonizing Companion Diagnostics Across a Class of Targeted Therapies” to address comparability across multiple PD-L1 tests. A highlight of the workshop was the unveiling of a “blueprint” proposal developed by four pharmaceutical companies (Bristol-Myers Squibb, Merck & Co. Inc., AstraZeneca PLC, and Genentech, Inc.) and two diagnostic companies (Agilent Technologies, Inc./Dako Corp and Roche/Ventana Medical Systems, Inc.) to analytically cross-compare the four different diagnostics . The scope of this study was to establish technical comparability and to define the key performance parameters of each assay. Preliminary results of this effort were presented at the 2016 AARC annual meeting. Analyses from the Blueprint Project confirm that there is high concordance for the two approved PD-L1 diagnostics in NSCLC .
Because IHC is the cornerstone of hospital pathology, significant efforts to measure T cell immune infiltrates as potential predictive markers for clinical decision-making in immunotherapy have been focused in particular on multiplex quantitative IHC approaches. Image-based readouts for IHC using automated methods remove the subjectivity of the traditional system and provide more continuous and reproducible scoring of protein expression in tissue samples. The assessment of TILs by digital image analysis has the potential, for example, to determine the number of TILs per mm2 stromal tissue as an exact measurement contrary to the approximate semi-quantitative evaluation currently used. Automated quantitative analysis (AQUA) provides an automated IHC-based analysis and scoring system for assessing the target protein’s signal intensity normalized over the tumor areas and subcellular compartment of biological significance . AQUA has been noted as a promising new strategy for the measurement of hormone receptors testing in breast cancer tissue [158, 159].
Recently developed mass cytometry techniques with the ability to allow multiplexed and directly quantitative imaging of tissue samples helps to overcome many of the current IHC limitations. In these approaches, primary antibodies labeled with rare lanthanide metals with a unique mass that is easily assessed by time-of-flight mass spectrometry. Imaging software is used to re-construct the 2-D stained tissue image from the detected heavy metal ions. CyTOF (Cytometry by Time-Of-Flight) utilizes a laser to destroy the tissue/antibodies and free heavy metal ions. A two dimensional image is created that looks very similar to a routine IHC but with quantitative multiplexed information . Multiplexed ion beam imaging (MIBI) uses a scanning ion beam to liberate the metal ions, which improves the resolution but requires more specialized setup (vacuum, multiple detector MS) . These methods will likely allow for quantitative approaches and development of models to integrate vast amounts of immune response-related information and apply it into clinically applicable settings.
Given the huge amount of sequence data produced by NGS platforms, the development of accurate and efficient data handling and analysis pipelines is essential. NGS data analysis can be divided into four primary operations: (i) base calling, (ii) read alignment, (iii) variant calling, and (iv) variant annotation. A very large number of algorithms are available for each discrete step in data analysis. The accuracy of identifying variants greatly depends on the depth of sequence coverage and variant call quality scores vary between algorithms because of the weighting of quality scores for surrounding bases as well as positional context with respect to primer position and stretches of repetitive bases. Therefore, the final list of quality filtered base calls can be quite different when the same raw data is subjected to analysis with different data analysis software. Another common discrepancy between variant callers involves reporting only non-synonymous and deleterious mutations while other analysis provide a complete list of mutations without filtering for synonymous, coding vs. non-coding, and deleterious vs. tolerated mutations .
For NGS bioinformatics pipelines, a very large number of algorithms are available for each step in data analysis to assess the quality of raw NGS data available for whole exome data analysis, including data preprocessing, alignment, post-alignment processing, variant calling, annotation, and prioritization tools. Starting from available exome sequencing data, mutations can then be assessed for their immunogenic potential in the context of each patient’s MHC haplotype using epitope prediction algorithms. These algorithms provide an estimate of the total number of mutation-associated neoantigens in each tumor. Although the number of predicted mutation-associated neoantigens is usually small, it might be proportionate to the number of actual mutation-associated neoantigens, and tumors with a high number of actual mutation-associated neoantigens are more likely to stimulate the immune system to react against the tumor [51, 162, 163].
In the NanoString platform, the nSolver™ Analysis Software is a validated data analysis program for automatic QC, normalization, and data analysis. It performs automated background subtraction corrections; implements customized quality control on samples/lanes, runs the predictive algorithm, and provides customized sample/patient reports.
As high-throughput methods became widely available there is a need for computational methodologies for interpretation of the complex data for biological and clinical implications. Algorithms to develop multimodal signatures integrating various types of molecular tumor data (i.e., genomics, protein expression, and functional analyses) with TME factors that reflect the complex biomarker information require the development of multifactorial classifiers/algorithms. A list of commonly used bioinformatics tools for different high-throughput technologies have been provided and discussed in other publications from the SITC Immune Biomarkers Task Force activities .
Any software used to automate any part of the assay for clinical application must ultimately be validated for its intended use prior to clinical application, as required by 21 CFR §820.70(i) . In addition, computer systems used to create, modify, and maintain electronic records and to calculate multiplexed assay results (e.g., outputs of algorithmic models) are also subject to the same validation requirements.
Such computer systems must be validated to ensure accuracy, reliability, consistent intended performance, and the ability to discern invalid or altered records. Testing of device software functionality in a simulated use environment and user site testing are typically included as components of an overall design validation program for a software automated device. In large measure, software validation is a matter of developing a “level of confidence” that all requirements and user expectations for the software automated functions and features of the device are met.
The biological complexity of the tumor and immune system interaction contributes to multiple challenges associated with technical development of clinically applicable assays when evaluating different variables as markers of clinical benefit to immunotherapy. Recent developments in research and technologies have facilitated better understanding of this interaction and will provide means for development of such assays. However, each of the potential biomarkers and the associated assay demands high-quality validation so it can reach clinical application. To date, various promising candidate assays and platforms to predict response to immunotherapy are available, as discussed in this publication and other reports of the SITC Immune Biomarkers Task Force activity (Additional file 1). However, so far, only the PD-L1 IHC assays to inform anti-PD-1/PD-L1 treatment have been validated for clinical utility. Considering the increased relevance and emphasis on biomarker development in cancer immunotherapy, there is an enormous need to facilitate and improve the steps to demonstrate clinical value of molecular diagnostics in this space. Although many guidelines for assay validation are available, this review differs from previously published reports, as it covers the key steps in the entire process including: i) analytical validation (Volume I), ii) clinical validation, iii) the strategies for demonstration of clinical utility and iv) the regulatory approval process for clinically applicable diagnostics (Volume II) in the context of assays for immunotherapy response. Applying approaches and recommendations as outlined in this review should enable more efficient assay development to identify biomarkers, which are crucial to guide personalized therapy and for advancing immunotherapy options for cancer patients. Therefore, the implementation of the following practices/steps are recommended:
Ensure a fit-for-purpose approach for assay development, including biomarker selection and validation.
Specific quality-control and quality assurance practices for appropriate procurement for blood-based and the tissue-based assays for each specific biomarker should be considered.
Ensure that optimal pre-analytic processing regimens and standard operating procedures (SOPs) for controlling specific biomarker are followed.
Procedures with rigorous quality assurance, reproducibility, and control procedures built in should be considered for analytical validation step.
The interpretation of assay results must be complemented by proper reference standards, including reagents and assay controls (positive and negative controls, if appropriate).
Biostatistics and computerized approaches for data quantification and interpretation as well as algorithm development for multiplex signatures based on phenotypic, functional, and genomic data should be considered.
Bioinformatics approaches for the integration of complex, multicomponent, high-throughput types of molecular data from tumor and immune factor analysis should be considered.
To evaluate the robustness of semi-quantitative methods and to enable the analytical and clinical validation of biomarkers, reference standards and/or coordinated efforts across centralized laboratories (proficiency panels) are recommended.
General Guidance for Fit-for-purpose Biomarker Validation 
Best Practices for Biospecimen Resources, NCI, NIH 
List of Cleared or Approved Companion Diagnostic Devices, FDA 
Regulations of General Biological Products Standards, FDA 
Guidance for Gene Expression Profiling Platforms, FDA 
Principles of Analytical Validation for Immunohistochemical Assays 
Guidelines for Validation of Cell Based Fluorescence Assays 
Guidelines for Evaluation of Qualitative Test Performance 
Guidelines for Evaluation of Precision Performance of Clinical Chemistry Devices 
Guidelines for Verification of Precision and Estimation of Bias 
Guidelines for Quality Assurance for Immunohistochemistry 
Guidelines for Performance of Single Cell Immune Response Assays 
Guidelines for Enumeration of Immunologically Defined Cell Populations by Flow Cytometry 
American association for cancer research
Absolute eosinophil counts
American society of clinical oncology
B cell receptor
College of American pathologists
Chimeric antigen receptor
Complementarity determining region
Circulating free DNA
Clinical laboratory improvement amendments
Clinical and laboratory standard institute
Copy number values
Cytotoxic lymphocyte-associated antigen 4
Coefficient of variation
Enzyme-linked immunosorbent assay
European medicines agency
Food and drug administration
Fluorescent in situ hybridization
Genome in a bottle
Granulocyte-macrophage colony-stimulating factor
International clinical cytometry society
International society for laboratory hematology
Myeloid derived suppressor cells
Multiplexed ion beam imaging
Neutral buffered formalin
National cancer institute
Next generation sequencing
National institute for biological standards and controls
National institute of standards and technology
Natural killer cell
Negative predictive value
Non-small cell lung cancer
Peripheral blood mononuclear cells
Programmed cell death protein 1
Programmed cell death ligand 1
Positive predictive value
Renal cell carcinoma
RNA integrity number
Single cell network profiling
Society for immunotherapy of cancer
Standard operating procedure
T cell receptor
T-helper type 1
Tissue infiltrating lymphocytes
Triple negative breast cancer
Tumor necrosis factor
Regulatory T cells
US Pharmacopeial convention
Whole exome sequencing
Whole genome sequencing
World Health Organization
Pardoll DM. Immunology beats cancer: a blueprint for successful translation. Nat Immunol. 2012;13(12):1129–32. doi:10.1038/ni.2392.
Hodi FS, O’Day SJ, McDermott DF, Weber RW, Sosman JA, Haanen JB, et al. Improved survival with ipilimumab in patients with metastatic melanoma. N Engl J Med. 2010;363(8):711–23. doi:10.1056/NEJMoa1003466.
Robert C, Thomas L, Bondarenko I, O’Day S, Weber J, Garbe C, et al. Ipilimumab plus dacarbazine for previously untreated metastatic melanoma. N Engl J Med. 2011;364(26):2517–26. doi:10.1056/NEJMoa1104621.
Topalian SL, Hodi FS, Brahmer JR, Gettinger SN, Smith DC, McDermott DF, et al. Safety, activity, and immune correlates of anti-PD-1 antibody in cancer. N Engl J Med. 2012;366(26):2443–54. doi:10.1056/NEJMoa1200690.
Motzer RJ, Rini BI, McDermott DF, Redman BG, Kuzel TM, Harrison MR, et al. Nivolumab for metastatic renal cell carcinoma: results of a randomized phase II trial. J Clin Oncol. 2015;33(13):1430–7. doi:10.1200/jco.2014.59.0703.
Tumeh PC, Harview CL, Yearley JH, Shintaku IP, Taylor EJ, Robert L, et al. PD-1 blockade induces responses by inhibiting adaptive immune resistance. Nature. 2014;515(7528):568–71. doi:10.1038/nature13954.
Ascierto PA, Marincola FM. 2015: the year of anti-PD-1/PD-L1s against melanoma and beyond. EBioMedicine. 2015;2(2):92–3. doi:10.1016/j.ebiom.2015.01.011.
Ansell SM, Lesokhin AM, Borrello I, Halwani A, Scott EC, Gutierrez M, et al. PD-1 blockade with nivolumab in relapsed or refractory Hodgkin’s lymphoma. N Engl J Med. 2015;372(4):311–9. doi:10.1056/NEJMoa1411087.
Garon EB, Rizvi NA, Hui R, Leighl N, Balmanoukian AS, Eder JP, et al. Pembrolizumab for the treatment of non-small-cell lung cancer. N Engl J Med. 2015;372(21):2018–28. doi:10.1056/NEJMoa1501824.
Powles T, Eder JP, Fine GD, Braiteh FS, Loriot Y, Cruz C, et al. MPDL3280A (anti-PD-L1) treatment leads to clinical activity in metastatic bladder cancer. Nature. 2014;515(7528):558–62. doi:10.1038/nature13904.
Topalian SL, Drake CG, Pardoll DM. Immune checkpoint blockade: a common denominator approach to cancer therapy. Cancer Cell. 2015;27(4):450–61. doi:10.1016/j.ccell.2015.03.001.
Philips GK, Atkins M. Therapeutic uses of anti-PD-1 and anti-PD-L1 antibodies. Int Immunol. 2015;27(1):39–46. doi:10.1093/intimm/dxu095.
Lussier DM, Johnson JL, Hingorani P, Blattman JN. Combination immunotherapy with α-CTLA-4 and α-PD-L1 antibody blockade prevents immune escape and leads to complete control of metastatic osteosarcoma. J Immunother Cancer. 2015;3(1):1–11.
Smyth MJ. Abstract SY07-01: New targets in combination cancer immunotherapies. Cancer Res. 2015;75(15 Supplement):SY07–1-SY-1.
Perez-Gracia JL, Labiano S, Rodriguez-Ruiz ME, Sanmamed MF, Melero I. Orchestrating immune check-point blockade for cancer immunotherapy in combinations. Curr Opin Immunol. 2014;27:89–97. doi:10.1016/j.coi.2014.01.002.
Wolchok JD, Kluger H, Callahan MK, Postow MA, Rizvi NA, Lesokhin AM, et al. Nivolumab plus ipilimumab in advanced melanoma. N Engl J Med. 2013;369(2):122–33. doi:10.1056/NEJMoa1302369.
Larkin J, Chiarion-Sileni V, Gonzalez R, Grob JJ, Cowey CL, Lao CD, et al. Combined nivolumab and ipilimumab or monotherapy in untreated melanoma. N Engl J Med. 2015;373(1):23–34. doi:10.1056/NEJMoa1504030.
Hinrichs CS, Rosenberg SA. Exploiting the curative potential of adoptive T-cell therapy for cancer. Immunol Rev. 2014;257(1):56–71. doi:10.1111/imr.12132.
Lee JW, Devanarayan V, Barrett YC, Weiner R, Allinson J, Fountain S, et al. Fit-for-purpose method development and validation for successful biomarker measurement. Pharm Res. 2006;23(2):312–28. doi:10.1007/s11095-005-9045-3.
Martens A, Wistuba-Hamprecht K, Geukes Foppen MH, Yuan J, Postow MA, Wong P, et al. Baseline peripheral blood biomarkers associated with clinical outcome of advanced melanoma patients treated with ipilimumab. Clin Cancer Res. 2016;22(12):2908–18. doi:10.1158/1078-0432.ccr-15-2412.
Kenter GG, Welters MJ, Valentijn AR, Lowik MJ, der Meer DM B-v, Vloon AP, et al. Vaccination against HPV-16 oncoproteins for vulvar intraepithelial neoplasia. N Engl J Med. 2009;361(19):1838–47. doi:10.1056/NEJMoa0810097.
Walter S, Weinschenk T, Stenzl A, Zdrojowy R, Pluzanska A, Szczylik C, et al. Multipeptide immune response to cancer vaccine IMA901 after single-dose cyclophosphamide associates with longer patient survival. Nat Med. 2012;18(8):1254–61. doi:10.1038/nm.2883.
Kirkwood JM, Lee S, Moschos SJ, Albertini MR, Michalak JC, Sander C, et al. Immunogenicity and antitumor effects of vaccination with peptide vaccine+/−granulocyte-monocyte colony-stimulating factor and/or IFN-alpha2b in advanced metastatic melanoma: eastern cooperative oncology group phase II trial E1696. Clin Cancer Res. 2009;15(4):1443–51. doi:10.1158/1078-0432.ccr-08-1231.
Sheikh NA, Petrylak D, Kantoff PW, Dela Rosa C, Stewart FP, Kuan LY, et al. Sipuleucel-T immune parameters correlate with survival: an analysis of the randomized phase 3 clinical trials in men with castration-resistant prostate cancer. Cancer Immunol Immunother. 2013;62(1):137–47. doi:10.1007/s00262-012-1317-2.
Gulley JL, Arlen PM, Madan RA, Tsang KY, Pazdur MP, Skarupa L, et al. Immunologic and prognostic factors associated with overall survival employing a poxviral-based PSA vaccine in metastatic castrate-resistant prostate cancer. Cancer Immunol Immunother. 2010;59(5):663–74. doi:10.1007/s00262-009-0782-8.
Disis ML, Wallace DR, Gooley TA, Dang Y, Slota M, Lu H, et al. Concurrent trastuzumab and HER2/neu-specific vaccination in patients with metastatic breast cancer. J Clin Oncol. 2009;27(28):4685–92. doi:10.1200/jco.2008.20.6789.
Karakhanova S, Ryschich E, Mosl B, Harig S, Jager D, Schmidt J, et al. Prognostic and predictive value of immunological parameters for chemoradioimmunotherapy in patients with pancreatic adenocarcinoma. Br J Cancer. 2015;112(6):1027–36. doi:10.1038/bjc.2015.72.
Lacayo NJ, Alonzo TA, Gayko U, Rosen DB, Westfall M, Purvis N, et al. Development and validation of a single-cell network profiling assay-based classifier to predict response to induction therapy in paediatric patients with de novo acute myeloid leukaemia: a report from the Children’s oncology group. Br J Haematol. 2013;162(2):250–62. doi:10.1111/bjh.12370.
Cesano A, Willman CL, Kopecky KJ, Gayko U, Putta S, Louie B, et al. Cell signaling-based classifier predicts response to induction therapy in elderly patients with acute myeloid leukemia. PLoS One. 2015;10(4):e0118485. doi:10.1371/journal.pone.0118485.
Taube JM, Anders RA, Young GD, Xu H, Sharma R, McMiller TL, et al. Colocalization of inflammatory response with B7-h1 expression in human melanocytic lesions supports an adaptive resistance mechanism of immune escape. Sci Transl Med. 2012;4(127):127ra37. doi:10.1126/scitranslmed.3003689.
Taube JM, Klein A, Brahmer JR, Xu H, Pan X, Kim JH, et al. Association of PD-1, PD-1 ligands, and other features of the tumor immune microenvironment with response to anti-PD-1 therapy. Clin Cancer Res. 2014;20(19):5064–74. doi:10.1158/1078-0432.ccr-13-3271.
Dako. PD-L1 IHC 22C3 pharmDx Specification Sheet. 2015. http://www.dako.com/download.pdf?objectid=128206002. Accessed 3 Oct 2016.
Dako. PD-L1 IHC 28–8 pharmDx Specification Sheet. 2015. http://www.dako.com/download.pdf?objectid=128371004. Accessed 3 Oct 2016.
Ventana. PD-L1 (SP142 Assay) Specification Sheet. 2016. http://www.accessdata.fda.gov/cdrh_docs/pdf16/P160002c.pdf. Accessed 6 Sept 2016.
Teng MW, Ngiow SF, Ribas A, Smyth MJ. Classifying cancers based on T-cell infiltration and PD-L1. Cancer Res. 2015;75(11):2139–45. doi:10.1158/0008-5472.can-15-0255.
Herbst RS, Soria JC, Kowanetz M, Fine GD, Hamid O, Gordon MS, et al. Predictive correlates of response to the anti-PD-L1 antibody MPDL3280A in cancer patients. Nature. 2014;515(7528):563–7. doi:10.1038/nature14011.
Spranger S, Spaapen RM, Zha Y, Williams J, Meng Y, Ha TT, et al. Up-regulation of PD-L1, IDO, and T(regs) in the melanoma tumor microenvironment is driven by CD8(+) T cells. Sci Transl Med. 2013;5(200):200ra116. doi:10.1126/scitranslmed.3006504.
Ribas A RC, Hodi FS, Wolchok JD, Joshua AM, Hwu WJ, et al. Association of response to programmed death receptor 1 (PD-1) blockade with pembrolizumab (MK-3475) with an interferon-inflammatory immune gene signature. J Clin Oncol. 2015;33 [suppl; abstract 3001].
Meyer C, Cagnon L, Costa-Nunes CM, Baumgaertner P, Montandon N, Leyvraz L, et al. Frequencies of circulating MDSC correlate with clinical outcome of melanoma patients treated with ipilimumab. Cancer Immunol Immunother. 2014;63(3):247–57. doi:10.1007/s00262-013-1508-5.
deLeeuw RJ, Kost SE, Kakal JA, Nelson BH. The prognostic value of FoxP3+ tumor-infiltrating lymphocytes in cancer: a critical review of the literature. Clin Cancer Res. 2012;18(11):3022–9. doi:10.1158/1078-0432.ccr-11-3216.
Domingues P, Gonzalez-Tablas M, Otero A, Pascual D, Miranda D, Ruiz L et al. Tumor infiltrating immune cells in gliomas and meningiomas. Brain Behav Immun. 2015. doi:10.1016/j.bbi.2015.07.019
Llosa NJ, Cruise M, Tam A, Wicks EC, Hechenbleikner EM, Taube JM, et al. The vigorous immune microenvironment of microsatellite instable colon cancer is balanced by multiple counter-inhibitory checkpoints. Cancer Discov. 2015;5(1):43–51. doi:10.1158/2159-8290.cd-14-0863.
Mittendorf EA, Philips AV, Meric-Bernstam F, Qiao N, Wu Y, Harrington S, et al. PD-L1 expression in triple-negative breast cancer. Cancer Immunol Res. 2014;2(4):361–70. doi:10.1158/2326-6066.cir-13-0127.
Pages F, Kirilovsky A, Mlecnik B, Asslaber M, Tosolini M, Bindea G, et al. In situ cytotoxic and memory T cells predict outcome in patients with early-stage colorectal cancer. J Clin Oncol. 2009;27(35):5944–51. doi:10.1200/jco.2008.19.6147.
Mlecnik B, Tosolini M, Kirilovsky A, Berger A, Bindea G, Meatchi T, et al. Histopathologic-based prognostic factors of colorectal cancers are associated with the state of the local immune reaction. J Clin Oncol. 2011;29(6):610–8. doi:10.1200/jco.2010.30.5425.
Snyder A, Makarov V, Merghoub T, Yuan J, Zaretsky JM, Desrichard A, et al. Genetic basis for clinical response to CTLA-4 blockade in melanoma. N Engl J Med. 2014;371(23):2189–99. doi:10.1056/NEJMoa1406498.
Rizvi NA, Hellmann MD, Snyder A, Kvistborg P, Makarov V, Havel JJ, et al. Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science. 2015;348(6230):124–8. doi:10.1126/science.aaa1348.
van Rooij N, van Buuren MM, Philips D, Velds A, Toebes M, Heemskerk B, et al. Tumor exome analysis reveals neoantigen-specific T-cell reactivity in an ipilimumab-responsive melanoma. J Clin Oncol. 2013;31(32):e439–42. doi:10.1200/jco.2012.47.7521.
Carreno BM, Magrini V, Becker-Hapak M, Kaabinejadian S, Hundal J, Petti AA, et al. Cancer immunotherapy. A dendritic cell vaccine increases the breadth and diversity of melanoma neoantigen-specific T cells. Science. 2015;348(6236):803–8. doi:10.1126/science.aaa3828.
Campesato LF, Barroso-Sousa R, Jimenez L, Correa BR, Sabbaga J, Hoff PM, et al. Comprehensive cancer-gene panels can be used to estimate mutational load and predict clinical benefit to PD-1 blockade in clinical practice. Oncotarget. 2015;6(33):34221–7. doi:10.18632/oncotarget.5950.
McGranahan N, Furness AJ, Rosenthal R, Ramskov S, Lyngaa R, Saini SK et al. Clonal neoantigens elicit T cell immunoreactivity and sensitivity to immune checkpoint blockade. Science. 2016. doi:10.1126/science.aaf1490
Van Allen EM, Miao D, Schilling B, Shukla SA, Blank C, Zimmer L, et al. Genomic correlates of response to CTLA-4 blockade in metastatic melanoma. Science. 2015;350(6257):207–11. doi:10.1126/science.aad0095.
Le DT, Uram JN, Wang H, Bartlett BR, Kemberling H, Eyring AD, et al. PD-1 blockade in tumors with mismatch-repair deficiency. N Engl J Med. 2015;372(26):2509–20. doi:10.1056/NEJMoa1500596.
Timmermann B, Kerick M, Roehr C, Fischer A, Isau M, Boerno ST, et al. Somatic mutation profiles of MSI and MSS colorectal cancer identified by whole exome next generation sequencing and bioinformatics analysis. PLoS One. 2010;5(12):e15661. doi:10.1371/journal.pone.0015661.
Koopman M, Kortman GA, Mekenkamp L, Ligtenberg MJ, Hoogerbrugge N, Antonini NF, et al. Deficient mismatch repair system in patients with sporadic advanced colorectal cancer. Br J Cancer. 2009;100(2):266–73. doi:10.1038/sj.bjc.6604867.
Dudley JC, Lin MT, Le DT, Eshleman JR. Microsatellite instability as a biomarker for PD-1 blockade. Clin Cancer Res. 2016;22(4):813–20. doi:10.1158/1078-0432.ccr-15-1678.
Smyrk TC, Watson P, Kaul K, Lynch HT. Tumor-infiltrating lymphocytes are a marker for microsatellite instability in colorectal carcinoma. Cancer. 2001;91(12):2417–22.
Robert L, Tsoi J, Wang X, Emerson R, Homet B, Chodon T, et al. CTLA4 blockade broadens the peripheral T-cell receptor repertoire. Clin Cancer Res. 2014;20(9):2424–32. doi:10.1158/1078-0432.ccr-13-2648.
Cha E, Klinger M, Hou Y, Cummings C, Ribas A, Faham M, et al. Improved survival with T cell clonotype stability after anti-CTLA-4 treatment in cancer patients. Sci Transl Med. 2014;6(238):238ra70. doi:10.1126/scitranslmed.3008211.
Nielsen T, Wallden B, Schaper C, Ferree S, Liu S, Gao D, et al. Analytical validation of the PAM50-based prosigna breast cancer prognostic gene signature assay and nCounter analysis system using formalin-fixed paraffin-embedded breast tumor specimens. BMC Cancer. 2014;14:177. doi:10.1186/1471-2407-14-177.
Ribas A, Robert C, Hodi FS, Wolchok JD, Joshua AM, Hwu W-J et al. Association of response to programmed death receptor 1 (PD-1) blockade with pembrolizumab (MK-3475) with an interferon-inflammatory immune gene signature. J Clin Oncol. 2015;33((suppl; abstr 3001)).
Higgs BW RP, Blake-Haskins JA, Zhu W, Morehouse C, Brohawn PZ, Rebelatto MC, Yao Y, Jin X, Shi L, Ranade K. High tumoral IFNy mRNA, PD-L1 protein, and combined IFNy mRNA/PDL1 protein expression associates with response to durvalumab (anti-PD-L1) monotherapy in NSCLC patients. Abstract Book of the 40th ESMO Congress (ESMO 2015) Vienna, Austria. 2015(15 LBA).
Plebani M, Sciacovelli L, Aita A, Chiozza ML. Harmonization of pre-analytical quality indicators. Bioch Med. 2014;24(1):105–13. doi:10.11613/BM.2014.012.
Office of Biorepositories and Biospecimen Research, National Cancer Institute, National Institutes of Health, US Department of Health and Human Services. National Cancer Institute Best Practices for Biospecimen Resources. 2011. https://biospecimens.cancer.gov/bestpractices/2011-NCIbestpractices.pdf. Accessed 3 Oct 2016.
Chau CH, Rixe O, McLeod H, Figg WD. Validation of analytic methods for biomarkers used in drug development. Clin Cancer Res. 2008;14(19):5967–76. doi:10.1158/1078-0432.ccr-07-4535.
Lee JW, Weiner RS, Sailstad JM, Bowsher RR, Knuth DW, O’Brien PJ, et al. Method validation and measurement of biomarkers in nonclinical and clinical samples in drug development: a conference report. Pharm Res. 2005;22(4):499–511. doi:10.1007/s11095-005-2495-9.
Mallone R, Mannering SI, Brooks-Worrell BM, Durinovic-Bello I, Cilio CM, Wong FS, et al. Isolation and preservation of peripheral blood mononuclear cells for analysis of islet antigen-reactive T cell responses: position statement of the T-cell workshop committee of the immunology of diabetes society. Clin Exp Immunology. 2011;163(1):33–49. doi:10.1111/j.1365-2249.2010.04272.x.
Afonso G, Scotto M, Renand A, Arvastsson J, Vassilieff D, Cilio CM, et al. Critical parameters in blood processing for T-cell assays: validation on ELISpot and tetramer platforms. J Immunol Methods. 2010;359(1–2):28–36. doi:10.1016/j.jim.2010.05.005.
Clinical and Laboratory Standards Institute. CLSI document I/LA26-A2: Performance of Single Cell Immune Response Assays; Approved Guideline. 2nd ed. Wayne, PA; 2013.
Weinberg A, Song LY, Wilkening CL, Fenton T, Hural J, Louzao R, et al. Optimization of storage and shipment of cryopreserved peripheral blood mononuclear cells from HIV-infected and uninfected individuals for ELISPOT assays. J Immunol Methods. 2010;363(1):42–50. doi:10.1016/j.jim.2010.09.032.
McKenna KC, Beatty KM, Vicetti Miguel R, Bilonick RA. Delayed processing of blood increases the frequency of activated CD11b + CD15+ granulocytes which inhibit T cell function. J Immunol Methods. 2009;341(1–2):68–75. doi:10.1016/j.jim.2008.10.019.
De Rose R, Taylor EL, Law MG, van der Meide PH, Kent SJ. Granulocyte contamination dramatically inhibits spot formation in AIDS virus-specific ELISpot assays: analysis and strategies to ameliorate. J Immunol Methods. 2005;297(1–2):177–86. doi:10.1016/j.jim.2004.12.009.
Schmielau J, Finn OJ. Activated granulocytes and granulocyte-derived hydrogen peroxide are the underlying mechanism of suppression of t-cell function in advanced cancer patients. Cancer Res. 2001;61(12):4756–60.
Letzkus M, Luesink E, Starck-Schwertz S, Bigaud M, Mirza F, Hartmann N, et al. Gene expression profiling of immunomagnetically separated cells directly from stabilized whole blood for multicenter clinical trials. Clin Transl Med. 2014;3:36. doi:10.1186/s40169-014-0036-z.
Parkinson DR, Dracopoli N, Petty BG, Compton C, Cristofanilli M, Deisseroth A, et al. Considerations in the development of circulating tumor cell technology for clinical use. J Transl Med. 2012;10:138. doi:10.1186/1479-5876-10-138.
El Messaoudi S, Rolet F, Mouliere F, Thierry AR. Circulating cell free DNA: preanalytical considerations. Clin Chim Acta. 2013;424:222–30. doi:10.1016/j.cca.2013.05.022.
Sparrow RL, Chan KS. Microparticle content of plasma for transfusion is influenced by the whole blood hold conditions: pre-analytical considerations for proteomic investigations. J Proteomics. 2012;76 Spec No.:211–9. doi:10.1016/j.jprot.2012.07.013.
Deneys V, Thiry V, Hougardy N, Mazzon AM, Leveugle P, De Bruyere M. Impact of cryopreservation on B cell chronic lymphocytic leukaemia phenotype. J Immunol Methods. 1999;228(1–2):13–21.
Koryakina A, Frey E, Bruegger P. Cryopreservation of human monocytes for pharmacopeial monocyte activation test. J Immunol Methods. 2014;405:181–91. doi:10.1016/j.jim.2014.01.005.
Kotsakis A, Harasymczuk M, Schilling B, Georgoulias V, Argiris A, Whiteside TL. Myeloid-derived suppressor cell measurements in fresh and cryopreserved blood samples. J Immunol Methods. 2012;381(1–2):14–22. doi:10.1016/j.jim.2012.04.004.
Voshol H, Dullens HF, Den Otter W, Vliegenthart JF. Human natural killer cells: a convenient purification procedure and the influence of cryopreservation on cytotoxic activity. J Immunol Methods. 1993;165(1):21–30.
Strauss L, Bergmann C, Gooding W, Johnson JT, Whiteside TL. The frequency and suppressor function of CD4 + CD25highFoxp3+ T cells in the circulation of patients with squamous cell carcinoma of the head and neck. Clin Cancer Res. 2007;13(21):6301–11. doi:10.1158/1078-0432.ccr-07-1403.
Maecker HT, McCoy JP, Nussenblatt R. Standardizing immunophenotyping for the human immunology project. Nat Rev Immunol. 2012;12(3):191–200.
Bull M, Lee D, Stucky J, Chiu YL, Rubin A, Horton H, et al. Defining blood processing parameters for optimal detection of cryopreserved antigen-specific responses for HIV vaccine trials. J Immunol Methods. 2007;322(1–2):57–69. doi:10.1016/j.jim.2007.02.003.
Kierstead LS, Dubey S, Meyer B, Tobery TW, Mogg R, Fernandez VR, et al. Enhanced rates and magnitude of immune responses detected against an HIV vaccine: effect of using an optimized process for isolating PBMC. AIDS Res Hum Retrovir. 2007;23(1):86–92. doi:10.1089/aid.2006.0129.
Lenders K, Ogunjimi B, Beutels P, Hens N, Van Damme P, Berneman ZN, et al. The effect of apoptotic cells on virus-specific immune responses detected using IFN-gamma ELISPOT. J Immunol Methods. 2010;357(1–2):51–4. doi:10.1016/j.jim.2010.03.001.
Kutscher S, Dembek CJ, Deckert S, Russo C, Korber N, Bogner JR, et al. Overnight resting of PBMC changes functional signatures of antigen specific T- cell responses: impact for immune monitoring within clinical trials. PLoS One. 2013;8(10):e76215. doi:10.1371/journal.pone.0076215.
Santos R, Buying A, Sabri N, Yu J, Gringeri A, Bender J, et al. Improvement of IFNg ELISPOT performance following overnight resting of frozen PBMC samples confirmed through rigorous statistical analysis. Cells. 2014;4(1):1–18. doi:10.3390/cells4010001.
Hawtin RE, Cesano A. Immune monitoring technology primer: single cell network profiling (SCNP). J Immunother Cancer. 2015;3:34. doi:10.1186/s40425-015-0075-z.
Rosenberg-Hasson Y, Hansmann L, Liedtke M, Herschmann I, Maecker HT. Effects of serum and plasma matrices on multiplex immunoassays. Immunol Res. 2014;58(2–3):224–33. doi:10.1007/s12026-014-8491-6.
Yu Z, Kastenmuller G, He Y, Belcredi P, Moller G, Prehn C, et al. Differences between human plasma and serum metabolite profiles. PLoS One. 2011;6(7):e21230. doi:10.1371/journal.pone.0021230.
de Jager W, Bourcier K, Rijkers GT, Prakken BJ, Seyfert-Margolis V. Prerequisites for cytokine measurements in clinical trials with multiplex immunoassays. BMC Immunol. 2009;10:52. doi:10.1186/1471-2172-10-52.
Kirschner MB, Edelman JJ, Kao SC, Vallely MP, van Zandwijk N, Reid G. The impact of hemolysis on cell-free microRNA biomarkers. Front Genet. 2013;4:94. doi:10.3389/fgene.2013.00094.
Bettegowda C, Sausen M, Leary RJ, Kinde I, Wang Y, Agrawal N, et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci Transl Med. 2014;6(224):224ra24. doi:10.1126/scitranslmed.3007094.
Tuck MK, Chan DW, Chia D, Godwin AK, Grizzle WE, Krueger KE, et al. Standard operating procedures for serum and plasma collection: early detection research network consensus statement standard operating procedure integration working group. J Proteome Res. 2009;8(1):113–7. doi:10.1021/pr800545q.
Hammond ME, Hayes DF, Dowsett M, Allred DC, Hagerty KL, Badve S, et al. American society of clinical oncology/college of American pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer. J Clin Oncol. 2010;28(16):2784–95. doi:10.1200/JCO.2009.25.6529.
Wolff AC, Hammond ME, Hicks DG, Dowsett M, McShane LM, Allison KH, et al. Recommendations for human epidermal growth factor receptor 2 testing in breast cancer: American society of clinical oncology/college of American pathologists clinical practice guideline update. J Clin Oncol. 2013;31(31):3997–4013. doi:10.1200/JCO.2013.50.9984.
Economou M, Schoni L, Hammer C, Galvan JA, Mueller DE, Zlobec I. Proper paraffin slide storage is crucial for translational research projects involving immunohistochemistry stains. Clin Transl Med. 2014;3(1):4. doi:10.1186/2001-1326-3-4.
Cree IA, Deans Z, Ligtenberg MJ, Normanno N, Edsjo A, Rouleau E, et al. Guidance for laboratories performing molecular pathology for cancer patients. J Clin Pathol. 2014;67(11):923–31. doi:10.1136/jclinpath-2014-202404.
Burns JA, Li Y, Cheney CA, Ou Y, Franlin-Pfeifer LL, Kuklin N, et al. Choice of fixative is crucial to successful immunohistochemical detection of phosphoproteins in paraffin-embedded tumor tissues. J Histochem Cytochem. 2009;57(3):257–64. doi:10.1369/jhc.2008.952911.
Engel KB, Moore HM. Effects of preanalytical variables on the detection of proteins by immunohistochemistry in formalin-fixed, paraffin-embedded tissue. Arch Pathol Lab Med. 2011;135(5):537–43. doi:10.1043/2010-0702-RAIR.1.
Galon J, Angell HK, Bedognetti D, Marincola FM. The continuum of cancer immunosurveillance: prognostic, predictive, and mechanistic signatures. Immunity. 2013;39(1):11–26. doi:10.1016/j.immuni.2013.07.008.
Salgado R, Denkert C, Demaria S, Sirtaine N, Klauschen F, Pruneri G, et al. The evaluation of tumor-infiltrating lymphocytes (TILs) in breast cancer: recommendations by an international TILs working group 2014. Ann Oncol. 2015;26(2):259–71. doi:10.1093/annonc/mdu450.
Yuan J, Hegde PS, Clynes R, Foukas PG, Harari A, Kleen TO, et al. Novel technologies and emerging biomarkers for personalized cancer immunotherapy. J Immunother Cancer. 2016;4:3. doi:10.1186/s40425-016-0107-3.
Pant S, Weiner R, Marton MJ. Navigating the rapids: the development of regulated next-generation sequencing-based clinical trial assays and companion diagnostics. Front Oncol. 2014;4:78. doi:10.3389/fonc.2014.00078.
Rehm HL, Bale SJ, Bayrak-Toydemir P, Berg JS, Brown KK, Deignan JL, et al. ACMG clinical laboratory standards for next-generation sequencing. Genet Med. 2013;15(9):733–47. doi:10.1038/gim.2013.92.
Roder B, Fruhwirth K, Vogl C, Wagner M, Rossmanith P. Impact of long-term storage on stability of standard DNA for nucleic acid-based methods. J Clin Microbiol. 2010;48(11):4260–2. doi:10.1128/JCM.01230-10.
Hadd AG, Houghton J, Choudhary A, Sah S, Chen L, Marko AC, et al. Targeted, high-depth, next-generation sequencing of cancer genes in formalin-fixed, paraffin-embedded and fine-needle aspiration tumor specimens. J Mol Diagn. 2013;15(2):234–47. doi:10.1016/j.jmoldx.2012.11.006.
Xie R, Chung JY, Ylaya K, Williams RL, Guerrero N, Nakatsuka N, et al. Factors influencing the degradation of archival formalin-fixed paraffin-embedded tissue sections. J Histochem Cytochem. 2011;59(4):356–65. doi:10.1369/0022155411398488.
Wong SQ, Li J, Tan AY, Vedururu R, Pang JM, Do H, et al. Sequence artefacts in a prospective series of formalin-fixed tumours tested for mutations in hotspot regions by massively parallel sequencing. BMC Med Genomics. 2014;7:23. doi:10.1186/1755-8794-7-23.
McDonald SA, Mardis ER, Ota D, Watson MA, Pfeifer JD, Green JM. Comprehensive genomic studies: emerging regulatory, strategic, and quality assurance challenges for biorepositories. Am J Clin Pathol. 2012;138(1):31–41. doi:10.1309/ajcpxba69lnscvmh.
Sah S, Chen L, Houghton J, Kemppainen J, Marko AC, Zeigler R, et al. Functional DNA quantification guides accurate next-generation sequencing mutation detection in formalin-fixed, paraffin-embedded tumor biopsies. Genome Med. 2013;5(8):77. doi:10.1186/gm481.
Robins H. Immunosequencing: applications of immune repertoire deep sequencing. Curr Opin Immunol. 2013;25(5):646–52. doi:10.1016/j.coi.2013.09.017.
Pena-Llopis S, Brugarolas J. Simultaneous isolation of high-quality DNA, RNA, miRNA and proteins from tissues for genomic applications. Nat Protoc. 2013;8(11):2240–55. doi:10.1038/nprot.2013.141.
Kalmar A, Wichmann B, Galamb O, Spisak S, Toth K, Leiszter K, et al. Gene expression analysis of normal and colorectal cancer tissue samples from fresh frozen and matched formalin-fixed, paraffin-embedded (FFPE) specimens after manual and automated RNA isolation. Methods. 2013;59(1):S16–9. doi:10.1016/j.ymeth.2012.09.011.
Illumina. TruSeq RNA Access Techical Note. http://www.illumina.com/content/dam/illumina-marketing/documents/products/technotes/evaluating-rna-quality-from-ffpe-samples-technical-note-470-2014-001.pdf. Accessed 3 Dec 2016.
US Food and Drug Administration. Guidance for Industry and FDA Staff - Class II Special Controls Guidance Document: Gene Expression Profiling Test System for Breast Cancer Prognosis. 2007. http://www.fda.gov/MedicalDevices/DeviceRegulationandGuidance/GuidanceDocuments/ucm079163.htm
Cesano A, Rosen DB, O’Meara P, Putta S, Gayko U, Spellmeyer DC, et al. Functional pathway analysis in acute myeloid leukemia using single cell network profiling assay: effect of specimen source (bone marrow or peripheral blood) on assay readouts. Cytometry B Clin Cytom. 2012;82(3):158–72. doi:10.1002/cyto.b.21007.
Cummings J, Raynaud F, Jones L, Sugar R, Dive C. Fit-for-purpose biomarker method validation for application in clinical trials of anticancer drugs. Br J Cancer. 2010;103(9):1313–7. doi:10.1038/sj.bjc.6605910.
European Medicines Agency, Committee for Medicinal Products for Human Use (CHMP). Guideline on Bioanalytical Method Validation. 2011. http://www.ema.europa.eu/docs/en_GB/document_library/Scientific_guideline/2011/08/WC500109686.pdf. Accessed 3 Dec 2016.
US Food and Drug Administration. General Biological Products Standards: 21 Code of Federal Regulations 610 http://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfcfr/CFRSearch.cfm?CFRPart=610. Accessed 3 Oct 2016.
Immudex. MHC Multimer & Elispot Proficiency Panels. Copenhagen, Denmark. http://www.immudex.com/proficiency-panels.aspx.
Britten CM, Gouttefangeas C, Welters MJ, Pawelec G, Koch S, Ottensmeier C, et al. The CIMT-monitoring panel: a two-step approach to harmonize the enumeration of antigen-specific CD8+ T lymphocytes by structural and functional assays. Cancer Immunol Immunother. 2008;57(3):289–302. doi:10.1007/s00262-007-0378-0.
Janetzki S, Panageas KS, Ben-Porat L, Boyer J, Britten CM, Clay TM, et al. Results and harmonization guidelines from two large-scale international elispot proficiency panels conducted by the cancer vaccine consortium (CVC/SVI). Cancer Immunol Immunother. 2008;57(3):303–15. doi:10.1007/s00262-007-0380-6.
Janetzki S, Price L, Schroeder H, Britten CM, Welters MJ, Hoos A. Guidelines for the automated evaluation of Elispot assays. Nat Protoc. 2015;10(7):1098–115. doi:10.1038/nprot.2015.068.
Britten CM, Janetzki S, Ben-Porat L, Clay TM, Kalos M, Maecker H, et al. Harmonization guidelines for HLA-peptide multimer assays derived from results of a large scale international proficiency panel of the cancer vaccine consortium. Cancer Immunol Immunother. 2009;58(10):1701–13. doi:10.1007/s00262-009-0681-z.
Attig S, Price L, Janetzki S, Kalos M, Pride M, McNeil L, et al. A critical assessment for the value of markers to gate-out undesired events in HLA-peptide multimer staining protocols. J Transl Med. 2011;9:108. doi:10.1186/1479-5876-9-108.
Welters MJ, Gouttefangeas C, Ramwadhdoebe TH, Letsch A, Ottensmeier CH, Britten CM, et al. Harmonization of the intracellular cytokine staining assay. Cancer Immunol Immunother. 2012;61(7):967–78. doi:10.1007/s00262-012-1282-9.
McNeil LK, Price L, Britten CM, Jaimes M, Maecker H, Odunsi K, et al. A harmonized approach to intracellular cytokine staining gating: results from an international multiconsortia proficiency panel conducted by the cancer immunotherapy consortium (CIC/CRI). Cytometry A. 2013;83(8):728–38. doi:10.1002/cyto.a.22319.
Jaimes MC, Maecker HT, Yan M, Maino VC, Hanley MB, Greer A, et al. Quality assurance of intracellular cytokine staining assays: analysis of multiple rounds of proficiency testing. J Immunol Methods. 2011;363(2):143–57. doi:10.1016/j.jim.2010.08.004.
Galon J, Pages F, Marincola FM, Angell HK, Thurin M, Lugli A, et al. Cancer classification using the Immunoscore: a worldwide task force. J Transl Med. 2012;10:205. doi:10.1186/1479-5876-10-205.
Galon J, Mlecnik B, Bindea G, Angell HK, Berger A, Lagorce C, et al. Towards the introduction of the ‘Immunoscore’ in the classification of malignant tumours. J Pathol. 2014;232(2):199–209. doi:10.1002/path.4287.
Janetzki S, Britten CM. The impact of harmonization on ELISPOT assay performance. Methods Mol Biol. 2012;792:25–36. doi:10.1007/978-1-61779-325-7_2.
Mengel M, von Wasielewski R, Wiese B, Rudiger T, Muller-Hermelink HK, Kreipe H. Inter-laboratory and inter-observer reproducibility of immunohistochemical assessment of the Ki-67 labelling index in a large multi-centre trial. J Pathol. 2002;198(3):292–9. doi:10.1002/path.1218.
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–74.
Veldman-Jones MH, Brant R, Rooney C, Geh C, Emery H, Harbron CG, et al. Evaluating robustness and sensitivity of the NanoString technologies nCounter platform to enable multiplexed gene expression analysis of clinical samples. Cancer Res. 2015;75(13):2587–93. doi:10.1158/0008-5472.CAN-15-0262.
Nomura L, Maino VC, Maecker HT. Standardization and optimization of multiparameter intracellular cytokine staining. Cytometry A. 2008;73:984–91. doi:10.1002/cyto.a.20602.
Belouski SS, Wilkinson J, Thomas J, Kelly K, Wang SW, Suggs S, Ferbas J. Utility of lyophilized PMA and ionomycin to stimulate lymphocytes in whole blood for immunological assays. Cytometry B Clin Cytom. 2009;78:59–64. doi:10.1002/cyto.b.20492.
Gargis AS, Kalman L, Berry MW, Bick DP, Dimmock DP, Hambuch T, et al. Assuring the quality of next-generation sequencing in clinical laboratory practice. Nat Biotechnol. 2012;30(11):1033–6. doi:10.1038/nbt.2403.
Frampton GM, Fichtenholtz A, Otto GA, Wang K, Downing SR, He J, et al. Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing. Nat Biotechnol. 2013;31(11):1023–31. doi:10.1038/nbt.2696.
Chang KC, Marton MJ. Genomics clinical trial assay development: issues and lesson learned OMICS group eBook. 2015. doi:10.4172/978-1-63278-040-9-041.
Zhang L. Immunohistochemistry versus microsatellite instability testing for screening colorectal cancer patients at risk for hereditary nonpolyposis colorectal cancer syndrome. Part II. The utility of microsatellite instability testing. J Mol Diagn. 2008;10(4):301–7. doi:10.2353/jmoldx.2008.080062.
The College of American Pathology Technology Assessment Committee. Prognostic Uses of MSI Testing. 2011. http://www.cap.org/apps/docs/committees/technology/microsatellite_testing.pdf. Accessed 3 Oct 2016.
Bidmon N, Attig S, Rae R, Schroder H, Omokoko TA, Simon P, et al. Generation of TCR-engineered T cells and their use to control the performance of T cell assays. J Immunol. 2015;194(12):6177–89. doi:10.4049/jimmunol.1400958.
Clinical and Laboratory Standards Institute. Quality Assurance for Immunocytochemistry; Approved guideline. CLSI document MM4-A (1-56238-396-5), Wayne, PA; 1999.
Schalper KA, Velcheti V, Carvajal D, Wimberly H, Brown J, Pusztai L, et al. In situ tumor PD-L1 mRNA expression is associated with increased TILs and better outcome in breast carcinomas. Clin Cancer Res. 2014;20(10):2773–82. doi:10.1158/1078-0432.ccr-13-2702.
Zook JM, Chapman B, Wang J, Mittelman D, Hofmann O, Hide W, et al. Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls. Nat Biotechnol. 2014;32(3):246–51. doi:10.1038/nbt.2835.
Carlson CS, Emerson RO, Sherwood AM, Desmarais C, Chung MW, Parsons JM, et al. Using synthetic templates to design an unbiased multiplex PCR assay. Nat Commun. 2013;4:2680. doi:10.1038/ncomms3680.
Kvistborg P, Gouttefangeas C, Aghaeepour N, Cazaly A, Chattopadhyay PK, Chan C, et al. Thinking outside the gate: single-cell assessments in multiple dimensions. Immunity. 2015;42(4):591–2. doi:10.1016/j.immuni.2015.04.006.
Spidlen J, Moore W, Brinkman RR. ISAC’s Gating-ML 2.0 data exchange standard for gating description. Cytometry A. 2015;87(7):683–7. doi:10.1002/cyto.a.22690.
Tanqri S, Vall H, Kaplan D, Hoffman B, Purvis N, Porwit A, et al. Validation of cell-based fluorescence assays: practice guidelines from the ICSH and ICCS - part III - analytical issues. Cytometry B Clin Cytom. 2013;84(5):291–308. doi:10.1002/cyto.b.21106.
Lee JA, Spidlen J, Boyce K, Cai J, Crosbie N, Dalphin M, et al. MIFlowCyt: the minimum information about a flow cytometry experiment. Cytometry A. 2008;73(10):926–30. doi:10.1002/cyto.a.20623.
Britten CM, Janetzki S, van der Burg SH, Huber C, Kalos M, Levitsky HI, et al. Minimal information about T cell assays: the process of reaching the community of T cell immunologists in cancer and beyond. Cancer Immunol Immunother. 2011;60(1):15–22. doi:10.1007/s00262-010-0940-z.
Deutsch EW, Ball CA, Berman JJ, Bova GS, Brazma A, Bumgarner RE, et al. Minimum information specification for in situ hybridization and immunohistochemistry experiments (MISFISHIE). Nature Biotechnol. 2008;26(3):305–12. doi:10.1038/nbt1391.
Averbuch S, Emancipator K, McCaffery I, McElhinny A, Stanforth D, Walker J et al. A Blueprint Proposal for Companion Diagnostic Comparability. Washington; 2015. http://www.fda.gov/downloads/MedicalDevices/NewsEvents/WorkshopsConferences/UCM439440.pdf.
Comparison of Three Different PD-L1 Diagnostic Tests Shows a High Degree of Concordance, http://www.aacr.org/Newsroom/Pages/News-Release-Detail.aspx?ItemID=872#.WAweWPkrLiw. Accessed 18 Apr 2016.
Rubin MA, Zerkowski MP, Camp RL, Kuefer R, Hofer MD, Chinnaiyan AM, et al. Quantitative determination of expression of the prostate cancer protein alpha-methylacyl-CoA racemase using automated quantitative analysis (AQUA): a novel paradigm for automated and continuous biomarker measurements. Am J Pathol. 2004;164(3):831–40.
Camp RL, Chung GG, Rimm DL. Automated subcellular localization and quantification of protein expression in tissue microarrays. Nat Med. 2002;8(11):1323–7. doi:10.1038/nm791.
Allred DC, Carlson RW, Berry DA, Burstein HJ, Edge SB, Goldstein LJ, et al. NCCN task force report: estrogen receptor and progesterone receptor testing in breast cancer by immunohistochemistry. J Natl Compr Canc Netw. 2009;7 Suppl 6:S1–S21. quiz S2-3.
Giesen C, Wang HAO, Schapiro D, Zivanovic N, Jacobs A, Hattendorf B, Schuffler PJ, Grolimund D, Buhmann JM, Brandt S, Varga Z, Wild PJ, Gunther D, Bodenmiller B. Highly multiplexed imaging of tumor tissues with subcellular resolution by mass cytometry. Nat Methods. 2014;11:417–22. doi:10.1038/nmeth.2869.
Angelo M, Bendall SC, Finck R, Hale MB, Hitzman C, Borowsky AD, Levenson RM, Lowe JB, Liu SD, Zhao S, Natkunam Y, Nolan GP. Multiplexed ion beam imaging of human breast tumors. Nature Med. 2014;20:436–42. doi:10.1038/nm.3488.
Jones S, Anagnostou V, Lytle K, Parpart-Li S, Nesselbush M, Riley DR, et al. Personalized genomic analyses for cancer mutation discovery and interpretation. Sci Transl Med. 2015;7(283):283ra53. doi:10.1126/scitranslmed.aaa7161.
Lundegaard C, Lamberth K, Harndahl M, Buus S, Lund O, Nielsen M. NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8–11. Nucleic Acids Res. 2008;36(Web Server issue):W509–12. doi:10.1093/nar/gkn202.
US Food and Drug Administration. 21CFR820.70 Production and process controls. Revised April 1, 2015. https://www.gpo.gov/fdsys/granule/CFR-2012-title21-vol8/CFR-2012-title21-vol8-sec820-70
US Food and Drug Administration. List of Cleared or Approved Companion Diagnostic Devices (In Vitro and Imaging Tools). 2015. http://www.fda.gov/MedicalDevices/ProductsandMedicalProcedures/InVitroDiagnostics/ucm301431.htm. Accessed 3 Oct 2016.
Clinical and Laboratory Standards Institute. User Verification of Precision and Estimation of Bias; Approved Guideline - Third Edition. EP15-A3. 2014;34(12).
Fitzgibbons PL, Bradley LA, Fatheree LA, Alsabeh R, Fulton RS, Goldsmith JD, et al. Principles of analytic validation of immunohistochemical assays: guideline from the college of American pathologists pathology and laboratory quality center. Arch Pathol Lab Med. 2014;138(11):1432–43. doi:10.5858/arpa.2013-0610-CP.
Aziz N, Zhao Q, Bry L, Driscoll DK, Funke B, Gibson JS, et al. College of American Pathologists’ laboratory standards for next-generation sequencing clinical tests. Arch Pathol Lab Med. 2015;139(4):481–93. doi:10.5858/arpa.2014-0250-CP.
Schrijver I, Aziz N, Farkas DH, Furtado M, Gonzalez AF, Greiner TC, et al. Opportunities and challenges associated with clinical diagnostic genome sequencing: a report of the association for molecular pathology. J Mol Diagn. 2012;14(6):525–40. doi:10.1016/j.jmoldx.2012.04.006.
Barnett D, Louzao R, Gambell P, De J, Oldaker T, Hanson CA. Validation of cell-based fluorescence assays: practice guidelines from the ICSH and ICCS - part IV - postanalytic considerations. Cytometry B Clin Cytom. 2013;84(5):309–14. doi:10.1002/cyto.b.21107.
Tarhini AA, Edington H, Butterfield LH, Lin Y, Shuai Y, Tawbi H, et al. Immune monitoring of the circulation and the tumor microenvironment in patients with regionally advanced melanoma receiving neoadjuvant ipilimumab. PLoS One. 2014;9(2):e87705. doi:10.1371/journal.pone.0087705.
Di Giacomo AM, Calabro L, Danielli R, Fonsatti E, Bertocci E, Pesce I, et al. Long-term survival and immunological parameters in metastatic melanoma patients who responded to ipilimumab 10 mg/kg within an expanded access programme. Cancer Immunol Immunother. 2013;62(6):1021–8. doi:10.1007/s00262-013-1418-6.
Hodi FS, Lee S, McDermott DF, Rao UN, Butterfield LH, Tarhini AA, et al. Ipilimumab plus sargramostim vs ipilimumab alone for treatment of metastatic melanoma: a randomized clinical trial. JAMA. 2014;312(17):1744–53. doi:10.1001/jama.2014.13943.
Ku GY, Yuan J, Page DB, Schroeder SE, Panageas KS, Carvajal RD, et al. Single-institution experience with ipilimumab in advanced melanoma patients in the compassionate use setting: lymphocyte count after 2 doses correlates with survival. Cancer. 2010;116(7):1767–75. doi:10.1002/cncr.24951.
Wallden B, Pekker I, Popa S, et al. Development and analytical performance of a molecular diagnostic for anti-PD1 response on the nCounter Dx Analysis System. J Clin Oncol. 2016; 34(suppl; abstr 3034).
Piha-Paul SA, Bennouna J, Albright A, et al. T-cell inflamed phenotype gene expression signatures to predict clinical benefit from pembrolizumab across multiple tumor types. J Clin Oncol. 2016; 34(suppl; abstr 1536).
Man Chow LQ, Mehra R, Haddad RI, et al. Biomarkers and response to pembrolizumab (pembro) in recurrent/metastatic head and neck squamous cell carcinoma (R/M HNSCC). J Clin Oncol. 2016; 34(suppl; abstr 6010).
US Food and Drug Administration, Center for Drug Evaluation and Research. Guidance for Industry, Bioanalytical Method Validation. 2013. http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM368107.pdf. Accessed 3/10/2016.
Jennings L, Van Deerlin VM, Gulley ML. Recommended principles and practices for validating clinical molecular pathology tests. Arch Pathol Lab Med. 2009;133(5):743–55. doi:10.1043/1543-2165-133.5.743.
Clinical and Laboratory Standards Institute. Evaluation of Precision Performance of Clinical Chemistry Devices; Approved Guideline. NCCLS document EP5-A (ISBN 1-56238-368-X). Wayne, PA; 1999.
Clinical and Laboratory Standards Institute. User Protocol for Evaluation of Qualitative Test Performance; Approved Guideline—Second Edition. CLSI document EP12-A2. Wayne, PA; 2008.
Clinical and Laboratory Standards Institute. Enumeration of Immunologically Defined Cell Populations by Flow Cytometry; Approved Guideline—Second Edition. CLSI document H42-A2. Wayne, PA; 2007.
The authors thank SITC staff for administrative and organization support. In addition, the authors acknowledge Chelsey Meier, Ph.D. for editorial and medical writing assistance on behalf of SITC.
Availability of data and materials
This manuscript is the result of the collaborative effort of WG1 from the SITC Immune Biomarkers Task Force. All authors participated in aspects of the conception, drafting, critical review, and editing of this paper. In addition, all authors read and approved the final version of this manuscript.
JA is a full-time employee of Janssen Pharmaceuticals, Inc. AC is a full-time employee of NanoString Technologies. SJ is founder and President of ZellNet Consulting, Inc. RH is a full-time employee of Nodality, Inc. IK is a full-time employee of Adaptive Biotechnologies, Inc. PR is a full-time employee of Pfizer, Inc. JZ is a full-time employee of Intrexon Coporation. SRS is a full-time employee of Omni Array Biotechnology, LLC. The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
See related research of article https://jitc.biomedcentral.com/articles/10.1186/s40425-016-0179-0
About this article
Cite this article
Masucci, G.V., Cesano, A., Hawtin, R. et al. Validation of biomarkers to predict response to immunotherapy in cancer: Volume I — pre-analytical and analytical validation. j. immunotherapy cancer 4, 76 (2016). https://doi.org/10.1186/s40425-016-0178-1