The development of a cSMART-based integrated model for hepatocellular carcinoma diagnosis

Background Hepatocellular carcinoma (HCC) generally arises from a background of liver cirrhosis (LC). Patients with cirrhosis and suspected HCC are recommended to undergo serum biomarker tests and imaging diagnostic evaluation. However, the performance of routine diagnostic methods in detecting early HCC remains unpromising. Methods Here, we conducted a large-scale, multicenter study of 1675 participants including 490 healthy controls, 577 LC patients, and 608 HCC patients from nine clinical centers across nine provinces of China, profiled gene mutation signatures of cell-free DNA (cfDNA) using Circulating Single-Molecule Amplification and Resequencing Technology (cSMART) through detecting 931 mutation sites across 21 genes. Results An integrated diagnostic model called “Combined method” was developed by combining three mutation sites and three serum biomarkers. Combined method outperformed AFP in the diagnosis of HCC, especially early HCC, with sensitivities of 81.25% for all stages and 66.67% for early HCC, respectively. Importantly, the integrated model exhibited high accuracy in differentiating AFP-negative, AFP-L3-negative, and PIVKA-II-negative HCCs from LCs. Supplementary Information The online version contains supplementary material available at 10.1186/s13045-022-01396-z.

To the editor, Hepatocellular carcinoma (HCC) is the sixth most common cancer and ranks the fourth in cancer mortality worldwide, and patients with liver cirrhosis (LC) are at high risk of HCC [1,2]. Constantly elevated levels of alpha-fetoprotein (AFP) and other serum biomarkers including AFP-L3 and PIVKA-II generally indicate development of HCC; however, the performance of these biomarkers as diagnostic models for early HCC remains unpromising [3].
The utility of cancer-associated aberrations including genic mutations in cell-free DNA (cfDNA) for cancer detection is a global research hot spot [4,5]. Circulating Single-Molecule Amplification and Resequencing Technology (cSMART) is a detection platform that can simultaneously detect and quantitate multiple plasma DNA variants based on next-generation sequencing [6,7]. A total of 1702 individuals (healthy cohort, LC cohort, and HCC cohort) from nine clinical sites across China were enrolled from June 2018 through January 2019 in this study. In HCC cohort, 27 were excluded according to pathology diagnosis. Finally, 1675 participants (490 healthy controls, 577 LC patients, and 608 HCC patients) were randomly assigned to training/validation/test cohorts (Additional file 1: Fig S1). Detail information of these participants is shown in Additional files 1, 2: Tables S1-S8. 10 mL peripheral blood was provided from each individual for cSMART test at enrollment time.
We first constructed negative background pool using cfDNA samples from 490 healthy individuals. To explore the feature of cfDNA mutations in HCC and LC, 931 regions among 21 genes of 608 HCCs and 577 LCs were detected by cSMART. Top 20 gene mutation sites with high mutation frequency are detailed in Additional files 3, 4: Tables S9-S12. The overall mutation ratio of cfDNA in HCC was significantly higher than that in LC ( Fig. 1a and Additional file 1: Fig. S2). Then, detected mutations were minimized and finally three mutation sites located in different regions of gene TERT, TP53, and CTNNB1 were screened out to be further analysis. The performance of the single mutation gene site in the diagnosis of HCC is shown in Additional file 1: Table S13. A gradual increasing trend in variant allele frequency (VAF) at HCC-specific mutation sites from early HCC (BALC 0/A) to advanced HCC (BCLC C) was identified (Additional file 1: Fig. S3), proving that cSMART was sensitive for quantification of low-copy number DNA in plasma and could accurately reflect the tumor mutational burden.
By integrating three mutations of cfDNA and three serum biomarkers (AFP, AFP-L3, and PIVKA-II), Combined method was developed for diagnosis of HCC. AFP, the most commonly used biomarker, could detect 43 of 151 HCCs in test cohort, and 26 of 112 HCCs in validation cohort at the cutoff value of 400 ng/mL, and achieved diagnostic sensitivity of 56.29%/48.21% at specificity of 91.03%/93.18% in test cohort or validation cohort at 20 ng/mL cutoff value. Combined method showed better performance compared with AFP, detecting 135 of 151 HCCs with a sensitivity of 89.40% at 80.69% specificity in test cohort. More, the sensitivities of this model to detect HCC at BCLC 0 and A were 60.00% and 83.87%, respectively ( Table 1). The same conclusion could also be drawn from the data of the independent validation cohort ( Table 1). Receiver operating characteristic (ROC) curve further corroborated that this cfDNA-based integrated diagnostic model was significantly superior to AFP in the diagnosis of HCC (Fig. 1b).
Next, the accuracy of Combined method to differentiate HCC from LC was evaluated in different subgroups and compared with GALAD and AFP. In test cohort, this model could not only distinguish AFP-positive HCC from LC (accuracy: 95.56%), but also detect AFP-negative HCC who might be missed by conventional diagnostic approaches (accuracy: 83.27%). Furthermore, Combined method exhibited high accuracy for HCC diagnosis in both AFP-L3/PIVKA-II-positive and AFP-L3/PIVKA-IInegative subgroups, outperforming current commonly used biomarkers without over diagnosis (Fig. 1c). In addition, Combined method held high accuracy in diagnosis of liver tumors with any size irrespective of age, gender, Child-Pugh stage, HBV infection status, and cirrhosis history and showed much better performance in detecting early and very early HCC (accuracy: BCLC 0: 60.00%; BCLC A: 83.33%) than GALAD and AFP (Fig. 1d, e). Subsequently, the above conclusions were further confirmed in validation cohort (Additional file 1: Fig. S4).
In conclusion, we developed a retrospective phase 3 study according to the criteria for biomarker development delineated by Pepe et al., identified the unique cfDNA hotspot mutation signature of HCC, and constructed Combined method based on three mutation sites and three serum biomarkers [8]. Combined method has fixed indicators and simple detection process, outperforming conventional approaches in the diagnosis of HCC, especially early HCC, in a noninvasive way. Our model holds great potentials to be incorporated into

Supplementary Information
The online version contains supplementary material available at https:// doi. org/ 10. 1186/ s13045-022-01396-z. Table S1: Basic information of enrolled patients. Table S2: Brief summary of all participants. Table S13: Performance of single mutation site in the diagnosis of HCC. Table S3: Detail information of 345 HCC patients in training cohort. Table S4: Detail information of 344 LC patients in training cohort. Table S5: Detail information of 151 HCC patients in test cohort. Table S6: Detail information of 145 LC patients in test cohort. Table S7: Detail information of 112 HCC patients in validation cohort. Table S8: Detail information of 88 LC patients in validation cohort. Table S9: Information of top 20 gene mutation sites. Table S10: Analysis of top 20 gene mutations in paired cfDNA and tissue samples of HCC patients. Table S11: Analysis of top 20 gene mutations in paired cfDNA and tissue samples of LC patients. Table S12: Analysis of top 20 gene mutations in paired cfDNA and tissue samples of healthy controls.