Ultrasound-based radiogenomics: status, applications, and future direction
Article information
Abstract
Radiogenomics, an extension of radiomics, explores the relationship between imaging features and underlying gene expression patterns. This field is instrumental in providing reliable imaging surrogates, thus potentially representing an alternative to genetic testing. The rapidly growing area of radiogenomics that utilizes ultrasound (US) imaging seeks to elucidate the connections between US image characteristics and genomic data. In this review, the authors outline the radiogenomics workflow and summarize the applications of US-based radiogenomics. These include the prediction of gene variations, molecular subtypes, and other biological characteristics, as well as the exploration of the relationships between US phenotypes and cancer gene profiles. Although the field faces various challenges, US-based radiogenomics offers promising prospects and avenues for future research.
Introduction
Medical imaging, a traditional diagnostic method, has become increasingly important in personalized precision medicine and clinical decision-making [1,2]. Recent advancements in image informatics technology have enabled the high-throughput extraction of quantitative data from digital medical images, prompting the emergence of the field of radiomics. The formal framework of radiomics, first proposed in 2012 [3], involves the extraction of numerous features from images using high-throughput computing and transforming them into mineable data [4]. Furthermore, progress in "omics" technologies, such as genomics, transcriptomics, and proteomics, has facilitated the comprehensive study of various aspects of biological systems [5]. Since the completion of the Human Genome Project, genomic information has been instrumental in uncovering the genetic variations that underlie human diseases, revolutionizing diagnosis, prognosis, and the prediction of treatment responses [6].
Broadly speaking, imaging genomics, also known as radiogenomics, investigates the relationships between imaging features and genetic markers, such as gene mutations, patterns of gene expression, and other relevant traits [7-9]. These imaging features may be qualitative or quantitative, with the latter often involving high-throughput radiomics. However, the definition of radiogenomics has narrowed in recent years due to advancements in radiomics research (Fig. 1) [3,7,9-14]. Radiogenomics, employing radiomics methodologies, is now specifically recognized as an application that studies correlations between quantitative imaging features and genomic characteristics [15,16]. Additionally, the term "radiogenomics" can refer to the investigation of genetic germline variations associated with normal tissue toxicity following radiotherapy [17]. In this review, "radiogenomics" is used in accordance with the first interpretation. In recent years, radiogenomics has demonstrated potential for rapid and non-invasive genotyping across various cancer types by uncovering links between the quantitative imaging characteristics of tumor phenotypes and genomic signatures. This has contributed to the shift toward precision medicine [12,18].

Historical evolution of the concept of radiogenomics.
The blue and orange boxes represent two components that are associated in radiogenomics studies. The blue boxes refer to imaging data and the orange boxes to genomics data. MRI, magnetic resonance imaging; CT, computed tomography; PET, positron emission tomography.
The advancement of artificial intelligence (AI) technologies, including machine learning (ML) and deep learning (DL), has demonstrated benefits in the detection, diagnosis, and classification of diseases [19-21]. AI has also been employed in radiogenomics for image processing and the development of powerful models that aid in making optimal clinical decisions [22].
The primary imaging modalities utilized in radiogenomics include magnetic resonance imaging (MRI), computed tomography (CT), positron emission tomography (PET), and ultrasound (US). US, a key area of medical imaging, offers distinct advantages such as the absence of ionizing radiation, portability, real-time imaging capability, and cost-effectiveness. Meanwhile, AI-based radiomics has considerably improved the accuracy and efficiency of US diagnostics by providing objective and quantitative assessments of US images [23,24]. US-based radiogenomics is an emerging field that relates US imaging features with gene expression patterns or genetic mutations. This area of research has been garnering increased research interest and shows considerable promise.
In this review, the authors present a brief overview of the US-based radiogenomics workflow and summarize pertinent research regarding thyroid, breast, liver, prostate, ovarian, and brain tumors. Furthermore, they examine the limitations and current challenges facing US-based radiogenomics and discuss its potential future applications.
Overview of Radiogenomics Workflows
US-based radiogenomics studies typically integrate the methodologies of US-based radiomics and genomics. Quantitative features are extracted, processed, and analyzed from US images and then associated with specific outcomes. Radiomics methods can be categorized as either classic/conventional radiomics, which involves extracting handcrafted features using conventional ML, or DL radiomics, which employs fully automated data analysis pipelines. The workflow of a radiogenomics study, specifically combining conventional radiomics and genomics, can generally be described in four steps (Fig. 2): (1) image and data acquisition, (2) image segmentation, (3) feature extraction and selection, and (4) data analysis and modeling for radiogenomics outcomes.

Applications of ultrasound (US)-based radiogenomics.
Genomic features are associated with ultrasound imaging features in tumors of the brain, thyroid, breast, liver, prostate, and ovary. IDH, isocitrate dehydrogenase; TERTp, telomerase-reverse transcriptase gene promoter; 1p/19q, chromosome arms 1p and 19q; ER, estrogen receptor; PR, progesterone receptor; HER2, human epidermal growth factor receptor 2.
(1) Image and data acquisition: The workflow begins with radiologists using consistent acquisition parameters to scan and obtain images of lesions. A range of modalities for US imaging can be utilized, including B-mode US (BUS), contrast-enhanced ultrasound (CEUS), color Doppler flow imaging (CDFI), and ultrasound elastography (UE). The selected high-quality US images are then recorded and stored in Digital Imaging and Communications in Medicine format. Adherence to standardized imaging protocols is crucial during data collection to minimize unwanted variability that could confound the results [25]. On the genomic front, molecular pathology data are acquired through genomic analyses such as microarray, DNA and RNA sequencing, or immunohistochemistry (IHC) techniques.
(2) Image segmentation: Segmentation is a critical step in the workflow because the extracted features depend upon the defined region of interest (ROI). Techniques for delineating the ROI include automatic, semi-automatic, and manual segmentation. In US-based radiogenomic studies, the ROI is most frequently demarcated manually (Table 1). Although manual segmentation by experienced radiologists is widely practiced, it is acknowledged to be time-consuming and subject to inter-observer variability. In contrast, automatic segmentation, which requires minimal operator intervention, provides more accurate and reproducible boundaries [33]. Semi-automatic segmentation may represent an effective compromise, allowing the expert to refine the output of the algorithm [34].
(3) Feature extraction and selection: An essential step in the radiogenomics process is the extraction of high-dimensional features from defined ROIs to characterize phenotypes. The extraction of US radiomics features from ROIs is typically performed using specialized software, such as PyRadiomics (https://pyradiomics.readthedocs.io/). Extracted features are primarily categorized into shape-based features, intensity-based (first-order) features, texture (second-order) features, higher-order features, and additional features based on filtering and transformation [4]. The Image Biomarker Standardization Initiative (IBSI) provides guidelines that are recommended for detailed descriptions of radiomics features and their corresponding equations [35]. In the process of feature extraction, hundreds or thousands of features may be computed. Due to potential redundancy among these features, it is crucial to select those that capture essential information to avoid overfitting. Techniques such as least absolute shrinkage and selection operator (Lasso) regression and maximum relevance minimum redundancy are often used to aid in choosing the most informative features by reducing the dimensionality of the dataset.
(4) Modeling for outcomes: Subsequently, ML techniques can be utilized to relate the extracted image features with genomic data and to build predictive models. Conventional ML methods, including logistic regression, random forest, support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) are often applied in the development of models that predict cancer molecular subtypes, gene mutations, or gene expression patterns. The performance of these models, derived from the training set, should be validated or tested on a separate testing set using evaluation metrics such as receiver operating characteristic curve, calibration curve, and clinical decision curve analysis. External validation is considered more credible than internal validation, as the results are typically more robust when validated in independent datasets [2].
DL models are based on a number of advanced neural networks, including deep convolutional neural networks (CNNs), transformers, and other deep neural networks. DL can be integrated into radiomics or genomics research, enabling models to automatically process medical imaging or genomic data as input and produce outcomes that align with the objectives of the study. In radiogenomics studies, DL can serve as a method for association in the mapping process or can be applied to create new model architectures that utilize both imaging data and molecular data.
Application
US-based radiogenomics is still in its nascent phase, with investigations focused on a limited number of cancer types, including those of the thyroid, breast, liver, prostate, ovary, and brain. This section examines US-based radiogenomics research that seeks to uncover correlations between radiomics signatures and gene expression profiles (GEPs) or biological pathways. Additionally, it discusses studies aiming to predict genetic alterations and molecular subtypes across various cancers (Fig. 3).

General workflow of ultrasound (US)-based radiogenomics studies.
Initially, both US image and gene-related data are collected from patients and processed. Subsequently, machine learning or deep learning algorithms are employed to integrate radiomics data with genomic or other “omics” data to construct a model. The results of the radiogenomics analysis are then utilized to aid in patient diagnosis or prognosis.
Thyroid
Genomic alterations play a pivotal role in the development of thyroid cancer, with driver mutations in genes such as BRAF, RAS, and RET influencing the clinicopathological characteristics of thyroid tumors [36]. Consequently, molecular characterization of thyroid cancer is instrumental in more effectively identifying various subtypes of thyroid carcinoma, each exhibiting distinct clinical behaviors and responses to radioactive iodine and targeted therapies [37,38]. US-based radiogenomics could potentially assist in the diagnosis, prognosis, and treatment planning of thyroid cancer through non-invasive methods.
Prediction of BRAF V600E Gene Mutation
A strong association has been demonstrated between the BRAF V600E mutation and the development or progression of papillary thyroid carcinoma (PTC), implicating BRAF V600E as a critical driver of the PTC phenotype [39].
Two preliminary studies attempted to assess the potential of BUS radiomics for predicting BRAF mutation status in PTC, but the outcomes were not encouraging [26,27]. Yoon et al. [26] collected preoperative US images to extract features and calculate a radiomics score, concluding that US-based radiomics did not accurately predict the BRAF V600E mutation in PTCs. Similarly, Kwon et al. [27] reported only moderate performance of classifier models using US radiomics to predict the presence of BRAF mutations in PTCs.
In contrast, two studies from a single team have demonstrated the usefulness of UE images in predicting genetic mutations [28,29]. Wang et al. [28] found that combining UE radiomics features with grayscale US radiomics features demonstrated clinical utility in predicting BRAF V600E mutations among patients with PTC. Building on these findings, Agyekum et al. [29] employed UE radiomics features to develop six distinct ML models. They evaluated the performance of these models in predicting the BRAF V600E mutation and reported that the UE-based radiomics models performed well, with an area under the curve (AUC) ranging from 0.80 to 0.98. These studies suggest that multimodal US could address the limitations of conventional US in extracting gene expression information from target organs.
Prediction of RET Rearrangement
The RET ("rearranged during transfection") gene, a known oncogenic driver in thyroid cancer, is responsible for chromosomal rearrangements such as the RET/PTC fusion gene. Recent clinical evidence suggests that RET rearrangements occur in 10% to 20% of PTCs [40].
Yu et al. [30] developed a DL radiomics nomogram utilizing US to preoperatively predict RET rearrangement in patients with PTC. The nomogram model that integrated clinical and DL radiomics signatures outperformed models based solely on clinical or radiomics signatures, achieving an AUC of 0.9545 in the test cohort and an AUC of 0.9396 in the training cohort. The study utilized a DL methodology to extract US image features imperceptible to the human eye, underscoring the utility of DL in the construction of predictive models for radiogenomics.
Analysis of Gene Expression Levels
Tong et al. [31] conducted a radiogenomics analysis in patients with PTC to predict cervical lymph node metastasis and to explore the relationship between US radiomics features and the molecular characteristics of PTC. Radiomics features and gene modules, obtained through a clustering approach, were correlated to construct a radiogenomic map. This map revealed nine statistically significant pairwise correlations, indicating potential radiogenomic biomarkers for PTC (Fig. 4).

Radiogenomics analysis of radiomics features associated with gene modules.
A radiogenomic correlation map is generated using the Spearman rank correlation method to assess the relationships between ultrasound radiomics features and gene modules. Nine pairwise correlations marked with an asterisk denote statistically significant associations (P<0.05). Red indicates positive correlations, while blue denotes negative correlations. *Statistically significant association. Reprinted from Tong et al. Front Oncol 2021;11:682998 [31], according to Creative Commons License.
Lee et al. [32] analyzed US features assessed by CNNs and radiologists, along with the associated molecular biological mechanisms. They found that certain suspicious US features, such as marked hypoechogenicity, non-circumscribed margins, microcalcifications, and nonparallel shape, were indicative of the upregulation of genes linked to immune response and epithelial-mesenchymal transition. The research suggested that the US characteristics of PTCs, as evaluated by both radiologists and CNNs, could potentially predict the biological behavior and molecular features of the tumors. These radiogenomics studies establish a correlation between radiomics features and complex gene expression data while exploring the underlying molecular mechanisms, this supporting the role of precision medicine in cancer management.
Among US-based radiogenomics studies concerning thyroid cancer, most have investigated the potential of radiomics features to predict specific gene variations in PTC. Additionally, some studies have focused on the correlation between radiomics features and gene expression patterns. The principal findings of these radiogenomics studies are summarized in Table 1.
Breast
Breast cancer represents a significant health risk, as it is the leading cause of cancer-related mortality among women worldwide [41]. Since the first article on breast cancer radiogenomics was published in 2012 [42], a growing body of research has been produced on the subject, utilizing various imaging modalities [43]. The advanced study of genomic markers in breast cancer enables analyses that range from identifying surrogate molecular subtypes to conducting multigene expression-based assays [43]. In the context of breast cancer management, US-based radiogenomics could potentially be employed to predict molecular subtypes and examine the relationships between US characteristics and gene expression, thus providing a clearer picture of the behavior and progression of breast cancer (Tables 2, 3).
Prediction of Molecular Subtypes
Based on gene expression profiling, breast cancer can be classified into four molecular subtypes: luminal A, luminal B, human epidermal growth factor receptor 2 (HER2)–enriched, and basal-like [52,53]. However, identifying molecular subtypes at the genetic level can be expensive and time-consuming. Consequently, in both clinical practice and research, the use of IHC surrogates for breast cancer subtyping is becoming increasingly common [43]. Breast cancers can be categorized into St. Gallen molecular subtypes according to IHC detection of estrogen receptor (ER), progesterone receptor (PR), HER2, and the Ki-67 labeling index, which correlate with intrinsic subtypes [54,55]. Different molecular subtypes demonstrate variations in disease presentation, prognosis, and response to treatment. The primary goal of US-based radiogenomics research in breast cancer is to predict molecular subtypes by identifying specific US imaging features.
In an earlier study, Guo et al. [44] employed a US-based radiomics approach to differentiate between hormone receptor-positive, HER2- negative cancers and triple-negative breast cancers (TNBC). They demonstrated that quantitative US characteristics are strongly linked to receptor status and molecular subtypes in breast cancer. Xu et al. [45] developed four SVM models capable of effectively predicting the molecular expression of ER, PR, HER2, and Ki-67 in breast cancer. To distinguish TNBC from non-TNBC prior to surgery, Xu et al. [46] created a US-based radiomics nomogram that combines US radiomics features with conventional US features, achieving an AUC value of 0.813 in the validation set.
US-based DL algorithms have shown promising results in the prediction of molecular subtypes of breast cancer. Jiang et al. [47] conducted a multicenter retrospective study demonstrating that a CNN model, developed using US images, was highly accurate in distinguishing between luminal and non-luminal diseases, as well as in diagnosing molecular subtypes. Zhou et al. [48] evaluated the performance of three assembled CNN (ACNN) models in predicting molecular subtypes of breast cancer. Their findings indicated that the multimodal ACNN, which integrated grayscale US images, CDFI, and shear wave elastography (SWE), was effective in accurately predicting these subtypes, with an AUC ranging from 0.87 to 0.96. Quan et al. [49] found that a DL radiomics approach, which incorporated breast US video and clinical characteristics, showed superior performance in predicting HER2 expression status, achieving an AUC of 0.81.
Gong et al. [50] conducted pioneering research on the utility of CEUS radiomics for predicting molecular subtypes of breast cancer. They integrated CEUS video data with conventional US images to develop a radiomics model capable of classifying breast cancer into six molecular subtypes. Their findings revealed that CEUS radiomics offers a distinct advantage in determining the receptor status of breast cancer, as the CEUS videos capture dynamic blood perfusion details that reflect vascular features of the tumor. This work also underscores the important role of CEUS in the field of radiogenomics.
Based on imaging features from US and mammography, as well as clinical characteristics, Ma et al. [62] developed an interpretable ML model that demonstrated good performance in identifying breast cancer. Furthermore, it assisted radiologists in distinguishing TNBC from other subtypes, exhibiting an AUC of 0.971. This predictive model incorporated two primary imaging techniques—US and mammography—in breast cancer diagnosis and yielded excellent results. It provides valuable insights into the potential benefits of integrating additional imaging modalities in radiogenomics studies.
Prediction of BRCA Mutation
Germline pathogenic variants in the breast cancer susceptibility genes BRCA1/2 are associated with an increased risk of developing ovarian or breast cancer [63]. Guo et al. [56] developed a nomogram model that integrates radiomics derived from US with clinical characteristics, achieving a high accuracy in detecting germline BRCA1/2 (gBRCA1/2) mutations, as indicated by an AUC of 0.811. Similarly, Deng et al. [51] employed radiomics data from both intratumoral and peritumoral regions, in conjunction with clinicopathological factors, to construct a predictive nomogram. This model demonstrated an AUC of 0.824 for the accurate identification of gBRCA mutations. When a patient has a high risk of carrying a BRCA pathogenic variant, genetic testing is typically recommended [64]. US-based radiogenomics may offer a cost-effective approach for the diagnosis of breast cancer and the identification of individual BRCA mutations, which can inform personalized treatment strategies.
Analysis of Gene Expression
Cui et al. [57] developed a predictive model by conducting a thorough analysis of US radiomics features of HER2-positive breast cancer, exploring the associated gene expression and biological information. Utilizing mRNA expression profiles for HER2-positive breast cancer from the Gene Expression Omnibus (GEO) database, they found that various US radiomics features corresponded to different biological functions. Specifically, one of the US radiomics features, zone entropy, which is elevated in US images of HER2-positive breast cancer, may signify immune cell activity (Fig. 5). This study demonstrates that US radiomics features can reflect the biological functions of cancer in a differential manner.

A feature-mRNA-function network constructed using an independent dataset.
The hierarchical model reveals relationships among five differential ultrasound radiomics features, specific genes, and particular functions in human epidermal growth factor receptor 2 (HER2)–positive breast cancer. The analysis indicates that the size zone non-uniformity feature correlates with carbon metabolism in cancer. The size zone non-uniformity normalized feature is associated with cell-substrate junctions. Small area emphasis has a connection to chemical carcinogenesis, while the small area high gray level emphasis feature relates to the composition of the endomembrane system. Lastly, zone entropy is linked to the activities of peroxidase and oxidoreductase. Reprinted from Cui et al. J Transl Med 2023;21:44 [57], according to Creative Commons License.
In a study by Huang et al. [58], a model was developed that integrated molecular subtypes with US features to predict TP53 and PIK3CA gene mutations in breast cancer. The findings indicated that larger tumor size and a mass-like appearance on US were associated with PIK3CA mutation, while calcifications on US were an independent predictor of TP53 mutation. In a later study [59], the researchers evaluated the correlation between US radiomics features and genomic sequencing in patients with and without TNBC. The sequencing of breast cancer-specific genes revealed that US radiomics features were linked to gene mutations and signaling pathways that differed between the TNBC and non-TNBC groups. These findings suggest that a radiogenomic signature based on US could potentially serve as a predictor for molecular targets in the precision treatment of TNBC.
Using BUS and vascular US (including microvascular US and CEUS) imaging of breast cancers, Park et al. [60] investigated the relationship between US morphological and vascular characteristics and genetic alterations, as revealed by RNA sequencing results from 31 patients with breast cancer. In this radiogenomic study, 26 genes associated with tumor growth, prognosis, and metastasis in breast cancer were found to be significantly upregulated or downregulated in relation to US phenotypes. A heatmap was created to display the gene expression data in breast cancers with nonparallel and parallel orientations (Fig. 6). Building on these findings, the researchers also analyzed DNA sequencing results from the same 31 breast cancer samples. They found that certain vascular features observed on US were significantly associated with genetic alterations linked to angiogenesis and the prognosis of breast cancer [61]. These studies underscore the potential of vascular US to provide surrogate markers for treatment decisions and prognostic predictions in patients with breast cancer.

Heat map illustrating 42 differentially expressed genes in relation to the orientation of breast cancer on B-mode ultrasound (BUS) imaging.
The columns in the table represent 31 individual breast cancer cases, with yellow indicating parallel orientation and green representing nonparallel orientation. The rows correspond to the 42 differentially expressed genes. The color key reflects the level of gene expression: red signifies upregulation and blue signifies downregulation. In the BUS image, a 53-year-old patient with parallel-oriented breast cancer is highlighted with a yellow border, and a 48-year-old patient with nonparallel-oriented breast cancer is highlighted with a green border. Reprinted from Park et al. Radiology 2020;295:24-34 [60], with permission from the Radiological Society of North America Inc.
Liver
Primary liver cancers are a major contributor to cancer-related mortality worldwide [41]. Although limited US radiogenomics and radiomics studies have focused on the liver, the applications of US radiomics in liver tumors have been delineated in terms of diagnosis [65,66] and the prediction of microvascular invasion [67], as well as in assessing the biological behaviors of liver tumors [68,69]. Given the diverse array of treatment options for liver cancers, radiogenomics holds promise for improving the clinical management of these malignancies, including hepatocellular carcinoma (HCC).
Wang et al. [70] developed a radiomics model that leverages CEUS features from the arterial, portal venous, and delayed phases to predict the T cell–inflamed GEP of 18 genes in patients with HCC (Table 4). The ML model, which utilized features from the US arterial phase, exhibited excellent performance in predicting T cell–inflamed GEP, achieving an average AUC of 0.905 in 5-fold cross-validation. This pioneering study in radiogenomics demonstrated the potential of CEUS radiomics to predict T cell–inflamed GEP, indicative of a T cell–activated tumor microenvironment, thus offering valuable insights that could inform therapeutic decisions in immunotherapy.
Prostate
Prostate cancer represents the second most common cancer among men, exhibiting higher incidence and mortality rates in the Western world [71]. Several US-based radiomics studies have analyzed quantitative data from US images or radiofrequency (RF) signals, demonstrating their potential utility in detecting [72,73] and localizing [74] prostate cancer, as well as predicting biochemical recurrence [75].
By characterizing cancer-associated fibroblasts (CAFs), Ageeli et al. [76] explored the prostate tumor microenvironment. They correlated tissue stiffness measurements obtained from transrectal US SWE with IHC and genomics techniques. Their findings demonstrated a positive correlation between US SWE–measured tissue stiffness and the expression of smooth muscle actin α and platelet-derived growth factor receptor β in CAFs (Table 4). In another recent study, proteomic data were linked to qualitative features of BUS and CEUS in prostate cancer, revealing the biological functions that underpin the ultrasonic phenotype [77]. These radiogenomic insights suggest that each US imaging modality, whether SWE or CEUS, could serve as a non-invasive tool for visualizing the tumor microenvironment. This capability improves our understanding of the biological mechanisms at play in prostate cancer and may inform personalized treatment strategies.
Ovary
Pelvic US scans are routinely utilized in clinical settings for gynecological examinations. Women with gBRCA1 mutations are at a heightened risk of developing ovarian cancer, one of the deadliest cancers affecting women [80].
To predict gBRCA1/2 status in women with healthy ovaries, Nero et al. [78] developed an automated ML model using features extracted from transvaginal pelvic US images based on real-world data (Table 4). This model could help identify patients more likely to carry gBRCA1/2 pathogenic variants, thus reducing the need for unnecessary genetic testing. By integrating this model with existing clinical and family history criteria, they also conducted a cost-effectiveness analysis comparing various methods of identifying BRCA1/2 carriers in the general population [81]. Their findings suggest that the incidence of BRCA-related cancers could be significantly lowered by offering genetic testing to women selected through the radiogenomic model. The model offered an effective, cost-efficient, and sustainable approach to preventing BRCA-related cancers, setting the stage for its future incorporation into cancer management protocols.
Brain
Gliomas constitute the majority of malignant brain tumors [82]. In line with the World Health Organization Classification of Tumors of the Central Nervous System guidelines, molecular parameters are now incorporated as biomarkers for the accurate classification of gliomas [83]. Radiogenomics is an intriguing field that hold great promise for improving the management of gliomas via personalized medicine [84].
US RF signals are echo signals produced by the interaction of US with human tissues. Xie et al. [79] developed a DL modeling method that leveraged the temporal-spatial characteristics of US RF signals, leading to an effective intraoperative diagnosis of glioma biomarkers, including isocitrate dehydrogenase 1, chromosome arms 1p and 19q, and telomerase-reverse transcriptase gene promoter (Table 4). The spatial-temporal integration model derived from US RF signals outperformed two classical models in diagnosing molecular biomarkers, achieving a mean AUC of 0.96. This model thus provides accurate diagnoses and predictions of prognosis. While many studies have focused on achieving preoperative molecular diagnosis using medical imaging modalities such as CT and MRI [85], their use during surgery is limited. The US-based method offers a rapid intraoperative molecular detection solution that better meets clinical needs for personalized medicine.
Challenges and Future Direction
Radiogenomics plays a pivotal role in the realm of precision medicine for cancer, offering clinicians valuable insights to inform cancer detection and treatment strategies. This development is a natural progression in the shift toward precision medicine, with radiogenomics not only providing a cost-effective alternative to traditional gene sequencing but also yielding information about the entire tumor, as opposed to a limited biopsy specimen. Radiogenomics studies primarily utilize MRI, CT, and PET to study a variety of tumors, including those in the brain, lung, breast, liver, prostate, kidney, colorectum, and gynecological areas [86]. US-based radiogenomics is still in its nascent phase, with research largely focusing on the relationships between imaging features and specific genes or molecular subtypes, particularly in thyroid and breast cancers. Nonetheless, US-based radiogenomics offers several distinct benefits: it is radiation-free, time-efficient, and cost-effective. The capacity to perform real-time scanning and rapid diagnosis has allowed US to be integrated into the surgical workflow, thus aiding surgeons. Moreover, image resolution and quality have been improved by advancements in the latest US technologies. Various US imaging modalities, such as CEUS and SWE, deliver multifaceted information regarding lesions, thus optimizing disease diagnosis. In this section, the authors explore the challenges encountered by US-based radiogenomics and highlight potential future directions.
Compared to automated image acquisition in radiogenomics using MRI or CT, US imaging demonstrates poor objectivity and reproducibility. This is largely due to the high operator dependence in the acquisition and evaluation of US images, which presents challenges in maintaining consistent image quality within US-based radiogenomics workflows. Furthermore, the complex nature of the radiomics workflow—which includes collecting images, delineating ROIs, extracting optimal features, and building appropriate models—contributes to variability in results. The absence of standardized processes has resulted in a lack of repeatability and reproducibility in radiogenomic studies [87]. To address these challenges, multiple international consortiums have been established. The proposed radiomics quality score aims to increase the robustness of radiomics research, while the IBSI was developed to provide a standardized approach for feature computation and image processing in radiomics analysis [1,35]. Consequently, efforts should be made to establish uniform standards for US operation and image processing procedures to ensure the production of high-quality and reproducible studies.
In contrast to the substantial and publicly accessible datasets found in other medical imaging fields, the availability of public datasets in medical US is currently limited. This scarcity of publicly available databases hampers researchers’ ability to conduct studies with large sample sizes [88]. The Cancer Genome Atlas (TCGA) data, which includes genomic and clinical biomarkers from various cancer types, can be linked to corresponding imaging data available for cancer research in The Cancer Imaging Archive (TCIA). The use of TCGA-TCIA data has been instrumental in advancing radiogenomics research that utilizes MRI, CT, and PET imaging modalities [89,90]. The development of large-scale imaging, genomics, or combined datasets as a resource for research teams worldwide could greatly improve the quality of US-based radiogenomics studies and accelerate their integration into clinical practice.
The integration of various US imaging modalities in multimodal US fusion analysis enables thorough evaluation of the imaged tissue. Multimodal radiomics has demonstrated substantial progress in improving clinical diagnostic accuracy [15]. Another key imaging modality in medical imaging is three-dimensional (3D) US, which offers considerable potential in US-based clinical applications and provides more information than two-dimensional images. Cutting-edge 3D AI algorithms are anticipated to play a pivotal role in radiogenomics based on US, as these are designed to perform a variety of tasks in medical US analysis [91].
While US-based radiogenomics is currently limited to a few cancer types, the scope of radiomics utilizing US has expanded to include investigations into gallbladder cancer [92], renal cancer [93], rectal cancer [94], endometrial cancer [95], soft-tissue tumors [96], and others. Thus, it is reasonable to anticipate broader applications of US-based radiogenomics in as-yet-unexplored areas. Moreover, radiogenomics research that employs MRI and CT imaging has demonstrated potential as an effective tool for predicting therapeutic responses and assessing recurrence risk. This suggests a promising avenue for further research into US-based radiogenomics, particularly in its capacity to correlate imaging characteristics with patient outcomes and genomic data.
When designing a radiogenomics study, researchers must recognize the importance of standardized protocols, as decisions made at each stage will impact subsequent steps in the radiogenomics workflow. Furthermore, it is advisable for future radiogenomics research to prioritize prospective, multicenter studies with large sample sizes to improve the robustness of the research and drive progress in this field.
Conclusion
In this review, we discuss the current state of radiogenomics studies that utilize US and explore the challenges and prospects of this field. US-based radiogenomics research is still in its infancy, with only a limited number of studies conducted thus far. However, it represents an emerging and promising area for genotype identification due to its wide availability, cost-effectiveness, and non-invasive nature. To address the identified challenges, future US-based radiogenomics research will require high-quality, multicenter, and multi-geographical studies with a standardized workflow. With the expansion of clinical data and advancements in AI approaches, further investigation in this rapidly expanding field will be crucial for the successful integration of radiogenomics into clinical practice.
Notes
Author Contributions
Conceptualization: Wang SR, Huang B, Xu HX. Data acquisition: Wang SR, Shen YT. Data analysis or interpretation: Shen YT, Xu HX. Drafting of the manuscript: Wang SR, Shen YT. Critical revision of the manuscript: Huang B, Xu HX. Approval of the final version of the manuscript: all authors.
Conflict of Interest
No potential conflict of interest was reported by the authors.
Acknowledgments
This work was supported by the Ministry of Science and Technology (2023YFC2414204), the Shanghai Municipal Health Commission (Grant SHSLCZDZK 03502), the Science and Technology Commission of Shanghai Municipality (Grant 19DZ2251100), Fudan University (IDF152076), and Zhongshan Hospital of Fudan University (Grant 2022ZSQD07).
References
Article information Continued
Notes
Key points
Radiogenomics, when applied to ultrasound (US), examines the relationships between US imaging features and gene expression patterns. The implementation of US-based radiogenomics offers a rapid and non-invasive approach to identifying gene variations and molecular subtypes in cancers. Future US-based radiogenomics research should prioritize prospective, multicenter studies with large sample sizes and standardization.