v
Search
Advanced

Publications > Journals > Journal of Clinical and Translational Pathology> Article Full Text

  • OPEN ACCESS

Artificial Intelligence in Breast Pathology: Recent Advances in Multimodal Models, Explainability, and Clinical Applications

  • Yan Hu1,
  • Alan Zhu2,
  • Robert Wesolowski3,
  • Maryam Tahir1,
  • Gary Tozbikian1,
  • Anil V. Parwani1,
  • Ziyu Su1,
  • Khalid Niazi1 and
  • Zaibo Li1,* 
 Author information 

Abstract

Background and objectives

Artificial intelligence (AI) is increasingly reshaping diagnostic pathology, with breast pathology representing one of the most advanced and clinically impactful areas of adoption. Despite rapid progress, many practicing pathologists remain unfamiliar with core AI concepts and their practical implications. This review provides a concise and accessible overview of AI in breast pathology, focusing on foundational principles, current clinical applications, and future directions.

Methods

Pertinent literature was reviewed. Personal experiences were also summarized and incorporated.

Results

Key AI concepts, including algorithms, models, architectures, machine learning, deep learning, neural networks, and multimodal and foundational models, are introduced to establish a common framework. Important distinctions among generative, black-box, and explainable AI are highlighted, emphasizing the need for transparency and interpretability in clinical settings. The evolution of AI in breast pathology is reviewed, from early rule-based computer-assisted diagnostic systems to modern deep learning approaches leveraging large-scale whole-slide imaging datasets. Current applications span multiple domains, including detection of lymph node metastases, Nottingham grading, classification of benign and malignant lesions, and automated quantification of critical biomarkers. AI-based approaches to prognosis, risk stratification, prediction of treatment response, and analysis of the tumor microenvironment are also discussed. Finally, the review addresses challenges associated with real-world implementation, including data quality, bias, regulatory considerations, cost, infrastructure, and workflow integration.

Conclusions

As AI continues to evolve toward large-scale, multimodal, and explainable models, it is expected to function as an augmentative tool rather than a replacement for pathologists, supporting diagnostic accuracy, standardization, and personalized management in breast cancer care.

Keywords

Digital pathology, Artificial intelligence, Breast cancer, Breast pathology, Machine learning, Multimodal model, Explainable artificial intelligence, Clinical applications

Introduction

Artificial intelligence (AI) has rapidly become a transformative force in medicine, with pathology among the most profoundly impacted specialties. Recent advances in digital imaging, computational power, and data availability have accelerated the integration of AI-driven tools into pathology workflows. In a discipline where visual pattern recognition, quantitative assessment, and synthesis of complex data are central to daily practice, AI offers unique opportunities to enhance diagnostic accuracy, efficiency, and reproducibility.1,2

Breast pathology stands out as one of the most active and clinically mature areas for AI adoption. Breast pathology relies heavily on histopathologic evaluation, biomarker assessment, grading, staging, and increasingly, molecular characterization. These tasks are time-consuming and prone to inter- and intra-observer variability, particularly in tumor grading, lymph node evaluation, and immunohistochemical scoring. The digitization of slides through whole-slide imaging (WSI) has laid the foundation for AI applications, enabling large-scale image analysis and the development of deep learning models that extract diagnostically relevant features directly from histologic images.

Despite growing research and regulatory approvals, a gap persists between technological progress and practical understanding among general pathologists. Clear explanations of AI fundamentals such as algorithms, models, architectures, and learning paradigms are often missing from clinically oriented literature. Additionally, concerns about data bias, explainability, regulatory compliance, workflow integration, and professional accountability continue to influence perceptions of AI adoption. This review aims to bridge these gaps by providing an accessible overview of AI in breast pathology, outlining essential concepts and current clinical applications in diagnosis, grading, biomarker quantification, prognostication, and treatment response prediction. The goal of this review is to provide pathologists with a clear understanding of foundational and contemporary AI principles, summarize current applications in breast pathology, and delineate the major challenges and future directions necessary for successful clinical adoption.

Basic concepts and milestones in AI

AI combines logical principles with modern computing infrastructure to enable tasks traditionally requiring human intelligence, a concept introduced by Dr. John McCarthy in 1956.3 AI processes large volumes of training data using algorithms—structured procedures—to identify patterns and generate models, which are the resulting trained systems that make predictions or decisions autonomously.

Algorithms, models, and architectures

Algorithms are step-by-step procedures that instruct a computer on how to make decisions when given specific data. Through repeated application, these procedures enable computers to learn patterns and eventually make decisions independently, which can then be applied to solve complex problems. Most algorithms used in AI fall into several major families.4,5

Machine learning algorithms analyze statistical patterns in data to predict future outcomes and guide decision-making. They are typically categorized as supervised (trained on labeled data for classification tasks) or unsupervised (trained on unlabeled data to uncover intrinsic patterns). Search and optimization algorithms identify the most effective solution to a defined objective by evaluating numerous possible alternatives. Natural language processing algorithms convert raw text or speech into machine-readable data, enabling tasks such as machine translation, speech recognition, and tokenization (transforming data into secure digital representations). Computer vision algorithms extract and learn visual patterns from images or videos to perform tasks such as object detection, segmentation, and prediction.

By contrast, AI models are complete systems that, when provided with data, apply the algorithms they incorporate to make decisions. Models can be simple (such as linear regression) or highly complex, such as deep neural networks. Finally, AI architecture refers to the overarching framework that supports the development and operation of algorithms and models. It serves as a blueprint for the necessary software and hardware components, including computing infrastructure, data pipelines, and model development environments.

Neural network models and deep learning

Neural networks are learning structures produced through training algorithms. They can be compared to the structure of the brain, with each biological neuron having an artificial counterpart known as a node.

Deep learning is a subset of machine learning that uses neural networks to process extremely large datasets and learn complex patterns.6 These models often contain numerous hidden layers that progressively extract higher-level features from raw input data. Deep learning systems frequently rely on multiple training algorithms to tune their internal weighting systems and are widely used in image recognition and speech analysis. Common deep learning models include convolutional neural networks (CNNs), recurrent neural networks (RNNs), and transformer networks.

CNNs process grid-like data such as images.3,7 They contain convolutional layers that apply filters (kernels) to detect important visual features in the data. In pathological applications, CNNs are commonly used to analyze slides by detecting cancerous cells or quantifying biomarkers in an automated and precise manner.

RNNs process time-dependent or sequential data to capture temporal relationships.3,8 Their input includes the output of the previous cycle, which simulates a form of short-term memory. These models are often used in time-series prediction or predictive text. Limitations include difficulty retaining long-term dependencies.

Transformer networks operate differently from RNNs, as they do not rely on sequential recurrence. Instead, they weigh all tokens in a sequence simultaneously, enabling them to capture long-range dependencies more effectively. This parallel processing provides improved performance on large datasets and more complex modeling capability. A well-known example of such a model is ChatGPT.

Models: In-depth

Foundational and multimodal models are currently among the most widely used types of AI models.9 Multimodal models process multiple data types simultaneously, such as text, images, and audio. Foundational models are extremely large models trained on vast and diverse datasets and capable of performing multiple complex tasks, enabling them to adapt rapidly to a wide variety of applications. Examples include large language models such as ChatGPT, Claude, Gemini, and DeepSeek.

Models are trained through a structured process. First, the problem or objective must be clearly defined, followed by data collection and preprocessing. Next, an appropriate algorithm is selected based on the task. Model parameters are then initialized, and initial predictions are generated in a forward pass. The accuracy of these predictions is evaluated by comparing them to true values, and adjustments are made through backpropagation to reduce error. This iterative process, repeated over multiple epochs, continues until performance improves. The model is then validated on unseen data to ensure generalizability, tested on a separate dataset for accuracy, and deployed for real-world applications. Ongoing monitoring and periodic retraining are essential to maintain reliability and optimize performance over time.10

Generative, black box, and explainable AI (XAI)

Generative AI is a branch of AI that creates new content based on patterns in existing input data. It uses generative models trained on massive datasets to produce original outputs when prompted.11 Although these outputs appear “new,” they are fundamentally derived from training data and may reflect embedded biases. Additionally, they can be inaccurate because they do not involve true reasoning but instead rely on pattern recognition. Training generative AI models often requires substantial datasets and computational resources, raising concerns related to privacy, copyright, and energy consumption.

Black box AI refers to complex neural networks whose internal decision-making processes are difficult for humans to interpret. The term “black box” reflects the inability to clearly explain how such models transform inputs into outputs.

XAI refers to methods that make these systems more interpretable to humans.12 A major early milestone in this field was the Defense Advanced Research Projects Agency initiative in 2015, which marked the first formal research effort into interpreting black box AI systems.13 By improving transparency, XAI enhances both model reliability and interpretability. This is particularly important in medical contexts, where understanding the rationale behind predictions is essential for clinical trust and patient safety. In pathology, XAI is especially valuable, as it helps maintain accountability while enabling physicians to understand how AI systems classify or interpret slides.

AI in pathology

Pathological applications of AI rely heavily on deep learning to analyze large datasets such as digital whole-slide images, molecular profiles, and patient information. By combining and analyzing this information, these AI models are able to enhance diagnostic accuracy and detect subtler patterns that could be missed by human pathologists. Furthermore, AI models are also able to streamline laboratory workflows by reducing time spent on redundant tasks, integrating oncologic genomic data more quickly than could be done otherwise, and better calculating recurrence risk scores given the presence of certain biomarkers.

The advent of foundational models in computational pathology has also enabled widespread, generalizable analysis of diverse tissue morphologies, allowing them to perform numerous diagnostic tasks concurrently without the need for additional retraining.14 By scaling up the same principles used in narrower models, they are able to integrate data and perform tasks in a manner that resembles the human ability to adapt and multitask in pathology work.

This capability has been further extended by multimodal models,15 which are able to bypass the limitations of unimodal models by leveraging the complementary strengths of multiple modalities to increase overall performance. Traditional histological images better capture the precise details of tissue morphology, while pathological reports may better capture a patient’s overall clinical context. Multimodal models are able to work with both types of data simultaneously, sharply improving prediction accuracy, robustness, and resilience to noise. Also of note is their ability to infer missing information by synthesizing these two datasets. In analyzing all this data in conjunction, multimodal AI is able to most accurately emulate the way human pathologists synthesize information in practice.

Recent multimodal AI assistants such as PathChat,15 PathAsst,16 SmartPath,17 and SlideChat integrate WSI analysis with natural language processing to support complex pathological tasks and assist in confirming physician diagnoses.18 Tasks they commonly assist with include diagnostic reasoning, morphological feature analysis, and biomarker testing recommendations. These systems allow pathologists to streamline their workflow and easily retrieve clinical information relevant to a given patient. In this way, these models can enhance both the precision and speed of a physician’s workflow and may also aid in standardizing reporting across different medical institutions.

AI is often able to outperform humans in pattern recognition, scalability, efficiency, and breadth of diagnostic consideration. In this way, it is an effective approach for reducing physician workload. However, it is often complicated to integrate into existing clinical systems and requires large amounts of patient data for training. This poses numerous issues, particularly concerning patient privacy, the difficulty of obtaining sufficient data to construct datasets, and susceptibility to bias depending on the populations represented.19 Furthermore, AI is often expensive to implement and still faces significant regulatory challenges in complying with healthcare confidentiality standards. As previously mentioned, the “black box” problem is especially problematic, given that understanding model behavior is crucial for interpreting why patients are given certain diagnoses.

To successfully implement AI in pathology, the process should begin by clearly defining clinical objectives, such as minimizing diagnostic errors or improving biomarker quantification. Next, the necessary digital infrastructure should be established, including WSI scanners, data storage systems, and adequate computational resources. Once the infrastructure is in place, datasets, clinical notes, and WSIs should be prepared and annotated to ensure high-quality input for model development. The subsequent step involves selecting, developing, and validating appropriate AI models, accompanied by rigorous testing and bias assessment. Integration into hospital workflows should follow, which may include embedding models into Laboratory Information Systems to automate specific tasks. Comprehensive staff training on AI-assisted processes is essential, along with strict adherence to HIPAA and other regulatory requirements. Finally, continuous monitoring and iterative improvement of AI models are necessary to maintain performance and reliability over time.

AI applications in breast pathology

Early applications of AI in breast pathology primarily involved computer-assisted diagnostic systems, which relied on rule-based algorithms to quantify nuclear size, mitotic figures, and staining intensity in breast tissue slides. With the emergence of machine learning in the 2010s, techniques such as support vector machines and random forests were introduced to analyze histopathology images, paving the way for widespread adoption of WSIs. This advancement enabled the creation of large-scale datasets that could be mined for patterns and biomarkers, leading to significant progress in tumor detection and grading. These algorithms also facilitated automated learning directly from image data, supporting the development of models capable of identifying and classifying tumors with improved accuracy and efficiency.20 This review summarizes recent advances, highlights AI’s potential to improve diagnostic accuracy and efficiency, and addresses key challenges for clinical implementation.

Detection of lymph node metastasis

Automatic detection of lymph node metastasis in breast cancer using AI significantly reduces the risk of missed findings and improves diagnostic consistency. Su et al.21 developed a prototype-based neural network deep learning model (i.e., cross-attention-based salient instance inference multiple instance learning) and evaluated it on 500 WSIs from five different centers, achieving an area under the curve (AUC) ranging from 0.79 to 0.96. Ehteshami Bejnordi et al.22 evaluated 32 algorithms, the majority of which (25 of 32) were based on deep CNNs, on 110 WSIs with metastasis and 160 without, achieving AUCs ranging from 0.556 to 0.994 and, on average, outperforming a panel of 11 pathologists. This demonstrates AI’s potential to enhance sensitivity and reduce human error in critical staging decisions. Steiner et al.23 further showed that pathologists using deep-learning-assisted models achieved higher accuracy than both unassisted pathologists and the algorithms alone when reviewing 70 WSIs, highlighting the value of AI as a collaborative tool rather than a replacement. Basaad et al.24 combined a large language model (BERT) with a graph neural network to predict metastatic breast cancer based on pathological reports. This model achieved a detection rate of 0.98 and an AUC of 0.98 in identifying MBC patients.

Challa et al.25 applied Visiopharm AI (deep learning) to 594 WSIs, reporting decreased reading time and improved efficiency with 100% sensitivity and accuracy, which translates into faster turnaround times and reduced workload for pathology teams. Similarly, a single-center clinical trial using Visiopharm AI (deep learning) on 190 WSIs found that AI-assisted diagnoses had higher sensitivity (60%) and negative predictive value (88.2%) compared to unassisted diagnoses, improved detection of micrometastases and isolated tumor cells, and shortened slide review times. These findings underscore AI’s ability to streamline workflows, standardize reporting, and support timely treatment decisions, ultimately improving patient care and resource utilization in busy pathology practices.26Figure 1 demonstrates detection of tumor metastasis in lymph nodes using a deep learning AI tool (Visiopharm Integrator System).

Detecting lymph node metastasis using an artificial intelligence (AI) tool in breast cancer.
Fig. 1  Detecting lymph node metastasis using an artificial intelligence (AI) tool in breast cancer.

(a) Lymph node with breast tumor metastasis without AI annotation, low magnification (1×). (b) Lymph node with breast tumor metastasis annotated by AI (red line), low magnification (1×). (c) Lymph node with breast tumor metastasis without AI annotation, high magnification (10×). (d) Lymph node with breast tumor metastasis annotated by AI (red line), high magnification (10×). (e) Lymph node with metastatic lobular carcinoma annotated by AI (red line), low magnification (5×). (f) Lymph node with metastatic lobular carcinoma annotated by AI (red line), high magnification (20×).

Nottingham grading

AI has demonstrated strong capability in performing Nottingham grading through deep learning applied to WSIs, significantly reducing interobserver variability and improving reproducibility and accuracy in breast cancer grading.27–38

Jaroensri et al.39 developed a deep learning model trained on 1,600 WSIs and tested on 878 WSIs from The Cancer Genome Atlas (TCGA), achieving results that correlated closely with both patient outcomes and pathologist assessments, underscoring its prognostic value. Similarly, Dominik et al.40 utilized automated image analysis tools (CellProfiler and Tanagra) on 1,937 WSIs, followed by traditional machine learning for classification of Nottingham grades. Their study reported substantial agreement between AI models and pathologists, with Kappa indices of 0.91 for tubular score, 0.55 for nuclear score, and 0.49 for mitotic index. These findings highlight AI’s potential to standardize grading, reduce subjectivity, and enhance efficiency in pathology workflows. By automating complex grading tasks, AI not only improves diagnostic consistency but also accelerates turnaround times, enabling pathologists to focus on higher-level interpretive work and personalized treatment planning.

Classification and diagnosis

AI models have also demonstrated a strong capability in diagnosing and classifying breast lesions, a critical step for guiding personalized treatment strategies. These models can accurately distinguish between benign and atypical lesions (atypical ductal hyperplasia, atypical lobular hyperplasia, etc.) and malignant tumors, as well as specific histologic subtypes such as ductal, lobular, mucinous, and papillary carcinomas, along with detecting microcalcifications.41–44

Hameed et al.45 employed a deep learning CNN model to analyze 845 WSIs (437 with carcinoma, 408 without), achieving a sensitivity of 97.73%, an overall accuracy of 95.29%, and an F1 score of 95.29%, underscoring the robustness of AI in carcinoma detection. Similarly, Abdulaal et al.46 applied advanced CNN architectures (InceptionV3 and VGG19) to two widely used datasets, BACH and BreaKHis. BACH includes WSIs categorized into normal tissue, benign lesions, in situ carcinoma, and invasive carcinoma, while BreaKHis comprises 7,909 WSIs classified as benign or malignant, with further subclassifications across four magnification levels (40×, 100×, 200×, and 400×). Their models achieved exceptional performance, with binary classification accuracies of up to 98.83% for BreaKHis and 99.25% for BACH, demonstrating the scalability of AI across diverse datasets.

Building on this, Irmak et al.47 introduced a multi-magnification CNN approach using architectures such as ConvNeXt, InceptionNeXt, and EfficientNetV2, which improved classification accuracy across tissue scales. The study used the BreakHis dataset and a multi-magnification approach at 40×, 100×, 200×, and 400×. The CNN architecture included ConvNeXt, InceptionNeXt, and EfficientNetV2. InceptionNeXt and ConvNeXt achieved the best binary classification accuracy of 99.52% at 100× magnification, with ConvNeXt also achieving the best performance in multi-class classification, with an accuracy of 95.24% at 40× magnification.

Beyond image-only models, Karimian et al.48 developed a multimodal contrastive language-image pretraining (CLIP) model for histology that paired text and image features to classify and grade breast cancer subtypes. CLIP-IT used a CLIP model pre-trained on histology image–text pairs from a separate dataset to retrieve the most relevant unpaired textual reports for each image in the downstream unimodal dataset. CLIP-IT showed improved classification accuracy over other unimodal and multimodal models, using pathology reports from the TCGA and BACH datasets.48 Jaikumar et al.49 further advanced this concept by integrating XAI techniques into a residual tabular network (ResTabNet) that combines WSIs with protein expression profiles and clinical data. The BRCA dataset contained WSIs labeled as either benign or malignant, along with protein expression profiles for 223 proteins and relevant clinical information related to breast cancer. Using the BreakHis and BRCA datasets, this multimodal model achieved outstanding diagnostic metrics, with an accuracy of 98.56%, precision of 98.10%, recall of 98.00%, F1-score of 98.03%, and an AUC of 0.99, while improving interpretability, a key requirement for clinical adoption.

These advancements highlight AI’s transformative potential in breast pathology, offering not only high diagnostic accuracy but also the ability to integrate multimodal data for comprehensive disease characterization. Such capabilities can streamline workflows, reduce interobserver variability, and support precision oncology by enabling more accurate subtype classification and treatment planning.

Quantification

AI has been increasingly applied to automate breast cancer biomarker scoring, focusing on key markers such as estrogen receptor (ER), progesterone receptor (PR), HER2/neu, Ki-67, and programmed death-ligand 1 (PD-L1). These biomarkers are essential for accurate diagnosis, prognostication, and guiding targeted therapies. Traditional manual scoring is time-consuming and subject to interobserver variability, whereas AI-based approaches offer faster, more reproducible, and less subjective results.50–68Figure 2 demonstrates automated quantification of HER2 immunohistochemistry (IHC) using an AI tool.

Automated quantification of HER2 immunohistochemistry (IHC) using an AI tool in breast cancer.
Fig. 2  Automated quantification of HER2 immunohistochemistry (IHC) using an AI tool in breast cancer.

(a) HER2 IHC staining (positive, 3+) in a breast cancer biopsy specimen (1×). (b) HER2 membrane connectivity (green line) detected by an AI algorithm (1×). Red circles highlight controls, which are excluded from image analysis. (c) HER2 IHC staining (positive, 3+) in a breast cancer biopsy specimen (20×). (d) HER2 membrane connectivity (green line) detected by an AI algorithm (20×).

Akbarnejad et al.69 developed a dataset of 185,538 images and demonstrated that attention-based multiple-instance deep learning AI can predict Ki-67, ER, PR, and HER2 status directly from hematoxylin and eosin (H&E)-stained slides, bypassing IHC with prediction performance around 90%. This approach could significantly reduce costs and turnaround time by eliminating additional staining steps.

Lodge et al.70 introduced and validated HALO Breast AI for automated HER2, ER, PR, and Ki-67 IHC scoring using routine diagnostic cases from three institutions. HALO Breast AI accurately detected tumor regions and tumor cells within breast cancer tissue and demonstrated strong agreement with pathologists. Additionally, HALO Breast AI achieved good generalizability, with consistent performance across external and independent datasets.70

Lu et al.71 developed a deep learning model and proposed an innovative Ki-67 colocalization (Ki-67CL) score based on the spatial distribution of Ki-67 expression in luminal breast carcinoma. This model stratified ER+/HER2− patients with high prognostic significance for breast cancer-specific survival (P < 0.00001) and distant metastasis-free survival (P = 0.0048), offering a valuable tool for identifying patients who may benefit from adjuvant chemotherapy.71

Collectively, these advancements underscore AI’s potential to transform biomarker evaluation by improving accuracy, reproducibility, and efficiency while reducing subjectivity. Integration of such tools into routine workflows can accelerate diagnostic turnaround, enhance treatment planning, and support precision oncology.

Prognosis, risk stratification, and prediction of treatment response

AI is increasingly applied in breast cancer prognosis, risk stratification, and prediction of therapeutic response. Modern AI models can automate Nottingham grading directly from H&E-stained slides while maintaining prognostic performance comparable to pathologists.39,72–77

Sharma et al.78 validated Stratipath Breast, a CE-IVD-marked deep learning AI tool that stratifies patients into high- and low-risk groups using H&E WSIs from resected tumors, demonstrating strong prognostic accuracy across independent cohorts. In the ER+/HER2− subgroup, the hazard ratio (HR) for progression-free survival (PFS) was 2.76 (95% confidence interval (CI): 1.63–4.66, P < 0.001) between low- and high-risk groups after adjusting for established factors. In the ER+/HER2− NH grade 2 subgroup, the HR was 2.20 (95% CI: 1.22–3.98, P = 0.009).

AI models now frequently integrate histopathology, imaging, genomics, and clinical data to guide treatment decisions.

Mondol et al.79 developed a multimodal deep learning survival model that improved risk stratification by combining histopathology, genetic, and clinical data. It employed MaxViT vision transformers to extract image features, applied self-attention to model patient-level relationships, fused image and genetic data through dual cross-attention, and incorporated clinical variables in the final layer to improve prediction. This model used H&E WSIs from The Cancer Genome Atlas Breast Cancer (TCGA-BRCA) dataset, containing 249 WSIs from the Genomic Data Commons portal, with 149 samples of the molecular subtype Luminal A and 100 samples of Luminal B. TCGA-BRCA also contained PAM50 genes selected for analysis, as well as processed clinical data including tumor grade, size, patient age, and lymph node status. The study showed enhanced predictive accuracy for survival risk stratification in ER+ breast cancer patients, with a C-index of 0.64, which was superior to both unimodal models (0.53) and pathologist diagnoses (0.47).

Oncotype DX, the 21-gene recurrence score assay, is currently used clinically to provide critical prognostic and predictive insights. Several studies have developed AI models to predict Oncotype DX recurrence score categories.80–84 For example, Guo et al.81 developed a bio-inspired prototype-guided deep learning model (BPMambaMIL) using a weakly supervised learning framework that integrates the Mamba mechanism with prototypical guidance to predict Oncotype DX score intervals directly from pathology images. The model achieved an AUC of 0.839 and demonstrated robust predictive performance, particularly in identifying high-risk score ranges (accuracy: 0.714).81

Barseghyan et al.85 incorporated XAI into machine learning for breast cancer risk prediction using a large heterogeneous dataset of 1.5 million records from the Breast Cancer Surveillance Consortium, emphasizing transparency to maintain trust. A set of advanced machine learning models was developed, including XGBoost, SVMs, artificial neural networks, and a Dempster–Shafer-based classifier. To improve interpretability, the models were analyzed using SHAP, LIME, and layer-wise relevance propagation. Integrating XAI techniques into breast cancer prediction models enhances interpretability without significantly reducing performance. Risk factors included biopsy history and age, among others. The findings highlight the need to balance accuracy and transparency in clinical AI and support wider use of explainable approaches to promote trust, clarity, and ethical practice in healthcare AI systems.

AI is being actively used to predict breast cancer treatment response, especially for neoadjuvant chemotherapy, endocrine therapy, and targeted agents. These models use imaging, histopathology, and radiomics to forecast pathological complete response (pCR) and guide personalized treatment, offering a potentially low-cost and widely accessible alternative or complement to molecular and imaging biomarkers.86–88 Multimodal models integrating pathologic and radiologic imaging and clinical characteristics have improved accuracy.89–91 For example, Krishnamurthy et al.92 showed that an H&E-based AI model achieved an AUC of 0.75. Huang et al.88 introduced IMPRESS, a deep learning AI pipeline integrating H&E and multiplex IHC, achieving AUCs of 0.8975 for HER2-positive and 0.7674 for triple-negative breast cancer, outperforming manual assessments. Wang et al.93 developed an innovative deep learning model centered on the novel architecture ResponseNet. They applied this model to histopathology images to accurately predict treatment response and prognosis, demonstrating that AI can stratify patients likely to benefit from specific therapies. This study used BreaKHis and TCGA-BRCA datasets, with ResponseNet achieving accuracies of 81.87% and 77.92%, respectively. It also achieved higher precision and F1 scores.

Tumor microenvironment (TME)

AI is now being used to analyze breast cancer TME and has uncovered spatial patterns of immune cells, stromal interactions, and prognostic biomarkers. The TME, including tumor-infiltrating lymphocytes, plays an important prognostic role in breast cancers.94,95 Deep learning-based models have greatly improved the consistency of TME quantification and accuracy. AI-derived tumor-infiltrating lymphocyte scores correlate strongly with manual pathologist assessments but offer far superior reproducibility. Beyond simple density measurements, the spatial arrangement of immune cells in relation to tumor nests provides critical biological and clinical insights. AI-driven computational pathology enables comprehensive spatial profiling of lymphocytes and other immune cells within the TME. Using deep learning–based cell segmentation and classification, AI models can precisely map thousands of cells across entire slides and compute spatial metrics such as nearest-neighbor distances, cell clustering patterns, and immune infiltration gradients.96 Eweje et al.97 applied deep learning–based AI single-cell analysis of H&E WSIs to predict the pathologic response and survival benefit of immune checkpoint inhibition in 65 patients with invasive breast carcinoma treated with anti-PD-1 immunotherapy. The study used AI to map immune and stromal cell distributions in breast cancer tissue and showed that AI can link spatial TME features to patient outcomes to predict pCR and PFS in treated invasive BC, with AUC = 0.894 (95% CI 0.783–1.000) for pCR and HR of 0.30 (95% CI 0.13–0.69, P = 0.003) for PFS.97

Breast cancer molecular pathology

AI in molecular pathology of breast cancer primarily focuses on predicting molecular subtypes, biomarker status, and genomic alterations directly from histopathology images. These approaches aim to integrate digital pathology with precision oncology.

Xu et al.98 developed a foundational model integrating three modalities: microscopic pathology slides, macroscopic pathology reports, and molecular gene expression data, creating 26,169 slide-level pairs from 10,275 patients across 32 cancer types, totaling over 116 million image patches. They introduced a whole-slide pretraining paradigm, Multimodal SelfTAught PRetraining (mSTAR), designed to address a broad range of oncologic tasks, including gene mutation prediction, IHC biomarker prediction, and molecular subtyping. mSTAR outperformed prior pathology foundation models (PLIP, CONCH, UNI, CHIEF, GigaPath) and classical architectures, excelling in molecular prediction, report-related tasks, and multimodal fusion. For breast cancer, mSTAR improved mutation prediction for GATA3 (+3.2%), PIK3CA (+2.46%), and TP53 (+2.04%) (all P < 0.001), and enhanced IHC biomarker prediction for ER, PR, HER2, and CK5, while also performing strongly in molecular subtyping.98

Breast biomarker status is typically determined by IHC or molecular testing, but recent studies show that deep learning models can predict these markers directly from H&E slides with high accuracy. Couture et al.99 trained CNNs on large cohorts, achieving AUCs of 0.90 for ER, 0.86 for PR, and 0.88 for HER2. Farahmand et al.100 used a CNN model and reported HER2 prediction AUCs of 0.81–0.89 across multiple cohorts, approaching the performance of IHC or molecular testing. Visualization revealed that models focus on membrane morphology and growth patterns, consistent with HER2 traits. While not replacements for IHC/ISH, these models may serve as adjuncts or quality assurance tools, especially where confirmatory testing is limited.

Deep learning models also predict intrinsic molecular subtypes (Luminal A, Luminal B, HER2-enriched, and Basal-like) directly from H&E slides.101 Bychkov et al.102 achieved AUCs of 0.80–0.90 using CNNs trained on thousands of WSIs linked to gene expression data. These models likely learn complex morphologic features correlating with transcriptional programs, offering a rapid, low-cost alternative to gene profiling for early risk stratification.

AI-based inference has extended to mutation status, with deep learning models predicting BRCA and TP53 mutations from H&E slides (AUC: 0.80).103 XAI tools such as Class Activation Mapping and attention heatmaps show that models focus on tumor epithelium, nuclear morphology, stromal context, and lymphocytic infiltration, supporting biological plausibility and revealing potential genotype–phenotype links.100,103

Huang et al.104 created a model using a graph neural network and a multi-layer perceptron graph-level readout method to extract gene expression and gene interaction data for classifying 5-year overall survival of breast cancer patients. Their results show that the model outperformed random forest classifiers and deep neural networks.104 A recent study by Wang et al.105 integrated multi-omics data and machine learning to characterize molecular heterogeneity in hepatocellular carcinoma, identifying four molecular subtypes with distinct prognoses, immune features, and therapeutic sensitivities. The study demonstrated the potential utility of such an integrative approach to improve robustness and biological interpretability in AI-driven prognosis prediction and immunotherapy stratification.105

Commercially available AI tools for breast pathology

As AI continues to demonstrate strong performance in tasks such as tumor detection, grading, biomarker quantification, and risk prediction, AI solutions have moved beyond research into clinical practice. These tools are designed to integrate seamlessly into pathology workflows, offering standardized, reproducible results and supporting regulatory compliance. Some commercially available AI tools for breast pathology are summarized in Table 1.

Table 1

Summary of some commercially available AI tools for breast pathology

ProductCompanyCountryApplicationsRegulatory approval
Aiforia Clinical Suites: AI modelsAiforiaFinlandBiomarker quantification, gradingCE-IVD marked
AIRA Clinical productsAIRA MatrixIndiaBiomarker quantification, grading
Galen BreastIBEXIsraelDiagnosis/classificationFDA Breakthrough device
HALO Breast IHC AIIndica LabsUSADiagnosis/classification, biomarker quantificationCE-IVD certified
Breast Ki-67, HER2, ER/PRMindpeakGermanyBiomarker quantificationCE-IVD marked
Nucleai AtomNucleaiIsraelBiomarker quantification, spatial analysis
RlapsRiskOWKINFrancePrediction of recurrent riskCE-IVD marked
Paige Breast SuitePaige AIUSADiagnosis/classification, biomarker quantificationFDA Breakthrough device
QAi LYMPH NODE Dx, Ki-67 QUANT, BREAST HER2 QUANTQritiveSingaporeDiagnosis/classification, quantification, detection, prediction
STRATIPATH BREASTStratipathSwedenPrediction of recurrent riskCE-IVD marked
Biomarker quantification, lymph node detectionVisiopharmDenmarkQuantification, detectionCE-IVD certified

Limitations

Despite substantial progress, several challenges continue to limit the widespread clinical implementation of AI in breast pathology. These include variability in data quality and annotation standards, limited dataset diversity, potential algorithmic bias, and concerns regarding model generalizability across institutions and populations. Additional barriers include infrastructure and computational requirements, integration into existing pathology workflows, and the need to maintain patient privacy, data security, and ethical standards. Furthermore, issues related to interpretability, regulatory approval, reimbursement, and standardized validation protocols remain critical considerations for safe and equitable adoption.

Future directions

Future efforts will focus on developing interpretable, clinically validated, and seamlessly integrated AI systems that complement pathologists in routine breast pathology practice. Emerging technologies such as multimodal foundation models, generative AI, and spatial TME analysis are expected to enable more robust prediction of treatment response, recurrence risk, and patient outcomes through integration of histopathology, radiology, genomics, clinical information, and other multi-omics data. Continued advances in XAI will further improve transparency and trust, supporting broader clinical adoption and regulatory acceptance. In parallel, the establishment of standardized validation frameworks, regulatory guidelines, and reimbursement strategies will be essential to facilitate safe, scalable, and equitable implementation of AI-driven pathology solutions.

Conclusions

AI is transforming breast pathology by improving diagnostic accuracy, efficiency, and reproducibility across multiple applications, including tumor detection, Nottingham grading, biomarker quantification, risk stratification, and prognostic prediction. The field has rapidly evolved from early rule-based approaches to sophisticated deep learning and multimodal foundation models capable of comprehensive disease characterization and supporting increasingly personalized treatment strategies. By reducing interobserver variability, streamlining workflows, and enhancing precision medicine, AI is becoming an indispensable partner to pathologists rather than a replacement for them. Ultimately, the integration of computational intelligence with human expertise has the potential to significantly advance breast cancer diagnosis, treatment, and patient outcomes.

Declarations

Acknowledgement

None.

Funding

None.

Conflict of interest

Zaibo Li is a Deputy Editor-in-Chief of Journal of Clinical and Translational Pathology. The other authors have no conflicts of interest to declare.

Authors’ contributions

Conception and design (YH, ZL), manuscript writing (YH, AZ, RW, MT, GT, AVP, ZS, KN, ZL), and final approval of manuscript (YH, AZ, RW, MT, GT, AVP, ZS, KN, ZL).

References

  1. Rakha EA, Toss M, Shiino S, Gamble P, Jaroensri R, Mermel CH, et al. Current and future applications of artificial intelligence in pathology: a clinical perspective. J Clin Pathol 2021;74(7):409-414 View Article PubMed/NCBI
  2. Verma KK, Palmal R, Yadav P. Unveiling the future: the impact of artificial intelligence in diagnostic pathology. Surg Exp Pathol 2025;8(1):23 View Article
  3. Russell SN, Norvig P. Artificial Intelligence: A Modern Approach. 4th ed. Harlow: Pearson; 2020
  4. GeeksforGeeks. Artificial Intelligence (AI) Algorithms. 2025. Available from: https://www.geeksforgeeks.org/artificial-intelligence/ai-algorithms/
  5. Goodfellow I, Bengio Y, Courville A. Deep Learning. Cambridge (MA): The MIT Press; 2016
  6. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature 2015;521(7553):436-444 View Article PubMed/NCBI
  7. Litjens G, Sánchez CI, Timofeeva N, Hermsen M, Nagtegaal I, Kovacs I, et al. Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Sci Rep 2016;6:26286 View Article PubMed/NCBI
  8. Campanella G, Hanna MG, Geneslaw L, Miraflor A, Werneck Krauss Silva V, Busam KJ, et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med 2019;25(8):1301-1309 View Article PubMed/NCBI
  9. Li C, Gan Z, Yang Z, Yang J, Li L, Wang L, et al. Multimodal foundation models: From specialists to general-purpose assistants. Found Trends Comput Graph Vis 2024;16(1-2):1-214 View Article
  10. LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE 1998;86(11):2278-2324 View Article
  11. Banh L, Strobel G. Generative artificial intelligence. Electron Mark 2023;33(1):63 View Article
  12. Confalonieri R, Coba L, Wagner B, Besold TR. A historical perspective of explainable artificial intelligence. Wiley Interdisciplinary Reviews: Data Min Knowl Discov 2021;11(1):e1391 View Article
  13. Gunning D, Aha D. DARPA’s explainable artificial intelligence (XAI) program. AI Mag 2019;40(2):44-58 View Article
  14. Chen RJ, Ding T, Lu MY, Williamson DFK, Jaume G, Song AH, et al. Towards a general-purpose foundation model for computational pathology. Nat Med 2024;30(3):850-862 View Article PubMed/NCBI
  15. Lu MY, Chen B, Williamson DFK, Chen RJ, Zhao M, Chow AK, et al. A multimodal generative AI copilot for human pathology. Nature 2024;634(8033):466-473 View Article PubMed/NCBI
  16. Sun Y, Zhu C, Zheng S, Zhang K, Sun L, Shui Z, et al. PathAsst: A generative foundation AI assistant towards artificial general intelligence of pathology. In: Proceedings of the AAAI Conference on Artificial Intelligence, AAAI-24; 2024 Feb 27-Mar 4; Vancouver, Canada. Palo Alto, CA: AAAI Press; 2024. p. 5034-5042 View Article
  17. Xu Z, Liu Z, Hou J, Ma J, Jin C, Wang Y, et al. A versatile pathology co-pilot via reasoning enhanced multimodal large language model. arXiv 2025 View Article
  18. Chen Y, Wang G, Ji Y, Li Y, Ye J, Li T, et al. SlideChat: A large vision language assistant for whole slide pathology image understanding. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 2025 Jun 11-15; Nashville, TN, USA. Piscataway, NJ: IEEE; 2025. p. 5134-5143 View Article
  19. Hanna MG, Pantanowitz L, Jackson B, Palmer O, Visweswaran S, Pantanowitz J, et al. Ethical and Bias Considerations in Artificial Intelligence/Machine Learning. Mod Pathol 2025;38(3):100686 View Article PubMed/NCBI
  20. Li W, Ye S, Jin Z, Chen L, Chao Y, Wei G, et al. Artificial Intelligence in digital pathology of breast cancer, new era of practice?. Int J Surg 2025;111(11):8270-8283 View Article PubMed/NCBI
  21. Su Z, Rezapour M, Sajjad U, Niu S, Gurcan MN, Niazi MKK. Cross-Attention-Based Saliency Inference for Predicting Cancer Metastasis on Whole Slide Images. IEEE J Biomed Health Inform 2024;28(12):7206-7216 View Article PubMed/NCBI
  22. Ehteshami Bejnordi B, Veta M, Johannes van Diest P, van Ginneken B, Karssemeijer N, Litjens G, et al. Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer. JAMA 2017;318(22):2199-2210 View Article PubMed/NCBI
  23. Steiner DF, MacDonald R, Liu Y, Truszkowski P, Hipp JD, Gammage C, et al. Impact of Deep Learning Assistance on the Histopathologic Review of Lymph Nodes for Metastatic Breast Cancer. Am J Surg Pathol 2018;42(12):1636-1646 View Article PubMed/NCBI
  24. Basaad A, Basurra S, Vakaj E, Eldaly AK, Abdelsamea MM. A BERT-GNN Approach for Metastatic Breast Cancer Prediction Using Histopathology Reports. Diagnostics (Basel) 2024;14(13):1365 View Article PubMed/NCBI
  25. Challa B, Tahir M, Hu Y, Kellough D, Lujan G, Sun S, et al. Artificial Intelligence-Aided Diagnosis of Breast Cancer Lymph Node Metastasis on Histologic Slides in a Digital Workflow. Mod Pathol 2023;36(8):100216 View Article PubMed/NCBI
  26. van Dooijeweert C, Flach RN, Ter Hoeve ND, Vreuls CPH, Goldschmeding R, Freund JE, et al. Clinical implementation of artificial-intelligence-assisted detection of breast cancer metastases in sentinel lymph nodes: the CONFIDENT-B single-center, non-randomized clinical trial. Nat Cancer 2024;5(8):1195-1205 View Article PubMed/NCBI
  27. Das A, Nair MS, Peter DS. Batch Mode Active Learning on the Riemannian Manifold for Automated Scoring of Nuclear Pleomorphism in Breast Cancer. Artif Intell Med 2020;103:101805 View Article PubMed/NCBI
  28. Veta M, van Diest PJ, Willems SM, Wang H, Madabhushi A, Cruz-Roa A, et al. Assessment of algorithms for mitosis detection in breast cancer histopathology images. Med Image Anal 2015;20(1):237-248 View Article PubMed/NCBI
  29. Mantrala S, Ginter PS, Mitkari A, Joshi S, Prabhala H, Ramachandra V, et al. Concordance in Breast Cancer Grading by Artificial Intelligence on Whole Slide Images Compares With a Multi-Institutional Cohort of Breast Pathologists. Arch Pathol Lab Med 2022;146(11):1369-1377 View Article PubMed/NCBI
  30. Balkenhol MCA, Tellez D, Vreuls W, Clahsen PC, Pinckaers H, Ciompi F, et al. Deep learning assisted mitotic counting for breast cancer. Lab Invest 2019;99(11):1596-1606 View Article PubMed/NCBI
  31. Romo-Bucheli D, Janowczyk A, Gilmore H, Romero E, Madabhushi A. Automated Tubule Nuclei Quantification and Correlation with Oncotype DX risk categories in ER+ Breast Cancer Whole Slide Images. Sci Rep 2016;6:32706 View Article PubMed/NCBI
  32. Nateghi R, Danyali H, Helfroush MS. A deep learning approach for mitosis detection: Application in tumor proliferation prediction from whole slide images. Artif Intell Med 2021;114:102048 View Article PubMed/NCBI
  33. Li C, Wang X, Liu W, Latecki LJ. DeepMitosis: Mitosis detection via deep detection, verification and segmentation networks. Med Image Anal 2018;45:121-133 View Article PubMed/NCBI
  34. Sebai M, Wang X, Wang T. MaskMitosis: a deep learning framework for fully supervised, weakly supervised, and unsupervised mitosis detection in histopathology images. Med Biol Eng Comput 2020;58(7):1603-1623 View Article PubMed/NCBI
  35. Mahmood T, Arsalan M, Owais M, Lee MB, Park KR. Artificial Intelligence-Based Mitosis Detection in Breast Cancer Histopathology Images Using Faster R-CNN and Deep CNNs. J Clin Med 2020;9(3):749 View Article PubMed/NCBI
  36. Wang Y, Acs B, Robertson S, Liu B, Solorzano L, Wählby C, et al. Improved breast cancer histological grading using deep learning. Ann Oncol 2022;33(1):89-98 View Article PubMed/NCBI
  37. Elsharawy KA, Gerds TA, Rakha EA, Dalton LW. Artificial intelligence grading of breast cancer: a promising method to refine prognostic classification for management precision. Histopathology 2021;79(2):187-199 View Article PubMed/NCBI
  38. Ibrahim A, Jahanifar M, Wahab N, Toss MS, Makhlouf S, Atallah N, et al. Artificial Intelligence-Based Mitosis Scoring in Breast Cancer: Clinical Application. Mod Pathol 2024;37(3):100416 View Article PubMed/NCBI
  39. Jaroensri R, Wulczyn E, Hegde N, Brown T, Flament-Auvigne I, Tan F, et al. Deep learning models for histologic grading of breast cancer and association with disease prognosis. NPJ Breast Cancer 2022;8(1):113 View Article PubMed/NCBI
  40. Dominik L, Paulo César Ribeiro B, Robson Dettmann J, Célio Siman Mafra N, Isabela Passos Pereira Q, Samuel Santana S. Histological Grading of Breast Cancer Malignancy Using Automated Image Analysis and Subsequent Machine Learning. Glob J Med Res 2023;23(C3):39-45 View Article
  41. Han Z, Wei B, Zheng Y, Yin Y, Li K, Li S. Breast Cancer Multi-classification from Histopathological Images with Structured Deep Learning Model. Sci Rep 2017;7(1):4172 View Article PubMed/NCBI
  42. Abdolahi M, Salehi M, Shokatian I, Reiazi R. Artificial intelligence in automatic classification of invasive ductal carcinoma breast cancer in digital pathology images. Med J Islam Repub Iran 2020;34:140 View Article PubMed/NCBI
  43. Mercan E, Mehta S, Bartlett J, Shapiro LG, Weaver DL, Elmore JG. Assessment of Machine Learning of Breast Pathology Structures for Automated Differentiation of Breast Cancer and High-Risk Proliferative Lesions. JAMA Netw Open 2019;2(8):e198777 View Article PubMed/NCBI
  44. Tahir M, Hu Y, Kumar H, Shaker N, Kellough D, Goodman S, et al. A Comprehensive AI-Based Approach in Classifying Breast Lesions: Focusing on Improving Pathologists’ Accuracy and Efficiency. Clin Breast Cancer 2025;25(6):e818-e825 View Article PubMed/NCBI
  45. Hameed Z, Zahia S, Garcia-Zapirain B, Javier Aguirre J, María Vanegas A. Breast Cancer Histopathology Image Classification Using an Ensemble of Deep Learning Models. Sensors (Basel) 2020;20(16):4373 View Article PubMed/NCBI
  46. Abdulaal AH, Yassin RA, Valizadeh M, Abdulwahhab AH, Jasim AM, Mohammed AJ, et al. Cutting-Edge CNN approaches for breast histopathological classification: The impact of spatial attention mechanisms. ShodhAI J Artif Intell 2024;1(1):109-130 View Article
  47. Irmak G, Saygılı A. Deep learning-based histopathological classification of breast tumors: a multi-magnification approach with state-of-the-art models. Signal Image Video Process 2025;19(7):578 View Article
  48. Karimian B, Avanzato G, Belharbi S, Guichemerre A, McCaffrey L, Shateri M, et al. CLIP-IT: CLIP-based Pairing of Histology Images with Privileged Textual Information. arXiv 2025 View Article
  49. Jaikumar SS, Praveena SM. Breast cancer diagnosis from histopathological images and molecular signatures by fusing features with an explainable AI-based residual tabular network model. J Comput Aided Mol Des 2026;40(1):3 View Article PubMed/NCBI
  50. Turbin DA, Leung S, Cheang MC, Kennecke HA, Montgomery KD, McKinney S, et al. Automated quantitative analysis of estrogen receptor expression in breast carcinoma does not differ from expert pathologist scoring: a tissue microarray study of 3,484 cases. Breast Cancer Res Treat 2008;110(3):417-426 View Article PubMed/NCBI
  51. Gokhale S, Rosen D, Sneige N, Diaz LK, Resetkova E, Sahin A, et al. Assessment of two automated imaging systems in evaluating estrogen receptor status in breast carcinoma. Appl Immunohistochem Mol Morphol 2007;15(4):451-455 View Article PubMed/NCBI
  52. Press MF, Pike MC, Chazin VR, Hung G, Udove JA, Markowicz M, et al. Her-2/neu expression in node-negative breast cancer: direct tissue quantitation by computerized image analysis and association of overexpression with increased risk of recurrent disease. Cancer Res 1993;53(20):4960-4970 PubMed/NCBI
  53. Brügmann A, Eld M, Lelkaitis G, Nielsen S, Grunkin M, Hansen JD, et al. Digital image analysis of membrane connectivity is a robust measure of HER2 immunostains. Breast Cancer Res Treat 2012;132(1):41-49 View Article PubMed/NCBI
  54. Helin HO, Tuominen VJ, Ylinen O, Helin HJ, Isola J. Free digital image analysis software helps to resolve equivocal scores in HER2 immunohistochemistry. Virchows Arch 2016;468(2):191-198 View Article PubMed/NCBI
  55. Holten-Rossing H, Møller Talman ML, Kristensson M, Vainer B. Optimizing HER2 assessment in breast cancer: application of automated image analysis. Breast Cancer Res Treat 2015;152(2):367-375 View Article PubMed/NCBI
  56. Koopman T, Buikema HJ, Hollema H, de Bock GH, van der Vegt B. Digital image analysis of Ki67 proliferation index in breast cancer using virtual dual staining on whole tissue sections: clinical validation and inter-platform agreement. Breast Cancer Res Treat 2018;169(1):33-42 View Article PubMed/NCBI
  57. Hartage R, Li AC, Hammond S, Parwani AV. A Validation Study of Human Epidermal Growth Factor Receptor 2 Immunohistochemistry Digital Imaging Analysis and its Correlation with Human Epidermal Growth Factor Receptor 2 Fluorescence In situ Hybridization Results in Breast Carcinoma. J Pathol Inform 2020;11:2 View Article PubMed/NCBI
  58. Yue M, Zhang J, Wang X, Yan K, Cai L, Tian K, et al. Can AI-assisted microscope facilitate breast HER2 interpretation? A multi-institutional ring study. Virchows Arch 2021;479(3):443-449 View Article PubMed/NCBI
  59. Christgen M, von Ahsen S, Christgen H, Länger F, Kreipe H. The region-of-interest size impacts on Ki67 quantification by computer-assisted image analysis in breast cancer. Hum Pathol 2015;46(9):1341-1349 View Article PubMed/NCBI
  60. Stålhammar G, Fuentes Martinez N, Lippert M, Tobin NP, Mølholm I, Kis L, et al. Digital image analysis outperforms manual biomarker assessment in breast cancer. Mod Pathol 2016;29(4):318-329 View Article PubMed/NCBI
  61. Ács B, Madaras L, Kovács KA, Micsik T, Tőkés AM, Győrffy B, et al. Reproducibility and Prognostic Potential of Ki-67 Proliferation Index when Comparing Digital-Image Analysis with Standard Semi-Quantitative Evaluation in Breast Cancer. Pathol Oncol Res 2018;24(1):115-127 View Article PubMed/NCBI
  62. Stålhammar G, Robertson S, Wedlund L, Lippert M, Rantalainen M, Bergh J, et al. Digital image analysis of Ki67 in hot spots is superior to both manual Ki67 and mitotic counts in breast cancer. Histopathology 2018;72(6):974-989 View Article PubMed/NCBI
  63. Klauschen F, Wienert S, Schmitt WD, Loibl S, Gerber B, Blohmer JU, et al. Standardized Ki67 Diagnostics Using Automated Scoring—Clinical Validation in the GeparTrio Breast Cancer Study. Clin Cancer Res 2015;21(16):3651-3657 View Article PubMed/NCBI
  64. Koopman T, Buikema HJ, Hollema H, de Bock GH, van der Vegt B. What is the added value of digital image analysis of HER2 immunohistochemistry in breast cancer in clinical practice? A study with multiple platforms. Histopathology 2019;74(6):917-924 View Article PubMed/NCBI
  65. Røge R, Riber-Hansen R, Nielsen S, Vyberg M. Proliferation assessment in breast carcinomas using digital image analysis based on virtual Ki67/cytokeratin double staining. Breast Cancer Res Treat 2016;158(1):11-19 View Article PubMed/NCBI
  66. Bankhead P, Loughrey MB, Fernández JA, Dombrowski Y, McArt DG, Dunne PD, et al. QuPath: Open source software for digital pathology image analysis. Sci Rep 2017;7(1):16878 View Article PubMed/NCBI
  67. Humphries MP, Hynes S, Bingham V, Cougot D, James J, Patel-Socha F, et al. Automated Tumour Recognition and Digital Pathology Scoring Unravels New Role for PD-L1 in Predicting Good Outcome in ER-/HER2+ Breast Cancer. J Oncol 2018;2018:2937012 View Article PubMed/NCBI
  68. Tavolara TE, Niazi MKK, Tozbikian G, Wesolowski R, Gurcan MN. Predicting HER2 scores from registered HER2 and H&E images. In: Tomaszewski JE, Ward AD, Levenson RM (eds). Medical Imaging 2022: Digital and Computational Pathology. Vol. 12039. SPIE; 2022:120390C View Article
  69. Akbarnejad A, Ray N, Barnes PJ, Bigras G. Toward Accurate Deep Learning-Based Prediction of Ki67, ER, PR, and HER2 Status From H&E-Stained Breast Cancer Images. Appl Immunohistochem Mol Morphol 2025;33(3):131-141 View Article PubMed/NCBI
  70. Lodge M, Ironside A, Graham A, Polonia A, Reinhard S, Solass W, et al. Development & validation of an AI‑supported workflow for clinical scoring of HER2, ER, PR & Ki67 immunohistochemistry in breast cancer tissue. In: Proceedings of the 2023 San Antonio Breast Cancer Symposium; 2023 Dec 5‑9; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2024;84(9 Suppl):Abstract nr PO3‑07‑03
  71. Lu W, Lashen AG, Wahab N, Miligy IM, Jahanifar M, Toss M, et al. AI-based intra-tumor heterogeneity score of Ki67 expression as a prognostic marker for early-stage ER+/HER2- breast cancer. J Pathol Clin Res 2024;10(1):e346 View Article PubMed/NCBI
  72. Fernandez G, Prastawa M, Madduri AS, Scott R, Marami B, Shpalensky N, et al. Development and validation of an AI-enabled digital breast cancer assay to predict early-stage breast cancer recurrence within 6 years. Breast Cancer Res 2022;24(1):93 View Article PubMed/NCBI
  73. Ahn JS, Shin S, Yang SA, Park EK, Kim KH, Cho SI, et al. Artificial Intelligence in Breast Cancer Diagnosis and Personalized Medicine. J Breast Cancer 2023;26(5):405-435 View Article PubMed/NCBI
  74. McCaffrey C, Jahangir C, Murphy C, Burke C, Gallagher WM, Rahman A. Artificial intelligence in digital histopathology for predicting patient prognosis and treatment efficacy in breast cancer. Expert Rev Mol Diagn 2024;24(5):363-377 View Article PubMed/NCBI
  75. Lu C, Romo-Bucheli D, Wang X, Janowczyk A, Ganesan S, Gilmore H, et al. Nuclear shape and orientation features from H&E images predict survival in early-stage estrogen receptor-positive breast cancers. Lab Invest 2018;98(11):1438-1448 View Article PubMed/NCBI
  76. Albusayli R, Graham JD, Pathmanathan N, Shaban M, Raza SEA, Minhas F, et al. Artificial intelligence-based digital scores of stromal tumour-infiltrating lymphocytes and tumour-associated stroma predict disease-specific survival in triple-negative breast cancer. J Pathol 2023;260(1):32-42 View Article PubMed/NCBI
  77. Ivanova M, Pescia C, Trapani D, Venetis K, Frascarelli C, Mane E, et al. Early Breast Cancer Risk Assessment: Integrating Histopathology with Artificial Intelligence. Cancers (Basel) 2024;16(11):1981 View Article PubMed/NCBI
  78. Sharma A, Lövgren SK, Eriksson KL, Wang Y, Robertson S, Hartman J, et al. Validation of an AI-based solution for breast cancer risk stratification using routine digital histopathology images. Breast Cancer Res 2024;26(1):123 View Article PubMed/NCBI
  79. Mondol RK, Millar E, Sowmya A, Meijering E. MM-SurvNet: Deep Learning-Based Survival Risk Stratification in Breast Cancer Through Multimodal Data Fusion. IEEE J Biomed Health Inform 2024;28(9):5290-5302 View Article PubMed/NCBI
  80. Zhang R, Wang K, Wang S, Wang C, Cao T, Ci C, et al. Multimodal deep learning model for prediction of breast cancer recurrence risk and correlation with oncotype DX. Breast Cancer Res 2025;27(1):178 View Article PubMed/NCBI
  81. Guo Y, Su Z, Koyun OC, Lu H, Wesolowski R, Tozbikian G, et al. BPMambaMIL: A bio-inspired prototype-guided multiple instance learning for oncotype DX risk assessment in histopathology. Comput Methods Programs Biomed 2025;272:109039 View Article PubMed/NCBI
  82. Su Z, Niazi MKK, Tavolara TE, Niu S, Tozbikian GH, Wesolowski R, et al. BCR-Net: A deep learning framework to predict breast cancer recurrence from histopathology images. PLoS One 2023;18(4):e0283562 View Article PubMed/NCBI
  83. Williams AD, Pawloski KR, Wen HY, Sevilimedu V, Thompson D, Morrow M, et al. Use of a supervised machine learning model to predict Oncotype DX risk category in node-positive patients older than 50 years of age. Breast Cancer Res Treat 2022;196(3):565-570 View Article PubMed/NCBI
  84. Su Z, Guo Y, Wesolowski R, Tozbikian G, O’Connell NS, Khan Niazi MK, et al. Computational Pathology for Accurate Prediction of Breast Cancer Recurrence: Development and Validation of a Deep Learning-Based Tool. Mod Pathol 2025;38(12):100847 View Article PubMed/NCBI
  85. Barseghyan S, Babajanyan A, Balyan Z, Asatryan A, Kumar S, et al. Explainable AI for breast cancer risk prediction: evaluating the accuracy-explainability trade-off. Mach Learn Comput Sci Eng 2025;1:25 View Article
  86. Liu Y, Han D, Parwani AV, Li Z. Applications of Artificial Intelligence in Breast Pathology. Arch Pathol Lab Med 2023;147(9):1003-1013 View Article PubMed/NCBI
  87. Khan H, Su Z, Zhang H, Wang Y, Ning B, Wei S, et al. Predicting Neoadjuvant Chemotherapy Response in Triple-Negative Breast Cancer Using Pre-Treatment Histopathologic Images. Cancers (Basel) 2025;17(15):2423 View Article PubMed/NCBI
  88. Huang Z, Shao W, Han Z, Alkashash AM, De la Sancha C, Parwani AV, et al. Artificial intelligence reveals features associated with breast cancer neoadjuvant chemotherapy responses from multi-stain histopathologic images. NPJ Precis Oncol 2023;7(1):14 View Article PubMed/NCBI
  89. Soliman A, Li Z, Parwani AV. Artificial intelligence’s impact on breast cancer pathology: a literature review. Diagn Pathol 2024;19(1):38 View Article PubMed/NCBI
  90. Xu Z, Zhou Z, Son JB, Feng H, Adrada BE, Moseley TW, et al. Deep Learning Models Based on Pretreatment MRI and Clinicopathological Data to Predict Responses to Neoadjuvant Systemic Therapy in Triple-Negative Breast Cancer. Cancers (Basel) 2025;17(6):966 View Article PubMed/NCBI
  91. Jimenez JE, Abdelhafez A, Mittendorf EA, Elshafeey N, Yung JP, Litton JK, et al. A model combining pretreatment MRI radiomic features and tumor-infiltrating lymphocytes to predict response to neoadjuvant systemic therapy in triple-negative breast cancer. Eur J Radiol 2022;149:110220 View Article PubMed/NCBI
  92. Krishnamurthy S, Jain P, Tripathy D, Basset R, Randhawa R, Muhammad H, et al. Predicting Response of Triple-Negative Breast Cancer to Neoadjuvant Chemotherapy Using a Deep Convolutional Neural Network-Based Artificial Intelligence Tool. JCO Clin Cancer Inform 2023;7:e2200181 View Article PubMed/NCBI
  93. Wang B, Chen S, Li W. Predicting breast cancer treatment response and prognosis using AI-based image classification. Front Oncol 2025;15:1619994 View Article PubMed/NCBI
  94. Maley CC, Koelble K, Natrajan R, Aktipis A, Yuan Y. An ecological measure of immune-cancer colocalization as a prognostic factor for breast cancer. Breast Cancer Res 2015;17(1):131 View Article PubMed/NCBI
  95. Heindl A, Sestak I, Naidoo K, Cuzick J, Dowsett M, Yuan Y. Relevance of Spatial Heterogeneity of Immune Infiltration for Predicting Risk of Recurrence After Endocrine Therapy of ER+ Breast Cancer. J Natl Cancer Inst 2018;110(2):166-175 View Article PubMed/NCBI
  96. Le H, Gupta R, Hou L, Abousamra S, Fassler D, Torre-Healy L, et al. Utilizing Automated Breast Cancer Detection to Identify Spatial Distributions of Tumor-Infiltrating Lymphocytes in Invasive Breast Cancer. Am J Pathol 2020;190(7):1491-1504 View Article PubMed/NCBI
  97. Eweje F, Li Z, Yuan K, Olguin F, Bergstrom C, Nirschl J, et al. Artificial intelligence-enabled spatial tumor microenvironment profiling predicts response to immunotherapy in invasive breast carcinoma. Lab Invest 2025;105(3):102367 View Article
  98. Xu Y, Wang Y, Zhou F, Ma J, Jin C, Yang S, et al. A multimodal knowledge-enhanced whole-slide pathology foundation model. Nat Commun 2025;16(1):11406 View Article PubMed/NCBI
  99. Couture HD, Williams LA, Geradts J, Nyante SJ, Butler EN, Marron JS, et al. Image analysis with deep learning to predict breast cancer grade, ER status, histologic subtype, and intrinsic subtype. NPJ Breast Cancer 2018;4:30 View Article PubMed/NCBI
  100. Farahmand S, Fernandez AI, Ahmed FS, Rimm DL, Chuang JH, Reisenbichler E, et al. Deep learning trained on hematoxylin and eosin tumor region of Interest predicts HER2 status and trastuzumab treatment response in HER2+ breast cancer. Mod Pathol 2022;35(1):44-51 View Article PubMed/NCBI
  101. Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, et al. Molecular portraits of human breast tumours. Nature 2000;406(6797):747-752 View Article PubMed/NCBI
  102. Bychkov D, Linder N, Tiulpin A, Kücükel H, Lundin M, Nordling S, et al. Deep learning identifies morphological features in breast cancer predictive of cancer ERBB2 status and trastuzumab treatment efficacy. Sci Rep 2021;11(1):4037 View Article PubMed/NCBI
  103. Wang X, Zou C, Zhang Y, Li X, Wang C, Ke F, et al. Prediction of BRCA Gene Mutation in Breast Cancer Based on Deep Learning and Histopathology Images. Front Genet 2021;12:661109 View Article PubMed/NCBI
  104. Huang TC, Hsu TC, Hsieh YH, Che-Lin. Utilizing Graph Neural Networks for Breast Cancer Prognosis Prediction with High-dimensional Genomic Data. Annu Int Conf IEEE Eng Med Biol Soc 2023;2023:1-4 View Article PubMed/NCBI
  105. Wang X, Yan B, Yang J, Cheng Z, Song W, Yang Y, et al. Multi-omics integration and machine learning define robust molecular subtypes and prognostic signatures in hepatocellular carcinoma. J Transl Med 2025;24(1):207 View Article PubMed/NCBI

About this Article

Cite this article
Hu Y, Zhu A, Wesolowski R, Tahir M, Tozbikian G, Parwani AV, et al. Artificial Intelligence in Breast Pathology: Recent Advances in Multimodal Models, Explainability, and Clinical Applications. J Clin Transl Pathol. Published online: May 20, 2026. doi: 10.14218/JCTP.2026.00007.
Copy        Export to RIS        Export to EndNote
Article History
Received Revised Accepted Published
January 25, 2026 March 8, 2026 April 23, 2026 May 20, 2026
DOI http://dx.doi.org/10.14218/JCTP.2026.00007
  • Journal of Clinical and Translational Pathology
  • pISSN 2993-5202
  • eISSN 2771-165X
Back to Top

Artificial Intelligence in Breast Pathology: Recent Advances in Multimodal Models, Explainability, and Clinical Applications

Yan Hu, Alan Zhu, Robert Wesolowski, Maryam Tahir, Gary Tozbikian, Anil V. Parwani, Ziyu Su, Khalid Niazi, Zaibo Li
  • Reset Zoom
  • Download TIFF