Mitigating Domain shifts -

Deep neural networks often perform well on trained data. However, on unseen data they usually fail to generalize and accompany performance degradation (Vu et al., 2019). This degradation of performance affects systems deployed in real-world environments such as processing images for self-driving cars, processing street views, generating text, and examining cells and tissues through various scanners deployed.

Figure 1: Illustration of domain adaptation and generalization mechanisms. Every color is indicative of sample with domain shift with pink consistently representing the test-data. Legends 'training' and test-time' provide an insight of the source training and target adaptation process.

To deal with such scenarios, domain adaptation (Figure 1a) and domain generalization emerged. Domain adaptation (Pandey et al., 2020) assumes access to target data and focuses on training a source model on the source domain, indicated with blue and pink samples in the training legend. At test time, it only focuses on evaluating the trained model on precomputed test data.
Common domain adaptation techniques focus on utilizing unlabeled target data. This assumption does not hold true in most of the cases since access to target data is only sometimes available. For instance, in scenarios such as self-driving cars (Vu et al., 2019), it is hard to collect all variations of road, weather, and scenes. Hence, domain generalization emerged to counter this.

Domain generalization (Muandet et al., 2013) (Figure 1b) assumes no access to target data while training a source model on multiple source domains indicated with different colors in the training legend. At test time, akin to domain adaptation, they evaluate precomputed test data. Existing methods focus on aligning the source distributions for invariant learning, training the model with meta-learning, augmenting domain data to resemble target or increase variations, and simulating the target data. However, since one cannot access target data or its distribution, these methods often come across overfitting and performance issues on the unseen target domain, known as the adaptivity gap.

To get closer to practical scenarios where access to target data is often not always precomputed, a recent paradigm Test-time adaptation (Wang et al., 2020) emerged (Figure 1c). In test-time adaptation, the source model is finetuned on target data as and when it arrives in small batches, as shown in the Test-time legend. For instance in scenarios this is the usual case in medical imaging applications where the imaging data from scanners is often available as and when the patient arrives. Moreover, the focus is to make the source model more target-specific by finetuning the model on target data. However, since access to such an unseen batch of samples is unlaballed, finetuning the model is challenging. Common methods utilize source model predictions and entropy on these batches to finetune the model.

Test-time generalization (Ambekar et al., 2023) (Figure 1d) focuses on training source models on multiple source domains during training while following the test-time adaptation setting at test-time. Recent methods such as (Ambekar et al., 2023; Xiao et al., 2022) focus on simulating domain shifts and test-domain and meta-learn the ability to generalize on unseen test data. Moreover, several methods also focus on addressing pseudo labeling, consistency, clustering, and self-supervision with auxiliary tasks.

Datasets and applications

Common tasks in deep learning include but not limited to Classification, speech recognition, medical imaging, computer vision, natural language processing. Moreover the common datasets utilized to evaluate method in classification and segmentation include DomainNet, Office-Home, Cifar corrupted, PACS, VLCS, Office-31, and NICO++.

Related topics

Transfer learning: One usually assumes access to labeled target data to evaluate and finetune the model. However, one doesn't assume access to labelled target data to fine-tune the model for all the paradigms above.

Zero-shot learning: This focuses on unseen label space changes between source and target, while the paradigms above focus primarily on generalizing the model to the unseen domain where new label space can seldom be a part of it.

Semi-supervised learning: Here, one assumes access to partial target set where the training and testing distributions come from the same distribution. However, the above paradigms do not focus on addressing the same distributions.

In summary, well-performing target models adapting to unseen target data is an omnipresent problem. Addressing these while being reminiscent of practical scenarios while being efficient remains an open problem.

References

Pandey, Prashant, Aayush Kumar Tyagi, Sameer Ambekar, and A. P. Prathosh. "Unsupervised domain adaptation for semantic segmentation of NIR images through generative latent search." In ECCV 2020

Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, and Han Zhao. Learning invariant representations and risks for semi-supervised domain adaptation. CVPR 2021.

Ambekar, Sameer, Zehao Xiao, Jiayi Shen, Xiantong Zhen, and Cees GM Snoek. "Learning Variational Neighbor Labels for Test-Time Domain Generalization.", CoLLAs 2024

Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, and Patrick Pérez. Advent: Adversarial entropy minimization for domain adaptation in semantic segmentation. CVPR 2019.

Dequan Wang, Evan Shelhamer, Shaoteng Liu, Bruno Olshausen, and Trevor Darrell. Tent: Fully test-time adaptation by entropy minimization. ICLR 2021

Abhimanyu Dubey, Vignesh Ramanathan, Alex Pentland, and Dhruv Mahajan. Adaptive methods for real-world domain generalization. CVPR 2021

Muandet, Krikamol, David Balduzzi, and Bernhard Schölkopf. "Domain generalization via invariant feature representation." ICML 2013.

Xiao, Zehao, Xiantong Zhen, Ling Shao, and Cees GM Snoek. "Learning to generalize across domains on single test samples." ICLR 2022.

June 26, 2026 / Tzu-Yuan Huang

Robotic Decision Making via Diffusion Models

Machine learning for robot decision making In recent years, an increasing amount of research1 has focused on enabling robots to perform diverse tasks in complex environments using machine learning (ML)-based techniques. Major technology companies, such as NVIDIA and Tesla, are also advancing efforts to introduce service robots capable of extensive human interaction in daily life. The ability … Read more

... more

June 3, 2026 / Lukas Gosch

A Beginner’s Guide to Certifiable Robustness

Image Credit: Generated by Google Gemini 3. Machine Learning (ML) models will be a cornerstone of our technical progress in this and the following decades. Especially since the launch of ChatGPT in November 2022, the transformative power of these models across a wide range of areas in our society has become clear to the wider public. What … Read more

... more

April 15, 2026 / Balian He

Responsible Textual Generative Models (Part I): Generating Truthful Content

Figure 1: Multimodal illustration (MLLM). Subfigure (a): intrinsic hallucination—the output is inconsistent with the input (no fence appears in the image). Subfigure (b): extrinsic hallucination—the output adds a geographic claim that conflicts with a widely accepted fact (the species is associated with North America, not the United Kingdom). Source: Adapted from Ji et al. (2023). 2 The … Read more

... more

March 17, 2026 / Aswathi

Random Convolutions: A Simple Way to Boost Generalization

Figure 1: Source: [2] AI and deep learning have recently transformed medical imaging by enabling automated analysis of complex radiological data, such as detecting lesions, segmenting organs, and predicting disease progression. These methods learn visual representations directly from large datasets and have achieved impressive results across many clinical tasks. In standard computer vision tasks, deep learning models … Read more

... more

January 30, 2026 / Ahmed Abdelrahman

Neuromorphic Computing: A Brain-inspired Approach to Robot Intelligence

Figure 1: Depiction of a humanoid robot and brain-inspired neural networks. (Note: The Craiyon tool was used to generate the image of the robot.) Looking to the Brain for Next-Gen AI With the explosive advent of artificial intelligence (AI), from impressively articulate conversational agents to increasingly autonomous robots of various embodiments, it is easy to forget the … Read more

... more

November 5, 2025 / Shengqiang Zhang

Introduction to Embodied Instruction Following

Figure: A home robot helps to place the book following human instruction. The figure is generated by Gemini 2.5 Flash AI model. Imagine asking your home robot: ”Hey, robot – can you go check if there is a blue book on the table? If so, please place it on the shelf.” This isn’t just a scene from … Read more

... more

July 7, 2025 / Maximilian Fleissner

From Unlucky Strikers to Statistical Learning Theory

Figure: A footbal fan excited for his team. Image generated by an AI model. Suppose a new striker joins your favorite Bundesliga team. Fans are excited, the club has paid an enormous transfer fee, and expectations are huge. The new season starts. And then, he only scores a single goal in his first ten games. As a … Read more

... more

June 23, 2025 / Unai Fischer Abaigar

Performative Prediction

Performative Prediction Machine learning systems are increasingly used to support decision-making processes (Fischer-Abaigar et al., 2024). Yet, these systems do not merely reflect the world—they also reshape it. Once deployed, predictions can influence behaviors, alter policies, and redirect resources, creating feedback loops that change the very data-generating processes they aim to model. Consider a traffic routing application … Read more

... more

October 10, 2024 / Moritz Knolle

What even is differential privacy?

Machine learning (ML) technologies are set to revolutionize various fields and sectors. ML models can learn from text, image and various other forms of data by automatically detecting patterns. Their successful application, however, relies heavily on access to extremely large datasets (some state-of-the-art language models are trained on the whole internet). For many interesting applications, such datasets … Read more

... more

July 23, 2024 / Lisa Wimmer

A gentle introduction to uncertainty quantification

Success stories about artificial intelligence (AI) focus on its remarkable predictive power. Take, for instance, your smartphone’s ability to recognize your face on photos and collect them into a “Selfies” folder ready to supply snaps for social media. When it comes to more safety-critical tasks, like using facial recognition for security at a high-stakes research lab, simple … Read more

... more

July 18, 2024 / Blog Editorial Team

Welcome to the relAI Blog

Welcome to the relAI blog of the Konrad Zuse School of Excellence in Reliable AI (relAI). This blog will serve as a platform to share cutting-edge research and developments from our school, highlighting the significant strides we are making towards making AI systems safer, more trustworthy, and privacy-preserving. The vision of the relAI program is to train … Read more

... more

Author: Sameer Ambekar

Datasets and applications

Related topics

References

RELATED

Robotic Decision Making via Diffusion Models

A Beginner’s Guide to Certifiable Robustness

Responsible Textual Generative Models (Part I): Generating Truthful Content

Random Convolutions: A Simple Way to Boost Generalization

Neuromorphic Computing: A Brain-inspired Approach to Robot Intelligence

Introduction to Embodied Instruction Following

From Unlucky Strikers to Statistical Learning Theory

Performative Prediction

What even is differential privacy?

A gentle introduction to uncertainty quantification

Welcome to the relAI Blog