Overview

The Visual Bias Mitigator is an open-source framework designed to empower researchers in the field of bias mitigation in computer vision. This codebase provides a comprehensive environment where users can easily implement, run, and evaluate existing visual bias mitigation methods.

With the increasing awareness of bias in AI systems, it is crucial for researchers to have access to robust tools that facilitate the exploration and development of mitigation approaches. The Visual Bias Mitigator (VB-Mitigator) serves this purpose by offering:

🚀 Implemented Methods: A collection of established visual bias mitigation methods that can be directly utilized, allowing researchers to replicate and understand their functionality.
🔧 Extensibility: Researchers can exploit this code-base to develop custom bias mitigation approaches tailored to their specific needs. The framework is designed with flexibility in mind, enabling easy integration of new approaches.
📊 Performance Comparison: The framework facilitates the performance comparison between custom methods and state-of-the-art.

The aim of this repository is to facilitate research in the domain of visual bias mitigation. By providing a comprehensive codebase that allows researchers to easily implement and build upon existing methodologies, we encourage the development of new approaches for addressing biases in computer vision tasks.

Quickstart

Get started with Visual Bias Mitigator quickly:

Clone the git repository:

git clone https://github.com/gsarridis/vb-mitigator.git

Create a virtual environment using either pip or conda and install the required packages:

# create a virtual conda environmnet
conda create -n vb-mitigator python=3.11

# activate the environment
conda activate vb-mitigator

# install the required packages
pip install -r requirements.txt

Run a sample script:

# run BAdd method on UTKFace dataset
bash ./scripts/utkface/badd/badd.sh

Check logs for results and metrics. The output is stored in the outputs/utkface_baselines/badd directory.

Output Structure:

├── outputs
│   ├── utkface_baselines
│   │   ├── badd
│   │   │   ├── logs.csv
│   │   │   ├── out.log
│   │   │   ├── best.pth
│   │   │   ├── latest.pth
│   │   │   └── train.events

Modules

Methods

Currently, Visual Bias Mitigator offers the following methodologies for mitigating biases in deep learning models:

Method	Full Name
ERM	Empirical Risk Minimization
GroupDro	Group Distributionally Robust Optimization
Debian	Debiasing Alternate Networks
DI	Domain Independent
EnD	Entangling and Disentangling
LfF	Learning from Failure
SD	Spectral Decouple
BB	Bias Balance
FLAC	Fairness Aware Representation Learning
BAdd	Bias Addition
Mavias	Mitigate any Visual Bias

To add a new method, follow the instructions in Add New Method.

Datasets

Currently, Visual Bias Mitigator supports the following datasets:

Name	Biases	Target
Biased UTKFace	Race or Age	Gender
Biased CelebA	Gender	Blonde or Makeup
Waterbirds	Background (Water or Land)	Bird Species
Biased MNIST	Background Colour	Digits
FB-Biased MNIST	Background and Foreground Colours	Digits
ImageNet9	Unknown	Dog, Bird, Vehicle, Reptile, Carnivore, Insect, Instrument, Primate, Fish

Note that all datasets are automatically downloaded and preprocessed by the framework. The only extra requirement is for ImageNet9, which requires manually download the ImageNet dataset beforehand.

To add a custom dataset, follow the instructions in Add New Dataset.

Evaluation Metrics

Metrics used for fairness evaluation include:

Metric	Description
Accuracy	Measures the overall model performance. This metric can be used for balanced test sets.
Unb/BA/BC Accuracy	Evaluates the model’s performance on unbiased, bias-aligned, and bias-conflicting samples.
WG and Ovr Accuracy	Evaluates the least-performing subgroup and overall model performance.

To add a new metric, follow the instructions in Add New Metric.

Configuration

The configuration for the Visual Bias Mitigator is managed through a cfg.py file utilizing the YACS (Yet Another Configuration System) library. This design allows for easy customization of various parameters related to experiments, models, datasets, and mitigators.

Key Components of the Configuration

Experiment Settings: Specify project details, experiment names, GPU settings, and random seed for reproducibility.
Model Configuration: Define the type of model to be used (e.g., ResNet) and whether to use pretrained weights.
Solver Parameters: Set up hyperparameters for training, including batch size, learning rate, weight decay, momentum, and training duration.
Logging Options: Configure logging details such as the frequency of TensorBoard updates, checkpoint saving intervals, and logging directory.
Dataset Specifications: Define the dataset type and associated biases, along with any dataset-specific parameters, such as image size and root directories for data.
Mitigator Settings: Configure the bias mitigation approach, specifying the type of mitigator and any additional parameters relevant to specific mitigation methods.

Example Usage

The configuration can be accessed and displayed using the show_cfg function, which logs the current configuration parameters to help users understand their settings. Here’s a brief example of how this is done:

from yacs.config import CfgNode as CN
from tools.utils import log_msg

def show_cfg(cfg, logger):
    dump_cfg = CN()
    dump_cfg.EXPERIMENT = cfg.EXPERIMENT
    dump_cfg.MODEL = cfg.MODEL
    dump_cfg.DATASET = cfg.DATASET
    dump_cfg.MITIGATOR = cfg.MITIGATOR
    dump_cfg.SOLVER = cfg.SOLVER
    dump_cfg.LOG = cfg.LOG
    if cfg.MITIGATOR.TYPE in cfg:
        dump_cfg.update({cfg.MITIGATOR.TYPE: cfg.get(cfg.MITIGATOR.TYPE)})
    log_msg("CONFIG:\n{}".format(dump_cfg.dump()), "INFO", logger)


# CFG example

CFG = CN()
CFG.EXPERIMENT = CN()
CFG.EXPERIMENT.PROJECT = "biased_mnist"
CFG.EXPERIMENT.NAME = "dev"
CFG.EXPERIMENT.TAG = "flac"
CFG.EXPERIMENT.GPU = "cuda:0"
CFG.EXPERIMENT.SEED = 1

Logs

The logging system is designed to store and organize all experiment-related outputs in a structured and accessible format. This setup facilitates tracking, visualization, and reproducibility of results. Each experiment’s outputs are stored under the outputs directory with the following structure:

├── outputs
│   ├── <dataset>_baselines
│   │   ├── <method>>
│   │   │   ├── logs.csv
│   │   │   ├── out.log
│   │   │   ├── best.pth
│   │   │   ├── latest.pth
│   │   │   └── train.events

Description of Files

`logs.csv` Contains a tabular record of the training process, including key metrics such as accuracy, loss, and validation performance for each epoch. This format is ideal for further analysis and visualization with external tools.
`out.log` A detailed log file capturing all information about the experiment run, including configuration settings, training progress, validation results, and any warnings or errors encountered.
`best.pth` Stores the model checkpoint with the best performance during training, based on the defined evaluation metric (e.g., validation accuracy).
`latest.pth` Stores the most recent model checkpoint, allowing experiments to resume from the latest training state.
`train.events` TensorBoard event file, storing detailed training and validation statistics for visualization through TensorBoard. This includes metrics such as loss curves and performance trends.

Benefits

Readable Logs: The out.log and logs.csv files are human-readable, with the latter providing a structured tabular format to facilitate data processing and visualization.
Visualization: The train.events file integrates seamlessly with TensorBoard for real-time monitoring and analysis.
Reproducibility: By saving both best.pth and latest.pth, users can easily reproduce results or continue training.
Organized Structure: The directory hierarchy organizes outputs by experiment and method, ensuring clarity and ease of navigation.

Note that, VB-Mitigator also supports Wandb logging for real-time monitoring and visualization of experiments. To enable Wandb logging, set LOG.WANDB to True in the configuration file.

Integrate New Components

Method

To integrate a new method into the framework, you can use the BaseTrainer class provided in mitigators/base_trainer.py. This base class defines abstract functions for every step in the training pipeline, allowing you to override and customize the steps for your specific method. Below are the steps to add a new method:

Step 1: Create a New Method File

Navigate to the mitigators directory.
Create a new Python file for your method (e.g., new_method.py).

Inherit the BaseTrainer class and override the functions where your new method intervenes. For example:

from .base_trainer import BaseTrainer

class NewMethod(BaseTrainer):

    def _train_iter(self, batch):
        # Override to define a custom training iteration
        pass

Step 2: Add the Method to `mitigators/__init__.py`

Open mitigators/__init__.py.

Register the new method in the dictionary that maps method names to their respective classes. For example:

from .new_method import NewMethod

method_to_trainer = {
    "erm": ERM,
    "new_method": NewMethod,  # Add your method here
}

Step 3: Add Default Hyperparameters to `./configs/cfg.py`

Open cfg.py.

Add a new configuration node for your method under CFG.MITIGATOR. For example:

CFG.MITIGATOR.NEW_METHOD = CN()
CFG.MITIGATOR.NEW_METHOD.ALPHA = 0.1  # Example hyperparameter
CFG.MITIGATOR.NEW_METHOD.BETA = 0.5  # Example hyperparameter

Step 4: Create a Config File for the Experiment

Navigate to the configs/<dataset> directory.
Create a new directory for your method (e.g., new_method).

Create a YAML file for the experiment (e.g., dev.yaml). Define the configuration settings specific to your experiment. For example:

EXPERIMENT:
  NAME: "new_method"
  TAG: "dev"
  PROJECT: "biased_mnist_baselines"
DATASET:
  TYPE: "biased_mnist"
  BIASES: ["color"]
MITIGATOR:
  TYPE: "new_method"
SOLVER:
  BATCH_SIZE: 64
  EPOCHS: 80
  LR: 0.001
  TYPE: "Adam"
  SCHEDULER:
    LR_DECAY_STAGES: [13, 56]
    LR_DECAY_RATE: 0.1
MODEL:
  TYPE: "simple_conv"
METRIC: "acc"

Step 5: Create a Script for the Experiment

Navigate to the scripts/<dataset>/ directory.
Create a new directory for your method (e.g., new_method).
Inside the directory, create a shell script (e.g., exp.sh) to run the experiment. For example:
```
#!/bin/bash
python tools/train.py --cfg configs/biased_mnist/new_method/dev.yaml
```

Summary

By following these steps, you will:

Define your new method in a modular and extensible way.
Register the method to make it available for use.
Configure and script an experiment for testing your method.

Dataset

The framework provides flexibility for adding new datasets. Follow these steps to integrate a new dataset into the framework:

Create the Dataset Class - Implement the dataset class in datasets/<new_dataset>.py. - In the __getitem__ method, return a dictionary in the following format:

def __getitem__(self, index):
    img, target, bias = (
        self.data[index],
        int(self.targets[index]),
        int(self.biased_targets[index]),
    )
    img = Image.fromarray(img.astype(np.uint8), mode="RGB")

    if self.transform is not None:
        img = self.transform(img)

    if self.target_transform is not None:
        target = self.target_transform(target)

    return {"inputs": img, "targets": target, "<bias_name>"": bias, "index": index}

Update the Dataset Builder - Modify datasets/builder.py to include your new dataset. - Add the logic to create dataloaders for training, validation, and testing. For Biased MNIST example:

train_loader = get_color_mnist(
    cfg.DATASET.BIASED_MNIST.ROOT,
    batch_size=cfg.SOLVER.BATCH_SIZE,
    data_label_correlation=cfg.DATASET.BIASED_MNIST.CORR,
    n_confusing_labels=9,
    split="train",
    seed=cfg.EXPERIMENT.SEED,
    aug=False,
)

val_loader = get_color_mnist(
    cfg.DATASET.BIASED_MNIST.ROOT,
    batch_size=256,
    data_label_correlation=0.1,
    n_confusing_labels=9,
    split="train_val",
    seed=cfg.EXPERIMENT.SEED,
    aug=False,
)

test_loader = get_color_mnist(
    cfg.DATASET.BIASED_MNIST.ROOT,
    batch_size=256,
    data_label_correlation=0.1,
    n_confusing_labels=9,
    split="valid",
    seed=cfg.EXPERIMENT.SEED,
    aug=False,
)

Construct a dataset dictionary with the following structure:

dataset = {}
dataset["num_class"] = 10
dataset["num_groups"] = 10 * 10
dataset["biases"] = ["background"]
dataset["dataloaders"] = {
    "train": train_loader,
    "val": val_loader,
    "test": test_loader,
}
dataset["root"] = cfg.DATASET.BIASED_MNIST.ROOT
dataset["target2name"] = {
    0: "number zero",
    1: "number one",
    2: "number two",
    3: "number three",
    4: "number four",
    5: "number five",
    6: "number six",
    7: "number seven",
    8: "number eight",
    9: "number nine"
}

Update Configuration - Add any default hyperparameters specific to the new dataset to cfg.py under the CFG.DATASET.<NEW_DATASET> section. - For example:

CFG.DATASET.NEW_DATASET = CN()
CFG.DATASET.NEW_DATASET.ROOT = "./data/new_dataset"
CFG.DATASET.NEW_DATASET.IMAGE_SIZE = 224
CFG.DATASET.NEW_DATASET.BIAS = "bias_type"
CFG.DATASET.NEW_DATASET.TARGET = "target_attribute"

Create Experiment Configuration - Add a new configuration file in the configs directory specific to the new dataset.
Add Experiment Script - Create a script for running experiments with the new dataset in ./scripts/<new_dataset>/<method>/exp.sh.

By following these steps, you can seamlessly integrate new datasets into the framework while maintaining compatibility with its existing structure.

Metric

To integrate a new metric into the framework, follow these steps:

Define the Metric Function - Create a function for the new metric at tools/metrics/<new_metric.py>. - The function should compute the metric and return a dictionary containing the metric’s value(s). Example:
def new_metric(data_dict): predictions = data_dict["predictions"] targets = data_dict["targets"] metric_value = some_computation(predictions, targets) # you can return multiple values as a dictionary return {"new_metric_name": metric_value}
- Additionally, define a dictionary that specifies optimization preferences (e.g., whether the metric is “high” or “low” for better performance (e.g. performance or error based metrics), and the value wrt which the best model is defined). For example:
```
new_metric_dict = {"best": "high", "performance": "new_metric_name"}
```

Register the Metric - Add the metric dictionary and function to the metrics/__init__.py file by including them in metrics_dicts and get_performance:

from .new_metric import new_metric_dict, new_metric

metrics_dicts = {
    "acc": acc_dict,
    "unb_bc_ba": unb_bc_ba_dict,
    "wg_ovr": wg_ovr_dict,
    # Register your new metric dictionary
    "new_metric": new_metric_dict,
}

get_performance = {
    "acc": acc,
    "unb_bc_ba": unb_bc_ba,
    "wg_ovr": wg_ovr,
    # Register your new metric function
    "new_metric": new_metric,
}

Select the metrics in cfg - If the new metric requires specific hyperparameters, add them to the configuration file (cfg.py) under the appropriate section. For example:
METRIC: "new_metric"

By following these steps, you can extend the framework with new metrics to suit your evaluation needs.

Contributing

We welcome contributions! The ultimate goal of the Visual Bias Mitigator is to provide a comprehensive environment for bias mitigation research in computer vision and thus requires a community effort to intergrate new methods, datasets, and metrics.

To contribute:

Fork the repository.
Integrate your new feature.
Submit a pull request with a detailed description.

FAQ

Q: Can I use this for non-CV tasks?

A: Currently, Visual Bias Mitigator is optimized for computer vision but can be adapted for other tasks with custom methods.

Changelog

Version 1.0.0:

Initial release with support for 10 competitive bias mitigation approaches and 6 benchmarks.
Extensive documentation and tutorials for easy integration and usage.
Support for custom datasets, metrics, and methods.

Acknowledgments

This research was supported by the EU Horizon Europe projects MAMMOth (Grant Agreement 101070285) and ELIAS (Grant Agreement 101120237).