Setup

The following guide shows how to setup an environment in order to train or evaluate Deep-Learning models using the GOOSE Datasets.

Installation

In this guide, we use the SuperGradients [1] to conveniently load and evaluate different models. You can use either conda or venv to create a new virtual environment with the following dependencies:

name: sg_env
channels:
  - pytorch
  - nvidia
dependencies:
  - python==3.9
  - pytorch==1.13.1
  - torchvision==0.14.1
  - pytorch-cuda==11.7
  - pip
  - pip:
    - super_gradients==3.2.0
    - matplotlib
    - pillow
    - numpy
    - torchmetrics==0.8.0

Save the requirements as file and create the environment (e.g. using conda env create -f env.yaml). Then activate the environment (e.g. using conda activate sg_env) and you are good to go.

Download Dataset

License

The GOOSE Datset is published under the CC BY-SA 4.0 License.

You can directly download the preconfigured and zipped raw image, raw point cloud, and ground truth files:

GOOSE 2D Images

GOOSE 3D Point Clouds

GOOSE-Ex 2D Images

GOOSE-Ex 3D Point Clouds

Read our page on the Dataset Structure to understand how the annotated datasets are organized.

Pretrained Weights

We provide pretrained weights for some network architectures both for 2D and 3D semantic segmentation.

2D Image Segmentation

We mainly evaluate our dataset on ppliteseg and ddrnet networks, which both have a good tradeoff between realtime capabilities and quality.

  • PP-LiteSeg [2] uses an encoder-decoder structure with a lightweight attention-based fusion model.
  • DDRNet [3] uses a typical two-stream architecture which fuses both branches at different depths within the network.
Model Model Name [Download] Dataset Resolution # Classes mIoU / %
PP-LiteSeg ppliteseg_category_512 GOOSE-2D 512x512 12 67.21
PP-LiteSeg ppliteseg_class_512 GOOSE-2D 512x512 64 45.09
DDRNet ddrnet_category_512 GOOSE-2D 512x512 12 70.23
DDRNet ddrnet_class_512 GOOSE-2D 512x512 64 46.53

References

  1. Aharon et al. "Super-Gradients" https://zenodo.org/record/7789328 (2021)
  2. Peng et al. "PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model" https://arxiv.org/abs/2204.02681 (2022)
  3. Pan et al. "Deep Dual-Resolution Networks for Real-Time and Accurate Semantic Segmentation of Traffic Scenes" in IEEE Trans. Intell. Transp. Syst. (2022)