Efficient and Robust Semantic Mapping for Indoor Environments

This repository contains the code for our paper "Efficient and Robust Semantic Mapping for Indoor Environments" (IEEE Xplore, arXiv).


(A YouTube video demonstrating the approach is linked in the repository.)

License and Citations

The source code and the network weights are published under BSD 3-Clause license, see license file for details.

If you use the source code or the network weights, please cite the following paper:

Seichter, D., Langer, P., Wengefeld, T., Lewandowski, B., Höchemer, D., Gross, H.-M. Efficient and Robust Semantic Mapping for Indoor Environments in IEEE International Conference on Robotics and Automation (ICRA), pp. 9221-9227, 2022.

@inproceedings{semanticndtmapping2022icra,
  title={Efficient and Robust Semantic Mapping for Indoor Environments},
  author={Seichter, Daniel and Langer, Patrick and Wengefeld, Tim and Lewandowski, Benjamin and H{\"o}chemer, Dominik and Gross, Horst-Michael},
  booktitle={IEEE International Conference on Robotics and Automation (ICRA)},
  year={2022},
  volume={},
  number={},
  pages={9221-9227}
}

@article{semanticndtmapping2022arXiv,
  title={Efficient and Robust Semantic Mapping for Indoor Environments},
  author={Seichter, Daniel and Langer, Patrick and Wengefeld, Tim and Lewandowski, Benjamin and H{\"o}chemer, Dominik and Gross, Horst-Michael},
  journal={arXiv preprint arXiv:2203.05836},
  year={2022}
}

Note that the preprint was accepted for publication at the IEEE International Conference on Robotics and Automation (ICRA) 2022.

Setup

  1. Clone repository:

    # do not forget the '--recursive' ;)
    git clone --recursive https://github.com/TUI-NICR/semantic-mapping.git
    
    cd /path/to/this/repository
  2. Set up anaconda environment including all dependencies:

    # option 1: create conda environment from YAML file
    conda env create -f semantic_mapping.yaml
    conda activate semantic_mapping
    
    # option 2: create new environment (see last tested versions)
    conda create -n semantic_mapping python==3.8.12 anaconda==2021.11
    conda activate semantic_mapping
    pip install onnx==1.11.0
    pip install opencv-python==4.2.0.34
    pip install tqdm==4.62.3
    # ONNXRuntime with CUDA support
    conda install -c conda-forge cudnn==8.2.1.32
    pip install onnxruntime-gpu==1.11.0
    
    
    # finally, install our package for preparing and using the Hypersim dataset
    pip install ./lib/nicr-scene-analysis-datasets[with_preparation]
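
    To verify that the CUDA-enabled ONNXRuntime was installed correctly, a quick check such as the following may help (a minimal sketch, not part of this repository):

    # minimal sketch: check the installed ONNXRuntime version and whether
    # the CUDA execution provider is available
    import onnxruntime as ort

    print(ort.__version__)                # last tested: 1.11.0
    print(ort.get_available_providers())  # should include 'CUDAExecutionProvider'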

Usage

  1. Prepare the Hypersim dataset:

    # download and extract raw dataset (2x ~1.8TB)
    HYPERSIM_DOWNLOAD_PATH='./datasets/hypersim_preparation'
    wget https://raw.githubusercontent.com/apple/ml-hypersim/6cbaa80207f44a312654e288cf445016c84658a1/code/python/tools/dataset_download_images.py
    python dataset_download_images.py --downloads_dir $HYPERSIM_DOWNLOAD_PATH
    
    # prepare dataset (~157.5 GB, extract required data, convert to our format, blacklist some scenes/trajectories)
    python -m nicr_scene_analysis_datasets.datasets.hypersim.prepare_dataset \
        ./datasets/hypersim \
        $HYPERSIM_DOWNLOAD_PATH \
        --additional-subsamples 2 5 10 20 \
        --multiprocessing
    
    # just in case you want to delete the downloaded raw data (2x ~1.8TB)
    rm -rf $HYPERSIM_DOWNLOAD_PATH
    

    For further details, we refer to the documentation of our nicr-scene-analysis-datasets python package.
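
    To quickly verify the preparation, the sketch below simply lists the top-level content of the prepared dataset directory; the exact folder layout is defined by the nicr-scene-analysis-datasets package.

    # minimal sketch: list the top-level content of the prepared dataset
    # (the exact layout is documented in nicr-scene-analysis-datasets)
    from pathlib import Path

    dataset_path = Path('./datasets/hypersim')
    for entry in sorted(dataset_path.iterdir()):
        print(entry)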

  2. Download pretrained model:
    We provide the weights of our selected ESANet-R34-NBt1D (enhanced ResNet34-based encoder utilizing the Non-Bottleneck-1D block) trained on the Hypersim dataset. To ease both application and deployment, we removed all dependencies (PyTorch, ...) and provide the weights in ONNX format.

    Download the model and extract it to ./trained_models, or use:

    pip install gdown    # last tested: 4.4.0
    gdown 1zUxSqq4zdC3yQ4RxiHvTh8CX7-115KUg --output ./trained_models/
    tar -xvzf ./trained_models/model_hypersim.tar.gz -C ./trained_models/
    

    The model was selected based on the mean intersection over union (mIoU) on the validation split: 0.4591184410660463 at epoch 498. On the test split, the model achieves a mIoU of 0.41168890871760977. Note that, similar to other approaches, we only evaluate up to a reasonable maximum distance of 20 m from the camera. For more details, see evaluate.py.
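
    Since the weights are provided as an ONNX file, they can be loaded without PyTorch. The sketch below only shows how to create an ONNXRuntime session and inspect the expected inputs and outputs; the actual preprocessing and inference code is in predict.py.

    # minimal sketch: load the ONNX model and inspect its inputs/outputs;
    # see predict.py for the actual preprocessing and inference
    import onnxruntime as ort

    session = ort.InferenceSession(
        './trained_models/model_hypersim.onnx',
        providers=['CUDAExecutionProvider', 'CPUExecutionProvider'],
    )
    for inp in session.get_inputs():
        print('input: ', inp.name, inp.shape, inp.type)
    for out in session.get_outputs():
        print('output:', out.name, out.shape, out.type)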

  3. Extract predicted semantic segmentation:

    # use default paths (~74.3GB for topk with k=3)
    python predict.py \
    	--onnx-filepath ./trained_models/model_hypersim.onnx \
    	--dataset-path ./datasets/hypersim \
    	--dataset-split test \
    	--topk 3 \
    	--output-path ./datasets/hypersim_predictions
    
    # for more details, see:
    python predict.py --help

    For the example above, the predicted segmentations are stored at ./datasets/hypersim_predictions/test/. See the semantic_40_topk subfolder for the predicted top-k segmentation outputs, and semantic_40/ and semantic_40_colored/ for the predicted top-1 labels and their colored visualization.
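
    The stored label images can be read back with OpenCV, for example. The sketch below uses hypothetical example file paths that mirror the scene/camera/frame structure shown in the next step; adjust them to existing files.

    # minimal sketch: load a predicted top-1 segmentation and its colored
    # visualization; the file paths below are hypothetical examples
    import cv2

    base = './datasets/hypersim_predictions/test'
    labels = cv2.imread(f'{base}/semantic_40/ai_001_010/cam_00/0000.png',
                        cv2.IMREAD_UNCHANGED)   # per-pixel class indices
    colored = cv2.imread(f'{base}/semantic_40_colored/ai_001_010/cam_00/0000.png',
                         cv2.IMREAD_COLOR)      # BGR visualization
    print(labels.shape, labels.dtype)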

  4. Run your semantic mapping experiments and store the results with the following folder structure:

    path/to/results/
    └── test
    	├── results1
    	│   ├── ai_001_010
    	│   │   ├── cam_00
    	│   │   │   ├── 0000.png
    	│   │   │   ├── ...
    	├── results2
    	│   ├── ai_001_010
    	│   │   ├── cam_00
    	│   │   │   ├── 0000.png
    	│   │   │   ├── ...
    

    You may have a look at ./lib/nicr-scene-analysis-datasets/nicr_scene_analysis_datasets/mira/_hypersim_reader.py for a starting point. This class shows how the Hypersim dataset is processed in our pipelines.
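
    If your pipeline produces label images in memory, writing them in the structure above could look like the following sketch; the result name, scene, camera, and frame id are placeholders.

    # minimal sketch: store a label image following the folder structure
    # above; all identifiers and the image content below are placeholders
    from pathlib import Path
    import cv2
    import numpy as np

    results_path = Path('path/to/results/test/results1')
    scene, camera, frame = 'ai_001_010', 'cam_00', '0000'

    labels = np.zeros((768, 1024), dtype=np.uint8)   # your per-pixel class labels

    output_file = results_path / scene / camera / f'{frame}.png'
    output_file.parent.mkdir(parents=True, exist_ok=True)
    cv2.imwrite(str(output_file), labels)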

  5. Run evaluation:

    # use default paths
    python evaluate.py \
    	--dataset-path ./datasets/hypersim \
    	--dataset-split test \
    	--predictions-path ./datasets/hypersim_predictions
    	[--result-paths path/to/results/test/results1 path/to/results/test/results2]
    
    # for more details, see:
    python evaluate.py --help

    For the predicted segmentation of our ONNX model, you should obtain measures similar to:

    miou_gt_masked: 0.41168890871760977
    mean_pacc_gt_masked: 0.5683601556433829
    invalid_ratio: 0.0
    invalid_mean_ratio_gt_masked: 0.0
    vwmiou_gt_masked: 0.41168890871760977
    vwmean_pacc_gt_masked: 0.5683601556433829
    

    Check the results.json created in the predictions folder for more measures (e.g., ./datasets/hypersim_predictions/test/semantic_40/results.json).
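
    The sketch below shows one way to print the content of such a results.json file; the exact set of measures depends on the evaluation run.

    # minimal sketch: print all measures stored in a results.json file
    import json

    with open('./datasets/hypersim_predictions/test/semantic_40/results.json') as f:
        results = json.load(f)

    for key, value in sorted(results.items()):
        print(f'{key}: {value}')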