GAC3D: Improving monocular 3D object detection with ground-guide model and adaptive convolution

Introduction

This work proposes GAC3D, a novel approach to monocular 3D object detection that employs a ground plane model built on geometric constraints to improve the results of a deep learning-based detector. Furthermore, we introduce a depth adaptive convolution that replaces the traditional 2D convolution to deal with the divergent context of image features, leading to a significant improvement in both training convergence and testing accuracy. We evaluate our approach on the KITTI 3D Object Detection benchmark, where it outperforms existing monocular methods.
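To make the idea of a depth adaptive convolution more concrete, here is a minimal pure-Python sketch in the spirit of pixel-adaptive convolution (PAC). It is not the implementation found in src. It assumes that each filter tap is rescaled by a Gaussian of the depth difference between the center pixel and its neighbor; the function name `depth_adaptive_conv2d`, the `sigma` parameter, and the zero padding are illustrative assumptions.

```python
import math


def depth_adaptive_conv2d(feature, depth, weight, sigma=1.0):
    """Single-channel depth-guided convolution (illustrative sketch only).

    feature, depth: H x W nested lists of floats.
    weight: k x k nested list with odd k (the shared spatial filter).
    Each tap is scaled by exp(-0.5 * ((d_center - d_neighbor) / sigma) ** 2),
    so neighbors at a very different depth contribute less.
    """
    h, w = len(feature), len(feature[0])
    r = len(weight) // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for dy in range(-r, r + 1):
                for dx in range(-r, r + 1):
                    ny, nx = y + dy, x + dx
                    if not (0 <= ny < h and 0 <= nx < w):
                        continue  # zero padding: out-of-image taps add nothing
                    diff = depth[y][x] - depth[ny][nx]
                    adapt = math.exp(-0.5 * (diff / sigma) ** 2)
                    acc += adapt * weight[dy + r][dx + r] * feature[ny][nx]
            out[y][x] = acc
    return out
```

With a constant depth map the adapting factor is 1 everywhere and the function reduces to an ordinary zero-padded 2D convolution; a sharp depth discontinuity suppresses the taps that fall on the other side of it.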

The structure of this repo is as follows:

GAC3D
├── kitti_format # for training and evaluating on trainval set
├── kitti_test # for testing on official test set
├── src # source code for implementing the framework
├── trt_src # source code for deploying the framework on NVIDIA Jetson board
└── readme # readme files

Requirements

Our framework is implemented and tested with Ubuntu 18.04, CUDA 10.2, cuDNN 7.5.1, Python 3.6, PyTorch 1.7, and a single NVIDIA RTX 2080 GPU.
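As a convenience, the versions listed above can be compared against the local environment before installing anything else. The sketch below is not part of the repository; the names `TESTED`, `parse_version`, `version_mismatches`, and `detect_versions` are hypothetical, and only Python, PyTorch, and CUDA are checked (cuDNN and the GPU model are left out).

```python
import platform

# major.minor versions the README reports as tested
TESTED = {"python": (3, 6), "torch": (1, 7), "cuda": (10, 2)}


def parse_version(text):
    """Reduce a version string such as '1.7.1+cu102' to (major, minor)."""
    parts = []
    for piece in text.split("+")[0].split(".")[:2]:
        digits = "".join(ch for ch in piece if ch.isdigit())
        parts.append(int(digits) if digits else 0)
    while len(parts) < 2:
        parts.append(0)
    return tuple(parts)


def version_mismatches(found):
    """Return human-readable notes for entries that differ from TESTED.

    found maps a name in TESTED to a version string, or None if missing.
    """
    notes = []
    for name, expected in TESTED.items():
        wanted = "{}.{}".format(*expected)
        actual = found.get(name)
        if actual is None:
            notes.append("{}: not found (tested with {})".format(name, wanted))
        elif parse_version(actual) != expected:
            notes.append("{}: found {} (tested with {})".format(name, actual, wanted))
    return notes


def detect_versions():
    """Collect local versions; PyTorch is optional at this point."""
    found = {"python": platform.python_version(), "torch": None, "cuda": None}
    try:
        import torch
        found["torch"] = torch.__version__
        found["cuda"] = torch.version.cuda  # None on CPU-only builds
    except ImportError:
        pass
    return found


if __name__ == "__main__":
    for note in version_mismatches(detect_versions()):
        print(note)
```

A mismatch does not necessarily mean the framework will fail; it only flags a difference from the configuration that was tested.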

Dataset preparation

Download the official KITTI 3D object detection dataset and organize the downloaded files as follows:

GAC3D
├── kitti_format
│   ├── data
│   │   ├── kitti
│   │   │   ├── annotations
│   │   │   │   ├── kitti_train.json
│   │   │   │   ├── kitti_val.json
│   │   │   │   ├── kitti_trainval.json
│   │   │   ├── calib /000000.txt .....
│   │   │   ├── image(left[0-7480] right[7481-14961]) /000000.png .....
│   │   │   ├── label /000000.txt .....
│   │   │   ├── train.txt val.txt trainval.txt
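Before training, it can help to confirm that the files are where the layout above expects them. The following is a small sketch, not a script shipped with this repo; `EXPECTED` and `missing_entries` are hypothetical names, and the check only looks at the top-level entries under kitti_format/data/kitti, not at individual images or labels.

```python
from pathlib import Path

# Entries expected directly under kitti_format/data/kitti, per the tree above
EXPECTED = [
    "annotations/kitti_train.json",
    "annotations/kitti_val.json",
    "annotations/kitti_trainval.json",
    "calib",
    "image",
    "label",
    "train.txt",
    "val.txt",
    "trainval.txt",
]


def missing_entries(kitti_root):
    """Return the relative paths from EXPECTED that do not exist on disk."""
    root = Path(kitti_root)
    return [rel for rel in EXPECTED if not (root / rel).exists()]


if __name__ == "__main__":
    # Path is relative to the repository root (an assumption about where
    # the script is run from).
    for rel in missing_entries("kitti_format/data/kitti"):
        print("missing:", rel)
```

An empty result means every listed file or folder was found; it does not validate the contents of the annotation files.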

Installation

For training and testing on a desktop or server machine: INSTALL.md
For testing on an embedded device (we use the NVIDIA Jetson Xavier NX): INSTALL_JETSON.md

Demo

Please refer to GETTING_STARTED.md for details on how to use this project.

NVIDIA Jetson deployment

We deployed and tested our framework on an NVIDIA Jetson Xavier NX with JetPack 4.5.1. Follow the steps below to run the framework on a Jetson board:

Install the required packages on the Jetson: INSTALL_JETSON.md
Build the TensorRT source and additional plugins: BUILD_TENSORRT.md
Run model inference on the Jetson: DEMO_JETSON.md

Acknowledgement

Portions of the code are borrowed from:

CenterNet
RTM3D
DCNv2 (Deformable Convolutions)
PAC (Pixel Adaptive Convolution)
kitti_eval (KITTI dataset evaluation)
