More Than YOLO

TensorFlow & Keras & Python

YOLOv3, YOLOv3-tiny, YOLOv4, YOLOv4-tiny

[Unofficial] YOLOv4-tiny, YOLOX

Docker

The Dockerfiles embed every requirement for this project.

GPU dedicated Dockerfile

The GPU dedicated Dockerfile relies on a BASE_IMAGE (default: nvidia/cuda:11.2.2-cudnn8-runtime-ubuntu20.04) which depends on the NVIDIA GPUs and the CUDA driver which is installed on the Host OS.

For further details, please refer to Installing Docker and The Docker Utility Engine for NVIDIA GPUs.

General

The following parameters have to be used at the docker run:

--env=DISPLAY: Exports DISPLAY env variable for X server
--env=QT_X11_NO_MITSHM=1: Prevents QT-based applications from using X’s shared memory, which Docker isolation blocks.
--volume=/tmp/.X11-unix:/tmp/.X11-unix:rw: Mounts the X11 socket

Note: Running xhost +local:docker on the Host OS allows docker to access X server.

Optional parameters:

--volume=${env:HOME}/.gitconfig:/etc/gitconfig: Share Git config
--gpus all: Share the GPUs with docker. Should be used only if you plan to use the GPU dedicated docker image.

0. Please Read Source Code for More Details

Official weight files can be downloaded from https://github.com/AlexeyAB/darknet/releases.

1. Samples

1.1 Data Annotations

The data can be annotated with labelImg.

labelImg.mp4

1.2 Data File

A special data format is used:

path/to/image1 x1,y1,x2,y2,label x1,y1,x2,y2,label
path/to/image2 x1,y1,x2,y2,label
...

Firstly, the labelImg annotated data have to be converted using:

labelImg_to_yolox.mp4

python3 tools/voc_to_yolox/voc_to_yolox_converter.py

1.3 K-Means

kmeans.py can be used to computed the anchor boxes

python3 kmeans.py --dataset_path=example/hole_detector/data/train.txt --image_size=416

Output:

Dataset has 52 annotations
K-Means Result:
5,4 6,6 7,7 7,5 8,7 9,9

1.4 Configure

# yolox/example/hole_detector/cfgs/hole.yaml
yolo:
  type: "yolox" # must be 'yolov3', 'yolov3_tiny', 'yolov4', 'yolov4_tiny' ‘unofficial_yolov4_tiny’ or 'yolox'.
  iou_threshold: 0.5
  score_threshold: 0.45
  max_boxes: 100
  strides: "32,16"
  anchors: "5,4 5,5 6,6 7,5 7,7 9,8" # From kmeans.py
  mask: "3,4,5 0,1,2"
  name_path: "./example/hole_detector/data/hole.names"

train:
  label: "hole_detector"
  anno_path: "./example/hole_detector/data/train.txt"
  image_size: "224,256,288,320,352,384,416" # "416" for single mini batch size, "352,384,416,448,480" for Dynamic mini batch size.

  train_times: 20 # Repeat N times the dataset (using data augmentation)

  batch_size: 4
  init_weight_path: "./ckpts/yolov4-tiny.h5" # From scratch if empty ("")
  save_weight_path: "./ckpts"

  loss_type: "CIoU+FL" # Must be "L2", "DIoU", "GIoU", "CIoU" or something like "L2+FL" for focal loss

  mosaic: true
  label_smoothing: false
  normal_method: true

  ignore_threshold: 0.5

test:
  anno_path: "./example/hole_detector/data/valid.txt"
  image_size: "416"
  batch_size: 1
  init_weight_path: "./example/hole_detector/models/tiny_yolox.h5"

Note1: Loss types are described in iou.py.

Note2: Other examples are available in cfgs.

1.5 Inference

A Simple Script for Images

python3 simple_detector.py --image_folder=example/hole_detector/data/Images/ --model=example/hole_detector/models/tiny_yolox --class_names=example/hole_detector/data/hole.names

simple_detector.mp4

A Script for Video, Device or Image

Only support mp4, avi, device id, rtsp, png, jpg (Based on OpenCV)

python3 detector.py --config=example/hole_detector/cfgs/hole.yaml --media=example/hole_detector/data/Images/frame0007.jpg

2. Train

python3 train.py --config=example/hole_detector/cfgs/hole.yaml

The trained models and weights will be located at yolox/ckpts.

mAP-0.XXXX files represent the full model in Keras format. Those models can be used with simple_detector.py.
mAP-0.XXXX.h5 files contain only the weights.

Note: The trained model can be converted into a frozen graph or a ONNX model thanks to model_converter.py. For further detail, please use python3 model_converter.py --help.

3. Evaluation

python3 eval.py --config=example/hole_detector/cfgs/hole.yaml

Note: In the configuration file (.yaml file), the weights and the annotation which are evaluated here are located in test:

test:
  anno_path: "./example/hole_detector/data/valid.txt"
  image_size: "416"
  batch_size: 1
  init_weight_path: "./ckpts/tmp/hole_detector/202202151131/mAP-0.5998.h5"

4. Experiment

4.1 Speed

i7-9700F+16GB

Model	416x416	512x512	608x608
YOLOv3	219 ms	320 ms	429 ms
YOLOv3-tiny	49 ms	63 ms	78 ms
YOLOv4	344 ms	490 ms	682 ms
YOLOv4-tiny	51 ms	66 ms	83 ms
Unofficial-YOLOv4-tiny	64 ms	86 ms	110 ms
YOLOX	67 ms	83 ms	104 ms

i7-9700F+16GB / RTX 2070S+8G

Model	416x416	512x512	608x608
YOLOv3	59 ms	66 ms	83 ms
YOLOv3-tiny	28 ms	30 ms	33 ms
YOLOv4	73 ms	74 ms	91 ms
YOLOv4-tiny	30 ms	32 ms	35 ms
Unofficial-YOLOv4-tiny	30 ms	31 ms	34 ms
YOLOX	42 ms	45 ms	50 ms

4.2 Logs

Augmentations

Name	Abbr
Standard Method	SM
Dynamic mini batch size	DM
Label Smoothing	LS
Focal Loss	FL
Mosaic	M
Warm-up LR	W
Cosine Annealing LR	CA

Standard Method Package includes Flip left and right, Crop and Zoom(jitter=0.3), Grayscale, Distort, Rotate(angle=7).

YOLOv3-tiny(Pretrained on COCO; Trained on VOC)

SM	DM	LS	FL	M	Loss	AP	AP@50	AP@75
✔					L2	26.6	61.8	17.2
✔	✔				L2	27.3	62.4	17.9
✔	✔	✔			L2	26.7	61.7	17.1
✔	✔				CIoU	30.9	64.2	25.0
✔	✔		✔		CIoU	32.3	65.7	27.6
✔	✔		✔	✔	CIoU

YOLOv3(TODO; Pretrained on COCO; Trained on VOC; only 15 epochs)

SM	DM	LS	FL	M	Loss	AP	AP@50	AP@75
✔	✔		✔		CIoU	46.5	80.0	49.0
✔	✔		✔	✔	CIoU

YOLOv4-tiny(TODO; Pretrained on COCO; Trained on VOC)

SM	DM	LS	FL	M	Loss	AP	AP@50	AP@75
✔	✔		✔		CIoU	35.1	70.2	30.0
✔	✔		✔	✔	CIoU

YOLOv4(TODO; Pretrained on COCO; Trained on VOC)

SM	DM	LS	FL	M	Loss	AP	AP@50	AP@75
✔	✔		✔		CIoU
✔	✔		✔	✔	CIoU

Unofficial-YOLOv4-tiny(TODO; Pretrained on COCO, part of YOLOv3-tiny weights; Trained on VOC)

SM	DM	LS	FL	M	Loss	AP	AP@50	AP@75
✔	✔		✔		CIoU	35.0	65.7	33.8
✔	✔		✔	✔	CIoU

YOLOX(TODO; Pretrained on COCO, part of YOLOv4-tiny weights; Trained on VOC)

SM	DM	LS	FL	M	Loss	AP	AP@50	AP@75
✔	✔		✔		CIoU	40.6	72.2	40.3
✔	✔		✔	✔	CIoU

How to use a python-based inference on a C++ environnement

./tutorials/cpp_integration is a mini python/c++ project which used a tf/keras trained model in a C++ environnement. Here, we use data from the hole detector example.

Build and run:

cd tutorials/
mkdir -p cpp_integration_build/ && cd cpp_integration_build/
cmake -G Ninja ../cpp_integration/
ninja && ./tutorial_cpp_integration --model ../../example/hole_detector/models/tiny_yolox --img ../../example/hole_detector/data/Images/frame0000.jpg

The first number represents the class id and the second number ([%]) represents the detection score.

Name		Name	Last commit message	Last commit date
Latest commit History 103 Commits
cfgs		cfgs
core		core
data		data
docker_images		docker_images
example/hole_detector		example/hole_detector
misc		misc
tools		tools
tutorials/cpp_integration		tutorials/cpp_integration
.gitignore		.gitignore
README.md		README.md
detector.py		detector.py
eval.py		eval.py
kmeans.py		kmeans.py
simple_detector.py		simple_detector.py
train.py		train.py

lagadic/yolox

Folders and files

Latest commit

History

Repository files navigation