Install Object Detection In 5 Minutes On Ubuntu 18.10

Published by: Emil // Date 14.5.2018 // Views: 916 // Add to Twitter :: Facebook

Image

Nowadays Image Recognition became a realy easy task, thanks to some Open Source Libraries like YoloV3 or TensorFlow. In five minutes it is possible to install and use such a library, if your project requires it as feature.

I recently tested YoloV2, YoloV3 and TensorFlow on Ubuntu 18.10, both ideal for classifying images and must say, both are working out of the box, without any special installation requirements and are enough fast even without GPU power.


Yolo Installation


For Yolo Installation is required to clone the git repo and to compile it after.



# Install Yolo Package

git clone https://github.com/pjreddie/darknet.git
cd darknet
make

# Download weights files
wget https://pjreddie.com/media/files/yolov3.weights
wget https://pjreddie.com/media/files/yolo.weights
wget https://pjreddie.com/media/files/yolov2.weights
wget https://pjreddie.com/media/files/yolo-tiny.weights


# Run script Using A Pre-Trained Model
./darknet detect cfg/yolov3.cfg yolov3.weights data/dog.jpg -out prediction
./darknet detector test cfg/coco.data cfg/yolov3.cfg yolov3.weights data/dog.jpg

The gist file:
https://gist.github.com/maranemil/0886b01126130fcd3a41870e9cd10e9e



Tensorflow Installation


For Tensorflow, before starting the clone part, be sure that you have php or pip3 installed.




# Install Dependency Packages

sudo apt install python3-pip -y
pip3 install tensorflow
pip3 install numpy
pip3 install pandas

# Clone Models Repo
git clone https://github.com/tensorflow/models

# Run script
cd models/tutorials/image/imagenet


The gist file:
https://gist.github.com/maranemil/8eccb0d85b962e6e3e0f7f0f078f3611


Conclusion:



Performance is strong related with the size of input. If you use big pictures than it will take longer for the detection process. My recommnedation is to batch all pictures to a low resolution, something like 320x240 or something similar, if the amount of pictures is really huge. For a 1280x720 image it would take 15 seconds with Yolo2 to detect objects on Intel i5 460M Processor with 4 threads. Probably on GPU works faster.

The pre-trained models are good for general purpose, Tensorflow models are better than Yolo models, but for fun you can use both of them and combine the results of both in a word matrix. If you need somethings special to detect, than you have to train your own model for that.




Resources:

TensorFlow (C++, Python)
https://www.tensorflow.org/tutorials/image_recognition
https://github.com/tensorflow
https://github.com/tensorflow/models


YoloV3 (C and CUDA)
https://pjreddie.com/darknet/yolo/
https://github.com/pjreddie/darknet/