TensorFlow Object Detection for Anki Overdrive Cars

The object-detection-anki-overdrive-cars project includes a trained deep learning model that recognizes items on Anki Overdrive tracks, as well as documentation on how to train your own TensorFlow Object Detection models:

  • With an iOS app, cars and phones can be detected on Anki Overdrive tracks. When an obstacle (a phone) is detected, the cars stop. This project contains a trained TensorFlow Object Detection model based on MobileNet and instructions on how to run it in iOS apps and Jupyter notebooks.

  • The project also contains documentation on how to train models to recognize other objects via TensorFlow Object Detection. To make the setup of the development environment as simple as possible, Docker containers are provided.

The iPad app recognizes the two cars and the phone. Check out the documentation folder for pictures and screenshots.

Setup of the iOS App with the trained Model

To run the iOS app, you need a machine running macOS to compile TensorFlow 1.9.0:

$ brew install automake libtool
$ git clone https://github.com/tensorflow/tensorflow
$ cd tensorflow
$ git checkout v1.9.0
$ export ANDROID_TYPES="-D__ANDROID_TYPES_FULL__"
$ tensorflow/contrib/makefile/build_all_ios.sh
$ git clone https://github.com/nheidloff/object-detection-anki-overdrive-cars.git

I ran into compilation errors; check out the workaround that worked for me.

Open tensorflow.xconfig in a text editor and point TENSORFLOW_ROOT to the folder where you cloned the TensorFlow repo, for example:

TENSORFLOW_ROOT=/Users/nheidloff/Development/tensorflow

To install the dependencies, invoke these commands from the 'ios' directory:

$ pod clean
$ pod install

Open tensorflowiOS.xcworkspace in Xcode, sign the app, connect an iOS device and launch the app.

The app now recognizes the red car, the blue car, and phones.

Training of your own Model

Follow the next steps to train models with your own objects.

1) Development Environment Setup

Invoke the following commands to download all necessary files and to set up a local development environment:

$ git clone https://github.com/nheidloff/object-detection-anki-overdrive-cars.git
$ cd object-detection-anki-overdrive-cars
$ my_project_dir=$(pwd)
$ export PROJECT_DIR=$my_project_dir
$ docker build -t tensorflow-od .
$ cd $PROJECT_DIR/data
$ wget http://download.tensorflow.org/models/object_detection/ssd_mobilenet_v2_coco_2018_03_29.tar.gz
$ tar xvzf ssd_mobilenet_v2_coco_2018_03_29.tar.gz
$ cp -R ${PROJECT_DIR}/data ${PROJECT_DIR}/volume/data

2) Labelling of Images and Creation of TFRecords

Take pictures of the objects you want to recognize and replace the pictures in the volume/data/images directory.

Use labelImg to create annotations. Store the annotation files in the volume/data/annotations directory.
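
labelImg stores each annotation as a Pascal VOC XML file next to your images. A trimmed example (the file name, label and coordinates are illustrative):

<annotation>
  <filename>image1.jpg</filename>
  <size>
    <width>640</width>
    <height>480</height>
    <depth>3</depth>
  </size>
  <object>
    <name>phone</name>
    <bndbox>
      <xmin>100</xmin>
      <ymin>120</ymin>
      <xmax>220</xmax>
      <ymax>260</ymax>
    </bndbox>
  </object>
</annotation>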

Define your objects in volume/data/label_map.pbtxt. See the TensorFlow documentation for more details.
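
A minimal label_map.pbtxt looks like this (a sketch; the ids must start at 1, and the names must match the labels you use when annotating):

item {
  id: 1
  name: 'red'
}
item {
  id: 2
  name: 'blue'
}
item {
  id: 3
  name: 'phone'
}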

From what I've read, these are some best practices for creating the training data:

  • Take/get at least 50 pictures per object.
  • Use a rather small resolution, for example 640 x 480.
  • Use different sizes of your objects and different angles.
  • Use pictures that have multiple objects in them.
  • When marking the objects with labelImg, put the rectangles as closely as possible around the objects.
  • In order to use the following script, use the JPG format.

Creation of TFRecords

TensorFlow Object Detection expects the input data in the TFRecord format. The following script generates two files (train.record and test.record) with an 80% / 20% split of the data.

Before running the script, define your objects in create_tfrecord.py (line 33ff).
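
Your copy of create_tfrecord.py may differ, but the label mapping around line 33 typically looks something like this (a sketch; the label names are illustrative and must match label_map.pbtxt):

# Map annotation label names to the numeric ids from label_map.pbtxt.
def class_text_to_int(row_label):
    if row_label == 'red':
        return 1
    elif row_label == 'blue':
        return 2
    elif row_label == 'phone':
        return 3
    else:
        return None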

Start the Docker container:

$ docker run -v $PROJECT_DIR/volume:/tensorflow/models/research/volume -it --rm tensorflow-od bash

In the Docker container invoke these commands:

$ cd volume
$ python create_tfrecord.py
$ exit

3) Training of the Model

For testing purposes or if you have a machine with TensorFlow GPU support, you can train the models locally.

I've used Kubernetes on the IBM Cloud to train my model since training took multiple hours. You can either use raw Kubernetes or Fabric for Deep Learning (FfDL), which is an open source operating system "fabric" for Deep Learning.

3a) Local Training of the Model

Start the Docker container:

$ docker run -v $PROJECT_DIR/volume:/tensorflow/models/research/volume -it --rm tensorflow-od bash

In the Docker container invoke these commands:

$ cd volume
$ python model_main.py --model_dir=./training --pipeline_config_path=ssd_mobilenet_v2_coco.config --num_train_steps=100  --alsologtostderr
$ exit

3b) Training of the Model on the IBM Cloud

Get a free IBM Cloud lite account (no time restriction, no credit card required).

Install the IBM Cloud CLI.

Create a free/lite Kubernetes cluster.

Install the Kubernetes CLI.

Follow the instructions in the IBM Cloud dashboard to access your cluster from a local terminal.

Build a Docker image. Replace 'nheidloff' with your Docker Hub account name:

$ cd $PROJECT_DIR
$ docker build --file DockerfileCloud -t nheidloff/train-od .
$ docker push nheidloff/train-od

Open the Kubernetes Dashboard and create a new application pointing to 'nheidloff/train-od' (see screenshot).

In the Kubernetes Dashboard open a terminal and invoke these commands (see screenshot):

$ cd volume
$ python model_main.py --model_dir=./models/train --pipeline_config_path=ssd_mobilenet_v2_coco.config --num_train_steps=18000 --alsologtostderr

To copy the files to a local directory, run these commands from a local terminal. Replace 'train-56cfd5b9f-8x6q4' with your pod name.

$ cd $PROJECT_DIR/volume/training
$ kubectl get pods
$ kubectl exec train-56cfd5b9f-8x6q4 -- ls /tensorflow/models/research/volume/models/train
$ kubectl cp default/train-56cfd5b9f-8x6q4:/tensorflow/models/research/volume/models/train .

3c) Training of the Model on the IBM Cloud with FfDL

Get an IBM Cloud account.

Install the IBM Cloud CLI.

Create a Kubernetes cluster.

Install the Kubernetes CLI.

Follow the instructions in the IBM Cloud dashboard to access your cluster from a local terminal.

Install FfDL. You need to follow the instructions in the Developer Guide at this point, since custom Docker images need to be enabled in the source code first.

Create an instance of the Cloud Object Storage service and create HMAC credentials by following these instructions. Make sure to use 'Writer' or 'Manager' access and note the aws_access_key_id and aws_secret_access_key for a later step.

Install and configure the AWS CLI by following these instructions.

Build a Docker image. Replace 'nheidloff' with your Docker Hub account name:

$ cd $PROJECT_DIR
$ docker build --file DockerfileCloud -t nheidloff/train-od .
$ docker push nheidloff/train-od

Create two buckets (use unique names):

$ aws --endpoint-url=http://s3-api.dal-us-geo.objectstorage.softlayer.net --profile ibm_cos s3 mb s3://nh-od-input
$ aws --endpoint-url=http://s3-api.dal-us-geo.objectstorage.softlayer.net --profile ibm_cos s3 mb s3://nh-od-output

Define your Object Storage credentials, bucket names and Docker image name in ffdl/manifest.yml.

At the end of the FfDL installation a URL is provided to open the FfDL web UI. Open this URL in a browser. It looks similar to "http://169.48.99.155:30312/#/login?endpoint=169.48.99.154:32293&username=test-user".

Upload ffdl/manifest.yml and ffdl/model.zip and start the training.

To copy the output files to a local directory, run these commands from a local terminal. Replace 'training-AmbQ5RAmR' with your training id.

$ cd $PROJECT_DIR/volume
$ mkdir ffdl-output
$ cd ffdl-output
$ aws --endpoint-url=http://s3-api.dal-us-geo.objectstorage.softlayer.net --profile ibm_cos s3 sync s3://nh-od-output .
$ cp -R training-AmbQ5RAmR/training $PROJECT_DIR/volume/training

4) Save the Model

The training generates checkpoint files in the 'volume/training' directory. These files need to be converted into what is called a frozen graph file: a single .pb file that contains both the graph definition and the trained weights. That file is used later in the iOS app and the Python notebook.

Start the Docker container:

$ docker run -v $PROJECT_DIR/volume:/tensorflow/models/research/volume -it --rm tensorflow-od bash

In the Docker container invoke the following commands. Replace '100' with the number of your training steps (your last checkpoint):

$ cd volume
$ python export_inference_graph.py \
    --input_type=image_tensor \
    --pipeline_config_path=ssd_mobilenet_v2_coco.config \
    --trained_checkpoint_prefix=./training/model.ckpt-100 \
    --output_directory=frozen-graph/
$ exit

5) Testing of the Model

Before using the model in the iOS app, you can test it in a Python notebook. You can also view the training results via TensorBoard.

Notebook

Add two test images to volume/testing/images and name them 'image1.jpg' and 'image2.jpg'.

Invoke the following commands:

$ cp $PROJECT_DIR/volume/frozen-graph/frozen_inference_graph.pb $PROJECT_DIR/volume/testing/
$ cp $PROJECT_DIR/volume/data/label_map.pbtxt $PROJECT_DIR/volume/testing/
$ docker build --file DockerfileNotebook -t tensorflow-od-test .
$ docker run -v $PROJECT_DIR/volume:/tensorflow/models/research/volume -it -p 8888:8888 --rm tensorflow-od-test

Copy the URL with the token from the terminal and open it in a web browser, replacing the host with 'localhost'. Navigate to the folder 'object_detection', open the notebook object_detection_anki.ipynb and run all cells.
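
Under the hood, the notebook loads the frozen graph and feeds the test images to it. If you want to do the same in your own script, the usual TensorFlow 1.x pattern looks like this (a sketch; the path assumes the copy commands above, and the tensor names are the defaults produced by export_inference_graph.py with input_type=image_tensor):

import tensorflow as tf

# Load the exported frozen graph (TensorFlow 1.x).
detection_graph = tf.Graph()
with detection_graph.as_default():
    graph_def = tf.GraphDef()
    with tf.gfile.GFile('volume/testing/frozen_inference_graph.pb', 'rb') as f:
        graph_def.ParseFromString(f.read())
        tf.import_graph_def(graph_def, name='')

with tf.Session(graph=detection_graph) as sess:
    image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
    boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
    scores = detection_graph.get_tensor_by_name('detection_scores:0')
    classes = detection_graph.get_tensor_by_name('detection_classes:0')
    # Feed a uint8 batch shaped [1, height, width, 3]:
    # out = sess.run([boxes, scores, classes], feed_dict={image_tensor: image_np})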

TensorBoard

To use TensorBoard, invoke the following commands:

$ cd $PROJECT_DIR/volume/training
$ docker run -d -p 6006:6006 -v $(pwd):/logs --name my-tf-tensorboard volnet/tensorflow-tensorboard

Open http://localhost:6006 in a browser.

6) Setup of the iOS App

See the steps above for how to set up the iOS app with Xcode.

In order to use your own model, copy the following two files:

$ cp $PROJECT_DIR/volume/frozen-graph/frozen_inference_graph.pb $PROJECT_DIR/ios/models/
$ cp $PROJECT_DIR/volume/data/label_map.pbtxt $PROJECT_DIR/ios/models/label_map.txt

Sign the app, connect an iOS device and launch the app.

Setup of the Demo to stop Cars when Obstacles are detected

In order to stop the Anki Overdrive cars when phones are put on the track, you need to set up additional components, especially the Node.js controller and the Watson IoT Platform. To do this, follow the instructions from my project node-mqtt-for-anki-overdrive.

See the documentation folder for a diagram of how the demo works.

You need to register a new device type 'camera' and a new device 'ios' in your IoT organization. After this you'll receive an authentication token.

Open the file TensorflowGraph.mm in Xcode. Uncomment lines 63-67 and 301-304 (search for 'MQTT').

Update lines 63ff with your own IoT configuration.
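
For context: the app connects to the Watson IoT Platform as a device and publishes detection events via MQTT. This is roughly the equivalent logic in Python (a sketch using paho-mqtt; the app itself does this in Objective-C++, and the organization id, token, event name and payload fields are illustrative):

import json
import paho.mqtt.client as mqtt

ORG = "myorg"            # your Watson IoT organization id
DEVICE_TYPE = "camera"   # the device type registered above
DEVICE_ID = "ios"        # the device registered above

client = mqtt.Client(client_id="d:%s:%s:%s" % (ORG, DEVICE_TYPE, DEVICE_ID))
client.username_pw_set("use-token-auth", "YOUR_AUTH_TOKEN")
client.connect("%s.messaging.internetofthings.ibmcloud.com" % ORG, 1883)

# Publish a detection event; the Node-RED flow stops the cars when it sees one.
client.publish("iot-2/evt/detection/fmt/json",
               json.dumps({"d": {"label": "phone", "confidence": 0.92}}))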

Redeploy the app to your iOS device.

Rather than using the Node-RED flow of the original project, deploy the version from this project's node-red-flow folder to your Node-RED instance.
