In this project we explored the 3D perception of various objects using a variety of algorithms learned in the perception lectures.
Rubric Points
Here I will consider the rubric points individually and describe how I addressed each point in my implementation.
1. Provide a Writeup / README that includes all the rubric points and how you addressed each one. You can submit your writeup as markdown or pdf.
You're reading it!
The code of my work on Exercise 1 can be found here. The comments in the code provide an explanation of what I was doing.
The code of my work on Exercise 2 can be found here. The comments in the code provide an explanation of what I was doing. I moved the functions used in `template.py` out into the Python file `segmentation.py`. This proved to be useful, as we were using the same functionality in all following exercises: I just had to call the functions from my main code instead of copying the functionality over and over again.
The code of my work on Exercise 3 can be found here. The comments in the code provide an explanation of what I was doing. I moved the functions used in `object_recognition.py` out into the Python file `segmentation.py`.
For all three tabletop setups (`test*.world`), perform object recognition, then read in the respective pick list (`pick_list_*.yaml`). Next, construct the messages that would comprise a valid `PickPlace` request and output them to `.yaml` format.
I experimented with various numbers of training cycles for my SVM. I noticed that it was quite hard to improve the accuracy beyond ~0.90 in the normalized confusion matrix. The set I used in the end for the project took 100 cycles and looks like this:
I tried to get closer to 100% accuracy. In an extreme case, I ran 5000 cycles. It worked and my accuracy got to 97-99%, which was really impressive. The downside, on the other hand, was that it produced a 209 MB training-data file which took a very long time to load in the project and therefore was not very useful. Another downside was that I couldn't add such a huge file to the git repository for the project submission. The small accuracy gain wasn't worth the bloat it produced.
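For reference, training and scoring roughly followed this pattern. This is a minimal sketch using scikit-learn; the file names (`training_set.sav`, `model.sav`) and the exact variable layout are placeholders rather than the exact ones from my scripts:

```python
import pickle
import numpy as np
from sklearn import svm
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import cross_val_score, cross_val_predict
from sklearn.metrics import confusion_matrix

# Load the captured features: one (feature_vector, label) pair per capture cycle and object.
training_set = pickle.load(open('training_set.sav', 'rb'))
X = np.array([features for features, label in training_set])
encoder = LabelEncoder()
y = encoder.fit_transform([label for features, label in training_set])

clf = svm.SVC(kernel='linear')

# Cross-validated accuracy (the ~0.90 figure mentioned above).
scores = cross_val_score(clf, X, y, cv=5)
print('Accuracy: {:.2f} (+/- {:.2f})'.format(scores.mean(), 2 * scores.std()))

# Normalized confusion matrix.
predictions = cross_val_predict(clf, X, y, cv=5)
cm = confusion_matrix(y, predictions).astype(float)
print(cm / cm.sum(axis=1)[:, np.newaxis])

# Fit on the full set and save classifier plus label encoder for the project node.
clf.fit(X, y)
pickle.dump({'classifier': clf, 'encoder': encoder}, open('model.sav', 'wb'))
```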
World 1 YAML file: `output_1.yaml`. Object detection rate: 3/3. All nice and shiny in world 1!
World 2 YAML file: `output_2.yaml`. Object detection rate: 4/5. The algorithm kept mistaking the glue for soap2.
World 3 YAML file: `output_3.yaml`. Object detection rate: 7/8. The algorithm kept mistaking the glue for soap2.
The code of my work on the project can be found here. I moved the functions used in `project_template.py` out into the Python file `perception_functions.py`.
The comments in the code follow along with what I was doing, as usual. In short:
The `pcl_callback` function is called by the ROS subscriber tied to the virtual RGBD camera and gets a point cloud as input data. The following steps were taken to detect objects from this point cloud (a sketch of how the callback is wired up follows the list):
- Convert the ROS-PointCloud data format into something the pcl library can work with
- Remove noise
- Downsample the data to reduce the number of points
- Remove unnecessary data from point cloud to ease object detection by applying prior knowledge of object location
- Remove the table from the scene
- Identify clusters of points to be able to distinguish individual objects
- Apply a different color to the individual objects (useful for debugging in RViz)
- Publish objects, table and coloured objects to ROS
- Extract color information as an additional feature guiding the object detection
- Decide which object was found
- Apply a label to the object
- Calculate the centroid of the object
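Below is a rough sketch of how `pcl_callback` is wired up. The topic names and the `pcl_helper` conversion functions (`ros_to_pcl`, `pcl_to_ros`) are assumptions based on the course setup, not necessarily identical to my code:

```python
import rospy
from sensor_msgs.msg import PointCloud2
# Conversion helpers between ROS PointCloud2 messages and pcl clouds (course-provided module).
from sensor_stick.pcl_helper import ros_to_pcl, pcl_to_ros

def pcl_callback(pcl_msg):
    # Convert the ROS message into a pcl cloud, then run the steps listed above.
    cloud = ros_to_pcl(pcl_msg)
    # ... noise removal, downsampling, passthrough, RANSAC, clustering, classification ...
    # Publish intermediate clouds so they can be inspected in RViz.
    pcl_objects_pub.publish(pcl_to_ros(cloud))  # placeholder: the objects cloud would go here

if __name__ == '__main__':
    rospy.init_node('object_recognition', anonymous=True)
    # Topic of the simulated RGBD camera (assumed name).
    pcl_sub = rospy.Subscriber('/pr2/world/points', PointCloud2, pcl_callback, queue_size=1)
    pcl_objects_pub = rospy.Publisher('/pcl_objects', PointCloud2, queue_size=1)
    while not rospy.is_shutdown():
        rospy.spin()
```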
- Statistical Outlier Filter: This was used for step 2 of the perception pipeline. It is a technique to remove noise by calculating distances between points. The parameters for this filter are a mean-k value (the number of neighboring points analyzed for each point) and a standard-deviation multiplier threshold (how far a point's mean neighbor distance may deviate from the global mean before it is treated as an outlier). I achieved good results with a mean-k of 20 and a threshold of 0.3.
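
  A minimal sketch of this step, assuming the python-pcl bindings used in the course:

  ```python
  # cloud: the PointCloud_PointXYZRGB converted from the ROS message in step 1
  outlier_filter = cloud.make_statistical_outlier_filter()
  outlier_filter.set_mean_k(20)                # analyze 20 neighboring points per point
  outlier_filter.set_std_dev_mul_thresh(0.3)   # threshold = mean distance + 0.3 * std dev
  cloud_filtered = outlier_filter.filter()
  ```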
- Voxel Grid Downsampling: This was used for step 3 of the perception pipeline. The voxel grid filter downsamples the data by taking a spatial average of the points in the cloud, combining all points inside each voxel into one point. The parameter for this filter is the edge length of the voxels, called LEAF_SIZE. I achieved good results with a LEAF_SIZE of 0.005.
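
  A minimal sketch, again with the python-pcl bindings:

  ```python
  # cloud_filtered: output of the statistical outlier filter
  vox = cloud_filtered.make_voxel_grid_filter()
  LEAF_SIZE = 0.005                            # voxel edge length in meters
  vox.set_leaf_size(LEAF_SIZE, LEAF_SIZE, LEAF_SIZE)
  cloud_downsampled = vox.filter()
  ```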
- Passthrough Filter: This was used for step 4 of the perception pipeline. With prior information about the location of the target in the scene, the passthrough filter can remove useless data from the point cloud. It takes locational parameters: the axis along which the filter should be applied, together with a min and max value for that axis. I applied two filters: one along the z-axis with min 0.6 and max 1.3 to remove the table legs, and one along the y-axis with min -0.5 and max 0.5 to remove the table edges. What remains is the table with the items.
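
  A sketch of the two passthrough filters with the values mentioned above:

  ```python
  # cloud_downsampled: output of the voxel grid filter
  passthrough_z = cloud_downsampled.make_passthrough_filter()
  passthrough_z.set_filter_field_name('z')
  passthrough_z.set_filter_limits(0.6, 1.3)    # keep only the table top and the objects
  cloud_cut = passthrough_z.filter()

  passthrough_y = cloud_cut.make_passthrough_filter()
  passthrough_y.set_filter_field_name('y')
  passthrough_y.set_filter_limits(-0.5, 0.5)   # crop away the table edges
  cloud_cut = passthrough_y.filter()
  ```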
- RANSAC Plane Fitting: This was used for step 5 of the perception pipeline. The Random Sample Consensus algorithm divides a cloud into inliers (points belonging to a model) and outliers (the rest, to be discarded). This was used to remove the table and only leave the objects in the point cloud. The parameter of RANSAC is the maximum distance a point may have from the fitted plane model before it is regarded as an outlier. I achieved good results with max_distance 0.01.
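
  A sketch of the plane segmentation:

  ```python
  import pcl

  # cloud_cut: output of the passthrough filters
  seg = cloud_cut.make_segmenter()
  seg.set_model_type(pcl.SACMODEL_PLANE)
  seg.set_method_type(pcl.SAC_RANSAC)
  seg.set_distance_threshold(0.01)             # max distance from the plane model
  inliers, coefficients = seg.segment()

  cloud_table = cloud_cut.extract(inliers, negative=False)   # the fitted plane (table)
  cloud_objects = cloud_cut.extract(inliers, negative=True)  # everything else (the objects)
  ```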
- Euclidean Clustering: This was used for step 6 of the perception pipeline. Using a k-d tree, this clustering technique divides the given point cloud into individual objects. First the color information is removed from the points, as PCL's cluster implementation needs only location information. Then the parameters for cluster tolerance, min and max cluster size, and a search method are defined. The min and max values discard clusters that are too small or too big. The values I ended up with after a lot of trial and error are: cluster_tolerance 0.015, min 20, max 3000.
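
  A sketch of the clustering, assuming the course's `XYZRGB_to_XYZ` helper to strip the color channel:

  ```python
  from sensor_stick.pcl_helper import XYZRGB_to_XYZ

  # cloud_objects: output of the RANSAC step (table removed)
  white_cloud = XYZRGB_to_XYZ(cloud_objects)   # clustering works on location only
  tree = white_cloud.make_kdtree()

  ec = white_cloud.make_EuclideanClusterExtraction()
  ec.set_ClusterTolerance(0.015)
  ec.set_MinClusterSize(20)
  ec.set_MaxClusterSize(3000)
  ec.set_SearchMethod(tree)
  cluster_indices = ec.Extract()               # list of point-index lists, one per object
  ```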
- Cluster Visualization: This was used for step 7 of the perception pipeline. To visualize the results of the clustering in RViz, each detected object was assigned a unique color.
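
  A sketch of the coloring step, assuming the course's `get_color_list` and `rgb_to_float` helpers:

  ```python
  import pcl
  from sensor_stick.pcl_helper import get_color_list, rgb_to_float

  # cluster_indices, white_cloud: outputs of the Euclidean clustering step
  cluster_color = get_color_list(len(cluster_indices))   # one random color per cluster
  color_cluster_point_list = []
  for j, indices in enumerate(cluster_indices):
      for i in indices:
          color_cluster_point_list.append([white_cloud[i][0],
                                           white_cloud[i][1],
                                           white_cloud[i][2],
                                           rgb_to_float(cluster_color[j])])

  cluster_cloud = pcl.PointCloud_PointXYZRGB()
  cluster_cloud.from_list(color_cluster_point_list)       # published to RViz for debugging
  ```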
- Feature Association: This was used for step 9 of the perception pipeline. In step 10, a Support Vector Machine would finally decide which object has been found by looking at two features: shape and color. For the color part, histogram features were extracted from the input data.
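
  A sketch of the color-histogram features; the bin count and value range are illustrative choices, and in the project the equivalent functionality lived in the course's `compute_color_histograms` / `compute_normal_histograms` helpers:

  ```python
  import numpy as np

  def color_histogram(rgb_points, nbins=32, bins_range=(0, 256)):
      # Histogram each color channel separately, then concatenate and normalize,
      # so objects of different sizes produce comparable feature vectors.
      r_hist = np.histogram([p[0] for p in rgb_points], bins=nbins, range=bins_range)[0]
      g_hist = np.histogram([p[1] for p in rgb_points], bins=nbins, range=bins_range)[0]
      b_hist = np.histogram([p[2] for p in rgb_points], bins=nbins, range=bins_range)[0]
      features = np.concatenate((r_hist, g_hist, b_hist)).astype(np.float64)
      return features / np.sum(features)
  ```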
- Support Vector Machine: This was used for step 10 of the perception pipeline. The SVM is a machine learning algorithm that divides the feature space into discrete classes by making use of a training set. Each item in the training set is characterized by a feature vector and a label. The training takes place before the project is run; during execution of the project, the trained classifier is used to determine which object a given point cloud cluster matches.
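
  A sketch of how the trained classifier is applied to each cluster during the project run; the model file name and the saved label encoder follow the training sketch above and are assumptions, not necessarily the exact layout of my code:

  ```python
  import pickle
  import numpy as np

  model = pickle.load(open('model.sav', 'rb'))            # produced by the training script
  clf = model['classifier']
  encoder = model['encoder']

  # feature: concatenated color and shape histograms of one detected cluster
  prediction = clf.predict(np.array(feature).reshape(1, -1))
  label = encoder.inverse_transform(prediction)[0]        # e.g. 'glue', 'soap2', ...
  ```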
For reasons yet unknown to me, the project keeps mixing up the glue and soap2. I would experiment a bit further with various training-set sizes to see if this issue gets better, but because of the slow VM speed I refrained from that. It would also be worth checking the feature-association step to see whether the histograms look too similar.