Ashutosh Saxena, Justin Driemeyer, Andrew Y. Ng
Okan Yıldıran - 2015700153
Justin Driemeyer
Ashutosh Saxena
Andrew Y. Ng
PhD, Machine Learning, Stanford University with Andrew Y. Ng (advisor), Sebastian Thrun and Stephen Boyd.
Lots of awards.
Co-founded several companies.
MS, Computer Science - AI, Stanford University
Director of Engineering at Zynga until 2014
Chief Scientist of Baidu
Chairman and Co-founder of Coursera
Associate Professor (Research) of Stanford.
Founder of “Google Brain”
In this paper they solved the problem of grasping novel objects by using 2D images.
We present a learning algorithm
that neither requires, nor tries to build, a 3-d model
of the object.
For most of the objects, there is a small region that selected by humans as grasping point.
So, error is defined by how close our algorithm finds grasping point to that small region.
Instead of collecting real-world data, they generated a huge data-set that is automatically labeled in different lightning conditions, positions and with randomized features like colors and scales.
Divide image into small patches and try to estimate if it is grasping point or not.
For each patch look for edges, textures and color.
Convert image to YCbCr color space.
For each patch = 17 features
Also use different spatial scales (3 in paper)
For each patch
17 * 3 = 54 features
In a 5x5 window, compute neighbors
For each patch
= 17 * 3 + 24 * 17
= 459 features
Their algorithm calculates every possible cell grasp performance which is extremely inefficient
(over 110 seconds)
Instead they only computed places close to possible grasping points by 3σ
(about 1 second)
5-dof
7-dof
The average accuracy for classifying whether a 2-d image patch is a projection of a grasping point was 94.2%
Training was performed on synthetic images.
Test with real world objects with 5-dof robot.
Achieved an average grasping success rate of 80.0% in a total of 20 trials
They studied how they can use information about object type and location to improve the performance of the grasping algorithm.
7-dof robotic platform that is more capable manipulator showed improvement over grasping objects such as bowls.
We proposed an algorithm for enabling a robot to grasp an object that it has never seen before. Our learning algorithm neither tries to build, nor requires, a 3-d model of the object.