Learning from mistakes and transferable skills -- the attributes for a worker robot

November 04, 2019

Practise makes perfect ¬- it is an adage that has helped humans become highly dexterous and now it is an approach that is being applied to robots.

Computer scientists at the University of Leeds are using the artificial intelligence (AI) techniques of automated planning and reinforcement learning to "train" a robot to find an object in a cluttered space, such as a warehouse shelf or in a fridge - and move it.

The aim is to develop robotic autonomy, so the machine can assess the unique circumstances presented in a task and find a solution - akin to a robot transferring skills and knowledge to a new problem.

The Leeds researchers are presenting their findings today (Monday, November 4) at the International Conference on Intelligent Robotics and Systems in Macau, China.

Their paper can be read here.

The big challenge is that in a confined area, a robotic arm may not be able to grasp an object from above. Instead it has to plan a sequence of moves to reach the target object, perhaps by manipulating other items out of the way. The computer power needed to plan such a task is so great, the robot will often pause for several minutes. And when it does execute the move, it will often fail.

Developing the idea of practise makes perfect, the computer scientists at Leeds are bringing together two ideas from AI.

One is automated planning. The robot is able to "see" the problem through a vision system, in effect an image. Software in the robot's operating system simulates the possible sequence of moves it could make to reach the target object.

But the simulations that have been "rehearsed" by the robot fail to capture the complexity of the real world and when they are implemented, the robot fails to execute the task. For example, it can knock objects off the shelf.

So the Leeds team have combined planning with another AI technique called reinforcement learning.

Reinforcement learning involves the computer in a sequence of trial and error attempts - around 10,000 in total - to reach and move objects. Through these trial and error attempts, the robot "learns" which actions it has planned are more likely to end in success.

The computer undertakes the learning itself, starting off by randomly selecting a planned move that might work. But as the robot learns from trial and error, it becomes more adept at selecting those planned moves that have a greater chance of being successful.

Dr Matteo Leonetti, from the School of Computing, said: "Artificial intelligence is good at enabling robots to reason - for example, we have seen robots involved in games of chess with grandmasters.

"But robots aren't very good at what humans do very well: being highly mobile and dexterous. Those physical skills have been hardwired into the human brain, the result of evolution and the way we practise and practise and practise.

"And that is an idea that we are applying to the next generation of robots."

According to Wissam Bejjani, a PhD student who wrote the research paper, the robot develops an ability to generalise, to apply what it has planned to a unique set of circumstances.

He said: "Our work is significant because it combines planning with reinforcement learning. A lot of research to try and develop this technology focuses on just one of those approaches.

"Our approach has been validated by results we have seen in the University's robotics lab.

"With one problem, where the robot had to move a large apple, it first went to the left side of the apple to move away the clutter, before manipulating the apple.

"It did this without the clutter falling outside the boundary of the shelf."

Dr Mehmet Dogar, Associate Professor in the School of Computing, was also involved in the study. He said the approach had speeded up the robot's "thinking" time by a factor of ten - decisions that took 50 seconds now take 5 seconds.
The research received funding from the UK Engineering and Physical Sciences Research Council in a project to investigate 'human-like physics' in robotics.

University of Leeds

Related Learning Articles from Brightsurf:

Learning the language of sugars
We're told not to eat too much sugar, but in reality, all of our cells are covered in sugar molecules called glycans.

When learning on your own is not enough
We make decisions based on not only our own learning experience, but also learning from others.

Learning more about particle collisions with machine learning
A team of Argonne scientists has devised a machine learning algorithm that calculates, with low computational time, how the ATLAS detector in the Large Hadron Collider would respond to the ten times more data expected with a planned upgrade in 2027.

Getting kids moving, and learning
Children are set to move more, improve their skills, and come up with their own creative tennis games with the launch of HomeCourtTennis, a new initiative to assist teachers and coaches with keeping kids active while at home.

How expectations influence learning
During learning, the brain is a prediction engine that continually makes theories about our environment and accurately registers whether an assumption is true or not.

Technology in higher education: learning with it instead of from it
Technology has shifted the way that professors teach students in higher education.

Learning is optimized when we fail 15% of the time
If you're always scoring 100%, you're probably not learning anything new.

School spending cuts triggered by great recession linked to sizable learning losses for learning losses for students in hardest hit areas
Substantial school spending cuts triggered by the Great Recession were associated with sizable losses in academic achievement for students living in counties most affected by the economic downturn, according to a new study published today in AERA Open, a peer-reviewed journal of the American Educational Research Association.

Lessons in learning
A new Harvard study shows that, though students felt like they learned more from traditional lectures, they actually learned more when taking part in active learning classrooms.

Learning to look
A team led by JGI scientists has overhauled the perception of inovirus diversity.

Read More: Learning News and Learning Current Events
Brightsurf.com is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to Amazon.com.