Robots, Humans and Action

This research project expands the capabilities of robots by building representations and models that will allow a robot to understand human activity and the intent of human actions; with the ultimate goal to develop methods that facilitate robot-human interaction and cooperation.

Project Leader

Team Members

Project Aim

To understand human actions and intent, robots need to make inferences from visual and motion cues, just like humans do. This research project expands the capabilities of robots by building representations and models that allow a robot to understand human activity and the interaction of humans with objects in their environment. The research is important because it will ultimately enable robots to co-operate with humans to complete complex task in unstructured environments; for example, assembling a piece of furniture in the home.

Key Results

In 2020, the project team continue to make important scientific contributions to solving the problems of human activity recognition and forecasting. Two papers by Centre researchers (Fatemeh Saleh, Xin Yu and Hongdong Li) and colleagues received best paper nominations at the 2020 conference for Computer Vision and Pattern Recognition (CVPR), for work on saliency detection and sign language recognition, respectively, which also has applications in understanding human gestures for human-robot cooperation. Another paper by Centre researchers Cristian Rodriguez Opazo, Xin Yu, Hongdong Li and collaborator Dongxu Li also received an honourable mention at the 2020 Winter Conference on Applications of Computer Vision (WACV).

Project Leader, Stephen Gould, Professor Richard Hartley and Postdoctoral Fellow, Dylan Campbell presented a workshop at CVPR 2020 on Deep Declarative Networks, which received a Centre award for Best Profile Raising Event in Robotics and Computer Vision Communities. This was followed up with a tutorial on the same topic organised by Postdoctoral Fellow Itzik Ben-Shabat was held at the European Conference on Computer Vision (ECCV) and included presentations by colleagues from Stanford and Facebook.

The team also published the Ikea Assembly dataset at WACV. The dataset is a multi-modal and multi-view video collection of furniture assembly tasks to enable rich analysis and understanding of human activities. It contains 371 samples of furniture assemblies and their ground-truth annotations. Each sample includes 3 RGB views, one depth stream, atomic actions, human poses, object segments, object tracking, and extrinsic camera calibration. The videos, annotations and associated code for data processing have been publicly released to the research community and were including it in the Best of ACRV Repository that is available on the Centre’s Legacy Website.

The dataset enabled a demonstration of human-robot cooperation in assembling a small Ikea table in collaboration with the Manipulation Project. The demo led by Postdoctoral Fellow Itzik Ben-Shabat and PhD Student Zheyu Zhuang, was showcased at RoboVis 2020. Discussion with Ikea Sweden are underway on research collaborations that can further extend this work.

The end of the year saw several our research team take up new positions as the Centre comes to close. Postdoctoral Fellow Dylan Campbell has taken up a position with the Visual Geometry Group at Oxford University, Itzik Ben-Shabat has commenced a prestigious three-year Marie-Curie Fellowship, and PhD student Cristian Rodriguez successfully submitted his PhD titled “Video Analysis for Understanding Human Actions and Interactions”. Postdoctoral Fellow Fatemeh Saleh and PhD Student Sadegh Aliakbarian were also awarded a DSTG grant to research player analytics and forecasting.

2020 Annual Report

Robots, Humans and Action

This research project expands the capabilities of robots by building representations and models that will allow a robot to understand human activity and the intent of human actions; with the ultimate goal to develop methods that facilitate robot-human interaction and cooperation.

Project Leader

Stephen Gould

Team Members

Hongdong Li

Richard Hartley

Dylan Campbell

Fatemeh Saleh

Yizhak (Itzik) Ben-Shabat

Cristian Rodriguez Opazo

Frederic ‘Zhen’ Zhang

Sadegh Aliakbarian

Project Aim

Key Results