Suman Saha

Postdoctoral researcher at CVL (Computer Vision Lab), ETH Zürich, Switzerland
  • Suman Saha,
  • Postdoctoral Research Fellow,
  • Room No. ETF D113,
  • Computer Vision Lab. (CVL)
  • ETH Zürich, Switzerland
  • Email: suman.saha [at] vision (dot) ee [dot] ethz (dot) ch
  • Google Scholar
  • Official Webpage
  • download cv

I am a postdoctoral researcher at CVL (Computer Vision Lab), ETH Zürich, Switzerland. My research interests lie in Computer Vision, Machine Learning (ML) and of course Deep Learning (DL). In particular, my current research aims at developing novel algorithms for multi-task and domain-agnostic representation learning.
Multi-task learning (MTL) allows a deep network to simultaneously encode information from multiple tasks. Deep convolutional neural networks (CNNs) are good at learning representations for several vision-based tasks such as image classification, object detection, semantic segmentation and monocular depth estimation. Typically, these tasks are handled by CNNs independently, i.e. a separate model is optimized for each task, resulting in a number of task-specific models. However, real-world problems are more complex and require models able to perform multiple tasks on demand, without significantly compromising each task’s performance.
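As a rough illustration of the shared-representation idea described above, here is a minimal sketch of one encoder feeding several task-specific heads. It is a toy example and not the method of any paper listed on this page: the class name `MultiTaskModel`, the layer sizes, the task names and the random initialisation are all assumptions made for illustration, and plain dense layers stand in for a CNN.

```python
import random


def linear(weights, bias, x):
    """Apply a dense layer: one output per weight row, dot(row, x) + b."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(values):
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in values]


def init_layer(n_out, n_in, rng):
    """Return (weights, bias) with small random weights and zero bias."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


class MultiTaskModel:
    """Hypothetical toy model: one shared encoder, one small head per task."""

    def __init__(self, n_in, n_shared, task_dims, seed=0):
        rng = random.Random(seed)
        self.encoder = init_layer(n_shared, n_in, rng)
        self.heads = {name: init_layer(dim, n_shared, rng)
                      for name, dim in task_dims.items()}

    def forward(self, x, tasks=None):
        # The shared features are computed once and reused by every head.
        shared = relu(linear(*self.encoder, x))
        # Run only the requested heads ("on demand"); default is all tasks.
        tasks = list(self.heads) if tasks is None else tasks
        return {t: linear(*self.heads[t], shared) for t in tasks}


# Assumed task names and output sizes, for illustration only.
model = MultiTaskModel(n_in=4, n_shared=8,
                       task_dims={"classification": 3, "depth": 1})
outputs = model.forward([0.1, 0.2, 0.3, 0.4])
depth_only = model.forward([0.1, 0.2, 0.3, 0.4], tasks=["depth"])
```

The point of the design is that the encoder cost is paid once per input, while each extra task adds only a small head, in contrast to training a separate full model per task.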
Another challenging problem in ML is domain shift, which often causes models to perform poorly on unseen target datasets (domains). Domain-agnostic feature learning allows an ML model to generalize well to unseen target domains. For example, a self-driving car trained on datasets of European road scenes might perform badly at a target location outside Europe.

Publications.

Reparameterizing Convolutions for Incremental Multi-Task Learning Without Task Interference

Menelaos Kanakis, David Bruggemann, Suman Saha, Stamatios Georgoulis, Anton Obukhov, Luc Van Gool

ECCV 2020

|   pdf   |    arXiv  |    Poster   |    Slides   |

Domain Agnostic Feature Learning for Image and Video Based Face Anti-spoofing

Suman Saha, Wenhao Xu, Menelaos Kanakis, Stamatios Georgoulis, Yuhua Chen, Danda Pani Paudel, Luc Van Gool

CVPR 2020 Workshops

|   Workshop Oral Video   |    Workshop Oral Slides  |    PDF   |    arxiv   |

Book chapter title: Spatio-Temporal Action Instance Segmentation and Localisation

Suman Saha, Gurkirt Singh, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin

Book title: Modelling Human Motion: From Human Perception to Robot Design

Publisher: Springer International Publishing, pages: 141-161, ISBN: 978-3-030-46732-6, year: 2020.

|   Springer Book   |   Project   |

Two-Stream AMTnet for Action Detection

Suman Saha, Gurkirt Singh, Fabio Cuzzolin

arXiv 2020.

|   arxiv   |

Unsupervised Deep Representations for Learning Audience Facial Behaviors

Suman Saha, Rajitha Navarathna, Leonhard Helminger, Romann M. Weber

CVPR 2018 Workshops

|   pdf   |    arXiv  |    Poster   |

Predicting Action Tubes

Gurkirt Singh, Suman Saha, Fabio Cuzzolin

ECCV 2018 Workshops

|   arXiv   |

Incremental Tube Construction for Human Action Detection

Harkirat Singh Behl, Michael Sapienza, Gurkirt Singh, Suman Saha, Fabio Cuzzolin, Philip H. S. Torr

BMVC 2018 (Oral)

|   arXiv   |

Spatio-temporal Human Action Detection and Instance Segmentation in Videos

Suman Saha

PhD thesis, Oxford Brookes University, United Kingdom, 2018

|   PhD Thesis PDF    |    PhD Thesis Defense Slides   |

TraMNet - Transition Matrix Network for Efficient Action Tube Proposals

Gurkirt Singh, Suman Saha, Fabio Cuzzolin

ACCV 2018

|   arxiv   |

Action Detection from a Robot-Car Perspective

Valentina Fontana, Manuele Di Maio, Stephen Akrigg, Gurkirt Singh, Suman Saha, Fabio Cuzzolin

arXiv 2018

|   arxiv   |

AMTnet: Action-Micro-Tube regression by end-to-end trainable deep architecture

Suman Saha, Gurkirt Singh, Fabio Cuzzolin

ICCV 2017

|   pdf   |   suppl. material   |   arxiv   |   poster   |   Code   |

Online Real-time Multiple Spatiotemporal Action Localisation and Prediction

Gurkirt Singh, Suman Saha, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin

ICCV 2017

|   pdf   |   suppl. material   |   arxiv   |   poster   |   ICCV 2017 Demo Video   |   Code   |

Spatio-temporal human action localisation and instance segmentation in temporally untrimmed videos

Suman Saha, Gurkirt Singh, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin

arXiv, 2017

|   Project Page Link   |   arxiv   |

Metric learning for Parkinsonian identification from IMU gait measurements

Fabio Cuzzolin, Michael Sapienza, Patrick Esser, Suman Saha, Miss Marloes Franssen, Johnny Collett, Helen Dawes

Gait & Posture, Volume 54, May 2017, Pages 127-132

|   ScienceDirect link   |

Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos

Suman Saha, Gurkirt Singh, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin

BMVC 2016

|   project page link  |   arxiv  |   Full Version  |   code  |   poster   |

A real-time monocular vision-based frontal obstacle detection and avoidance for low cost UAVs in GPS denied environment

Suman Saha, Ashutosh Natraj, Sonia Waharte

Aerospace Electronics and Remote Sensing Technology (ICARES), 2014 IEEE International Conference on

|   pdf  |   project page link   |

Face Recognition using PCA and Multilayer Feedforward Neural Networks

Suman Saha

European Journal of Applied Sciences and Technology [EUJAST] Volume 1 (1), March 2014

|   pdf   |

A Monocular Vision Approach for Obstacle Detection and Collision Avoidance for Low-cost Quadrocopters

Suman Saha

MSc Thesis, University of Bedfordshire, United Kingdom, January 2014

|   MSc Thesis  |   MSc Defense Poster  |   project page link   |

Research Activities and Awards.


Before joining ETH Zürich, I was a Research Associate (RA), i.e. a postdoctoral researcher, at the Department of Computing and Communication Technologies, Oxford Brookes University, where I spent four wonderful years (including my PhD studies).
I received my PhD degree under the supervision of Professor Fabio Cuzzolin at Oxford Brookes University, United Kingdom. Professor Nigel Crook and Dr Tjeerd Olde Scheper were my PhD co-supervisors.
My PhD thesis topic was Spatio-temporal Human Action Detection and Instance Segmentation in Videos. The two main objectives of my PhD thesis were to propose: (1) efficient algorithms to locate (in space and time) multiple co-occurring human action instances present in realistic videos; (2) powerful video-level deep feature representations to improve the state-of-the-art action detection accuracy.
In addition, I was an active member of the Artificial Intelligence and Vision Research Group led by Professor Fabio Cuzzolin. I consider myself fortunate to have had the opportunity to work closely with the world-renowned Torr Vision Group (TVG) in the Department of Engineering Science at the University of Oxford. More specifically, during my PhD, I worked with my PhD guide Dr Michael Sapienza and Professor Philip H. S. Torr, who is the founder of TVG.
During summer 2017, I received a wonderful opportunity to work with Dr Romann Weber, Senior Research Scientist and Head of the Machine Intelligence and Data Science Group at Disney Research Zurich (DRZ). At DRZ, I worked on a project on unsupervised and semi-supervised learning of audience facial expressions using deep generative models. We improved the classification accuracy by 9% over the existing method.

I completed my Master's studies at the Department of Computer Science and Technology, University of Bedfordshire (UoB), United Kingdom. During my MSc thesis work (i.e., in 2013-2014), I proposed a novel real-time algorithm for frontal obstacle detection and avoidance for low-cost unmanned aerial vehicles (UAVs). The related publication can be accessed using this link. My MSc thesis supervisors were Dr Ashutosh Natraj and Sonia Waharte, postdoctoral researchers in the Department of Computer Science, University of Oxford.

Before pursuing my Master's studies in the UK, I worked as a Software Analyst at the Research and Development and Scientific Services division, Tata Steel Ltd., India. At R&D Tata Steel, I worked under the supervision of Dr Sumitesh Das, Chief (Global Research Programmes) at Tata Steel Ltd. My CV can be viewed by clicking this link.

I received my Polytechnic Diploma Engineering degree in Computer Science from Siddaganga Polytechnic College, India.
