Yiming Ma 🎮

Yiming Ma

Computer Vision Researcher

University of Warwick

👋 Hi, there! I’m Yiming Ma (马一铭 in Chinese). As a PhD candidate at MathSys CDT at the University of Warwick, my research focuses on multimodality in computer vision and its applications (e.g., in crowd counting and driver monitoring systems). I am passionate about bridging mathematics and deep learning to solve real-world problems.

Skills

Mathematics

80%

Statistics

50%

Python

95%

Recent Publications

Quickly discover relevant content by filtering publications.
(2023). Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-Attention. In CVPRW.

PDF Cite Code

Experience

 
 
 
 
 
Ford
Research Assistant
October 2021 – Present Coventry

This research projects aims to build an intelligent interior sensing systems for monitoring and recognising drivers’ activities from heterogeneous multi-view, multimodal data, encompassing video, audio, heart rate and driving trends. Efficient deep learning models will be developed to combine information from disparate modalities, detect relevant activities and accurately classify the detected activity. Additionally, the project will answer critical research questions including

  1. Which class of deep models is the best suited for the activity recognition task?
  2. Which subset or combination of these modalities will provide the most efficient means to monitor driver activities?
  3. What are the limitations of the SOTA models?

Responsibilities include:

  • Literature reviewing;
  • Data accessing;
  • Model designing.

Education

 
 
 
 
 
University of Warwick
PhD in Mathematics of Systems
October 2021 – Present Coventry, UK
  • Supervisors: Dr. Victor Sanchez & Dr. Tanaya Guha.
  • Memo: The first half year was an extension of my MSc individual project. We proposed an efficient multi-scale method that leverages extracted features directly and does not require extra modules to handle head size variation. Then funded by Ford, we moved towards multi-view multimodal driver monitoring systems, which exploit various cameras (e.g., RGB, IR, and depth) deployed at different in-car positions (e.g., on the top of the driver) to monitor the driver’s activities. We proposed a multi-branch model that employs the multi-head self-attention mechanism to fuse extracted feature maps. Our current focus is the general multimodal interaction/integration methods with applications including but not restricted to crowd counting and driver monitoring systems.
 
 
 
 
 
University of Warwick
MSc in Mathematics of Systems
October 2020 – October 2021 Coventry, UK
  • Modules: Data Analysis and Machine Learning, Stochastic Modelling and Random Processes, Numerical Algorithms and Optimisation, Fundamentals of Mathematical Modelling, Data Mining, Agent-Based Systems, Interdisciplinary Approaches to Machine Learning.
  • Group Project: Prediction of Oestrus Intervals for Guide Dogs, supervised by Prof. Colm Connaughton. This project is about predicting the next oestrus intervals of guide dogs based on features such as breeds, ages, foods, etc.
  • Individual Project: Digital Twins of Urban Crowds, supervised by: Dr. Victor Sanchez & Dr. Tanaya Guha. This project aims to utilise a single traffic camera feed to develop an end-to-end deep network that estimates the crowd density in a given scene in near real-time.
 
 
 
 
 
Southern University of Science and Technology
BSc in Mathematics and Applied Mathematics
September 2016 – June 2020 Shenzhen, China
  • Modules (GPA: 92.18/100):
    • Pure Maths: Mathematical Analysis, Linear Algebra, Complex Analysis, Real Analysis, ODE, PDE, Functional Analysis, Fourier Analysis, etc.
    • Applied Maths: Numerical Analysis, Optimisation, etc.
    • Statistics: Mathematical Statistics, Linear Statistical Models, Time Series Analysis, etc.
  • Thesis: Contraction Methods in Composite Optimisation, supervised by Prof. Bingsheng He.
  • Awards: the 1st (2019), 2nd (2017), 3rd (2018) Class of the Merit Student Scholarships.