Question to be solved

Can we use deep learning methods to derive digital measures that capture information about neurological impairments?

General Background

Roche is at the forefront of the digital health space, with regular deployments of digital health solutions in clinical studies of neurological diseases (Lipsmeier et al., 2018; Midaglia et al., 2018). One of the flagship tests for assessing fine motor control is the Draw-a-Shape test (Creagh et al., 2020), which involves tracing pre-determined shapes on a smartphone screen. Processing the resulting touch data allows us to extract features that are associated with upper limb motor impairment and disease progression in neurological diseases.
The aim of this challenge is to determine whether machine learning approaches can be used to detect disease patterns that have not been captured by the existing Draw-a-Shape features. Both generative and discriminative modelling could be used. Note that the touch data can be thought of as a temporal sequence of inputs.

Data Types & Technologies

The input data will consist of touch traces of pre-defined shapes on a smartphone screen. Each test consists of the following shapes: line (top to bottom), line (bottom to top), square, circle, figure eight, and spiral. The data are collected during clinical studies of people with neurological disorders. Each participant performs the test daily for the duration of the study.

The following are some approaches that could be considered for this challenge, but we are open to other solutions:

  • Neural decomposition methods (Märtens & Yau, 2020).
  • Classifiers that predict clinical measures (such as tests of hand function).
  • Sequence processing models such as LSTM networks (Hochreiter & Schmidhuber, 1997), Temporal-Convolutional Networks (Bai et al., 2018), and Transformers (Vaswani et al., 2017).
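
As a rough illustration of how touch traces might be prepared for the sequence models listed above, the sketch below converts each trace into per-step increments and pads traces of different lengths to a common length with a mask. The sample format (time, x, y), the function names trace_to_steps and pad_batch, and all values are assumptions made for illustration only; the actual data format of the Draw-a-Shape test is not specified here.

```python
from typing import List, Tuple

# Hypothetical sample format: (time in seconds, x in pixels, y in pixels).
Sample = Tuple[float, float, float]
Step = Tuple[float, float, float]


def trace_to_steps(trace: List[Sample]) -> List[Step]:
    """Convert absolute samples into per-step (dt, dx, dy) increments."""
    steps = []
    for (t0, x0, y0), (t1, x1, y1) in zip(trace, trace[1:]):
        steps.append((t1 - t0, x1 - x0, y1 - y0))
    return steps


def pad_batch(sequences: List[List[Step]], pad_value: Step = (0.0, 0.0, 0.0)):
    """Pad variable-length sequences to a common length and return a mask.

    The mask holds 1 for real steps and 0 for padding, so that a sequence
    model can ignore the padded positions.
    """
    max_len = max(len(s) for s in sequences)
    padded, mask = [], []
    for s in sequences:
        n_pad = max_len - len(s)
        padded.append(list(s) + [pad_value] * n_pad)
        mask.append([1] * len(s) + [0] * n_pad)
    return padded, mask


# Two made-up traces of different lengths (illustrative values only).
line_trace = [(0.00, 10.0, 10.0), (0.02, 10.0, 14.0), (0.04, 11.0, 19.0)]
square_trace = [(0.00, 0.0, 0.0), (0.02, 5.0, 0.0), (0.04, 10.0, 0.0), (0.06, 10.0, 5.0)]

batch, mask = pad_batch([trace_to_steps(line_trace), trace_to_steps(square_trace)])
```

The padded batch and mask could then be converted to tensors in a framework such as PyTorch or TensorFlow. Using increments rather than absolute coordinates is one possible design choice, not a requirement of the challenge.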

Needed Skills

  • Good communication skills
  • Solid Python skills
  • Experience with deep learning frameworks, e.g. PyTorch, TensorFlow
  • Understanding of the statistical underpinnings of machine learning methods


Frank Dondelinger
Data Analysis Lead MS, Digital Biomarkers, pREDi

Marcin Elantkowski
Principal Associate Data Analyst, Digital Biomarkers, pREDi

Form of Cooperation

Internship, 3 months. Full-time preferred; part-time possible.

How to present your Idea

Preferred presentation format: 3-5 slides. Other forms of presentation are possible if they serve a purpose. Knowledge of Python and machine learning will be checked during the pitch sessions.

