Tanmay Basu

Dr Tanmay Basu

Research Fellow


Tanmay joined the University of Birmingham in 2019 to work on developing methods and tools for information extraction from electronic health records (EHR). A computer scientist by training, Tanmay has knowledge and skills within text mining, natural language processing (NLP) and machine learning (ML) techniques. His Computer Science doctoral thesis explained novel text clustering and classification methods. Subsequently, he worked as a postdoctoral fellow at Northwestern University, Chicago and then at University of Michigan, Ann Arbor, USA on various projects related to information extraction from biomedical publications and EHR using ML and NLP. In 2017, Tanmay worked as a junior faculty member at Ramakrishna Mission Vivekananda Educational and Research Institute, West Bengal, India.

Martin Chapman

Dr Martin Chapman

Research Associate


Martin is a Research Associate working in Health Informatics at King’s College London. He is currently involved in developing models and tooling for computational phenotypes, funded by Health Data Research UK and the NIHR Biomedical Research Centre.

Vasa Curcin

Dr Vasa Curcin

Senior Lecturer in Health Informatics


Vasa is a Senior Lecturer in Health Informatics, leading the Biomedical Informatics Group at School of Population Health and Environmental Sciences at King’s College London. His research focuses on the theoretical informatics foundations and resulting software infrastructures for the Learning Health Systems, specifically in the areas of clinical trials and decision support systems, together with the role of data provenance in delivering reproducibility and auditability to scientific tasks.

Jim Davies

Prof Jim Davies

Professor of Software Engineering


Jim studied mathematics at New College, Oxford, joining the Computing Laboratory (now Department of Computer Science) in 1986 for a Masters’ and doctorate. After working as a researcher and lecturer in computer science, at Oxford and Reading, he became a lecturer in software engineering at Oxford in 1995. He has led the Software Engineering Programme since 2000, and was made Professor of Software Engineering in 2006.

Spiros Denaxas

Prof Spiros Denaxas (PI)

Professor in Biomedical Informatics


Spiros is Professor in Biomedical Informatics based at the UCL Institute of Health Informatics and leads the National Phenomics Resource Implementation project. His lab’s research focuses on creating and evaluating novel computational methods for risk prediction, data modelling, phenotyping and subphenotype discovery in structure electronic health records and other clinical and genomic data. He also leads CALIBER, a translational research resource which links electronic health records from primary care, hospital care and national mortality registries for ~10m patients.

Natalie Fitzpatrick

Natalie Fitzpatrick

HDR UK Phenomics Programme Manager


Natalie is the Phenomics Programme Manager for the Health Data Research UK (HDR UK) National Phenomics Resource and National Text Analytics Implementation projects. She is based at the UCL Institute of Health Informatics and co-leads the Institute’s Phenomics Group. Natalie has more than 20 years’ experience managing complex research programmes involving large linked electronic health record data. She also leads on facilitating research collaborations for the CALIBER platform ( and leads on patient and public engagement for the Institute.

Chuang Gao

Dr Chuang Gao

Data Scientist


Chuang is a Data Scientist working in the Health Informatics Centre (HIC), University of Dundee. His research focuses on creating and evaluating novel models and tooling for Electronic Health Record federation system. He is also currently involved in UK national phenotypes portal development, guideline development of multimorbidity clinical trials.

George Gkoutos

Prof George Gkoutos

Professor in Clinical Bioinformatics


George is a Professor in Clinical Bioinformatics, a joint appointment between the University of Birmingham Medical School and the University Hospitals Birmingham NHS Foundation Trust. He is a Faculty Affiliate Professor of Clinical Bioinformatics, Berkeley Lab, Associate Director of MRC Health Data Research UK, Co-Director of the WCH-Birmingham Research Institute and Alan Turing Fellow. His lab’s research lie in the fields of translational research and integrative systems biology, biomedical knowledge formalisation, standardisation and representation, AI based multimodal large data harmonisation, interoperability, integration and analysis, comparative phenomics, genomics and multiomics integration and analytics.

Emily Jefferson

Prof Emily Jefferson

Professor of Health Data Science and Director of Health Informatics Centre, Dundee


Emily is the academic lead and Director of the Health Informatics Centre (HIC) at the University of Dundee and Chair of Health Data Science. She has a team of >40 people providing research and services to over 600 different research projects bringing expertise in health informatics, data science, data management, governance, machine learning and AI. She is responsible for an ISO27001 certified infrastructure of >300 servers and a Scottish Government accredited Safe Haven. Emily’s main research interest is in innovative automated methods for the provision of sensitive linked data which meets both data governance requirements and those of the research community. HIC have provided >2000 linked data extracts for research utilising phenotype definitions. Her team will bring this expertise to the project and be able to utilise the portal to support reusable phenotypes at UK wide scale.

Andreas Karwath

Dr Andreas Karwath

Rutherford Research Fellow


Andreas is generally interested in research in and development of semantic-based “information commons for research” health care-related data frameworks. These frameworks should serve as fully integrated health research information platforms, with access to both deep clinical data and extensive unstructured and large data on individual patients and populations.

Shahzad Mumtaz

Dr Shahzad Mumtaz

Postdoctoral research assistant / Health data scientist at the Health Informatics Centre (HIC), University of Dundee


Shahzad has a PhD in computer science from Aston University, UK. His masters and BSc degrees are also in computer science from The Islamia University of Bahawalpur, Pakistan. Just after PhD, he worked as a postdoctoral researcher for an 11 months project at Aston University, UK. His research interests are in machine learning models both in terms of their mathematical foundations and their application to health care and bioinformatics problems. He has worked on projects like: An analysis of protein family of major histocompatibility complex, demand forecasting for National Health Services (NHS) emergency departments of West-Midlands hospitals, UK, probabilistic modelling of blood glucose via eye parameters and development of Data Visualisation and Modelling System.

Helen Parkinson

Dr Helen Parkinson

Head of Molecular Archival Resources, EMBL-EBI


Helen is Head of Molecular Archival Resources at EMBL-EBI. Helen’s team manages, analyses, and distributes complex high-throughput phenotypic data from mammals and promotes phenomic data integration internationally. They also develop open-source software tools for managing data, developing and integrating ontologies and data, and integrating semantic web technologies.

Daniel Thayer

Daniel Thayer

Senior Data Scientist


Dan leads a team of data scientists that support research with the SAIL Databank, a repository of anonymised Welsh population data. He is responsible for handling enquiries about SAIL, supporting all existing research projects, and developing tools and infrastructure to improve the research process, as well as sitting on the SAIL operations group.

Leandro Tramma

Leandro Tramma

Application Development Technical Manager


Leandro is an experienced software developer with great passion for beautifully crafted software and a keen interest in Application Lifecycle Management and good Software Design. In 2013 he joined HIC as a Senior Software Developer. Leandro currently manages multiple projects within the ADP team, providing technical expertise and guidance for the team. He is involved in the design and development of the Research Data Management Platform for HIC Services. Leandro graduated in Telecommunications Engineering at the University of Rome “La Sapienza” and in 2007 he moved to Scotland from Italy.

Susheel Varma

Dr Susheel Varma

Director of Engineering, Health Data Research UK


Susheel is Director of Engineering at Health Data Research UK (HDR UK), providing technology leadership and delivery of all technology infrastructure projects in HDR UK. He also provides alignment to HDR UK’s international strategy across HDR UK’s portfolio of research hubs, (meta)data alliance, training and infrastructure projects. He has been part of numerous European Commission and Research Councils UK funded projects and was a core member of the VPH-Share project leading the development of scientific workflows in multiple clinical research domains. He was the lead architect of GIMIAS and the VPH-Dare@IT platform for Dementia research and Translational Technology Officer (CTO) of CISTIB. He is also a lifetime Fellow of the Software Sustainability Institute and a member of the Society of Research Software Engineers. He has published a number of research works and invited keynote speaker at academic and international conferences. He was also the Human Genomics and Translational Data Platform Coordinator at ELIXIR, he leveraged his broad operating range to lead the development and operations of a portfolio of core infrastructure activities in EMBL-EBI to align with European Research Infrastructures – EMBL, ELIXIR & EOSC and international activities – GA4GH, where he continues co-lead three Cloud Computing standards in Genomics & Healthcare.

James Welch

James Welch

Doctoral student


James gained a BA in Computation from Oxford University in 2004, and stayed in Oxford to do a DPhil in Software Engineering. He is currently working on the MRC Data Support Service in conjunction with OeRC and Software Engineering. James’ current interests are in the development of enterprise software, “model-driven” technologies, the Unified Modelling Language, model transformations and metamodelling. He is also interested in formal methods, refinement and in particular the languages of Z and B. These interests are combined in a current project on the Booster language and Toolkit, and put into practice on a number of development projects.


Prof Sinead Brophy (Swansea University)

Prof Richard Dobson (King’s College London / UCL Institute of Health Informatics)

Prof Colin McCowan (University of St Andrews)

Prof Ruth Gilbert (UCL Great Ormond Street Institute of Child Health / UCL Institute of Health Informatics)

Dr Andreas Karwath (University of Birmingham)

Dr Arturo Gonzalez-Izquierdo (UCL Institute of Health Informatics)

Dr Harriet Forbes (London School of Hygiene & Tropical Medicine)

Kristiina Rannikmae (University of Edinburgh)

John Nolan

Dr Steve Harris (University College London Hospitals)