Events 32 x 13.1 ( with space) ppt.png

Events and Talks

 

In AI, Machine Learning and Data Science across the University and beyond.

Events

C2D3 event Conference In person

C2D3 Computational Biology Annual Symposium 2026

13 May 2026

Uni of Cambridge Workshop In person

Accelerate Programme: Lent Term Training Workshops

9 Feb 2026 - 23 Mar 2026

19 Mar 2026 - 20 Mar 2026

Uni of Cambridge Workshop In person

ArCH Hands on with the Hub

20 Mar 2026

Uni of Cambridge Talk

AI and the Future of Public Health

25 Mar 2026

Uni of Cambridge Workshop In person

Getting Started with SAS

26 Mar 2026

Uni of Cambridge Conference Hybrid

Bennett School of Public Policy Annual Conference 2026

26 Mar 2026

Uni of Cambridge Workshop In person

INI AI for Maths and Open Science

30 Mar 2026 - 1 Apr 2026

Turing Conference In person

AI for Science

31 Mar 2026

Uni of Cambridge Training In person

CRIT Building computational pipelines with Nextflow

14 Apr 2026 - 15 Apr 2026

20 Apr 2026 - 21 Apr 2026

6 Jul 2026 - 7 Jul 2026

13 Jul 2026 - 17 Jul 2026

14 Jul 2026 - 29 Jul 2026

How can your research influence policy? Uni of Cambridge
Data Profiling Workshop External
Turing Data Study Group External
FinHealthTech: New opportunities at the intersection of health and wealth. External
Fetch.ai Cambridge Winter Warmer External
CCIMI Colloquium: Mark Girolami - The Statistical Finite Element Method Uni of Cambridge
What is the Future of Digitally Enabled Service Business? Uni of Cambridge
Ensembl Rest API Workshop External
Ensembl Browser Workshop External
Cambridge Networks Day 2019
Automating the Crowd: Workshop 2
Who are the real people behind artificial intelligence?
Machine Learning for Environmental Sciences 2019
CCIMI Conference - Geometric and Topological Approaches to Data Analysis
Advances and challenges in Machine Learning Languages
Cambridge Big Data Research Symposium
Cybersecurity for Smart Infrastructure: Challenges and Opportunities
Ensembl browser workshop
Data Challenges in Cardiovascular Research
Personal Data Stores: A new approach to control of online privacy
'Scores of Scores': Possibilities and Pitfalls with Musical Corpora
Hands-off my health records: why sharing your health data matters
Cryptocurrencies and ICO : Trends and Opportunities
Big Data and personalised medicine
Manufacturing Analytics: Preliminary lessons and the way forward
Inaugural meeting for a Consortium for AI in Medicine at Cambridge
High Dimensional Big Data Engineering
Sensors and Data in Robotics
Environmental Science in the Big Data Era
An introduction to the Turing-HSBC partnership in Economic Data Science
Dodgy Data in the news: How to spot it and how to stop it
Big Data Analytics Service Forum
Big Data in Medicine: Tools, Transformation and Translation
Cambridge Networks Day 2017
The Future of Big Data Patent Analytics
National Physical Laboratory UK Workshop on Data Metrology & Standards
Digital Echoes: Understanding Patterns of Mass Violence with Data and Statistics
Scalable Data Processing for Big Data from Laptop, Multi-core, to Cluster Computing
Ethics of Big Data Workshop
Cantab Capital Institute for the Mathematics of Information - Launch Event
University of Cambridge Mathematics and Big Data Showcase
The Alan Turing Institute – Energy Summit
Our Digital Future - Multidisciplinary Perspectives on Long Term Data…
Big Data, Multimodality & Dynamic Models in Biomedical Imaging
EPSRC Centre for Mathematical and Statistical Analysis of Multimodal…
Ethics of Big Data in practice: Social media research
Ethics of Big Data in practice: Administrative data
Ethics of Big Data in practice: Patient record linkage in hospitals
Ethics of Big Data in practice: Health and Policy research in Africa
Workshop on Urban Data Science #wuds15

Talks

Upcoming related talks from talks@cam

Date Title Speaker Abstract
Beyond Surface Matching: Reasoning, Grounding, and Retrieval in Vision-Language Models Prof. Vicente Ordóñez-Román (Rice University) Abstract: Vision-language models have made remarkable progress on multimodal benchmarks, yet much of this performance relies on shallow pattern matching — single-vector compression in retrieval, brute-force training scaling in reasoning, and surface-level lexical cues in grounding. In this talk, I present recent work that addresses these limitations. I begin with MetaEmbed, a flexible multi-vector retrieval framework that introduces learnable meta tokens processed by a vision-language backbone, whose contextualized representations enable late interaction at variable granularity.
AI Metrics: Theoretical Foundations, Design, and Selection of Evaluation Metrics Based on Ground Truth Enrique Amigó (National University of Distance Education, Madrid, Spain) In this talk (based on a book draft, see this link) I propose a unified formal framework for ground truth based evaluation metrics and task characterization grounded in measurement theory. Building on this foundation, I analyze the formal properties of existing metrics and organize them into families according to task characteristics. The book covers a wide range of discriminative tasks, including classification, ranking, clustering, and sequence labelling, among others, as well as text generation.
Towards a Comprehensive View on Technology Transparency: Cross-Technology Investigations of Users’ Transparency Needs and Perceptions Ilka Hein, LMU Munich Users’ subjective experience of a technology’s transparency plays a pivotal role in human-computer interaction, shaping trust, satisfaction, and technology use. Moreover, as interactive systems become more autonomous and complex, industry and policy increasingly acknowledge users’ growing need to understand what a technology is doing, how it functions, and why it produces certain outcomes. Moving beyond the currently fragmented research landscape, this talk offers a comprehensive perspective on technology transparency.
How life finds a way: resilience in mammalian embryogenesis Sarah Bowling, PhD. Assistant Professor in the Department of Developmental Biology at Stanford University School of Medicine​ Speaker: Sarah Bowling, Ph.D. Assistant Professor in the Department of Developmental Biology at Stanford University School of Medicine​ Title: “How life finds a way: resilience in mammalian embryogenesis​” Abstract: TBC Short bio: Dr. Sarah Bowling is an Assistant Professor in the Department of Developmental Biology at Stanford University School of Medicine. Her laboratory focuses on understanding the mechanisms governing resilience in mammalian embryogenesis - i.e. determining how embryos withstand and recover from diverse genetic and environmental perturbations.
Compositional Design of Society-Critical Systems: From Autonomy to Future Mobility Gioele Zardini When designing complex systems, we need to consider multiple trade-offs at various abstraction levels and scales, and choices of single components need to be studied jointly. For instance, the design of future mobility solutions (e.g., autonomous vehicles, micromobility) and the design of the mobility systems they enable are closely coupled. Indeed, knowledge about the intended service of novel mobility solutions would impact their design and deployment process, while insights about their technological development could significantly affect transportation management policies.
Reinforcement Learning with Exogenous States and Rewards Professor Thomas G. Dietterich, School of EECS, Oregon State University Exogenous state variables and rewards can slow reinforcement learning by injecting uncontrolled variation into the reward signal. In this talk, I’ll describe our work on formalizing exogenous state variables and rewards. Then I’ll discuss our main result: if the reward function decomposes additively into endogenous and exogenous components, the MDP can be decomposed into an exogenous Markov Reward Process (based on the exogenous reward) and an endogenous Markov Decision Process (optimizing the endogenous reward).
to decide Kartik Tandon to decide
BSU Seminar: "A unifying framework for generalised Bayesian online learning in non-stationary environments" Gerado Duran-Martin, Oxford-Man Institute, University of Oxford We propose a unifying framework for methods that perform probabilistic online learning in non-stationary environments. We call the framework BONE, which stands for generalised (B)ayesian (O)nline learning in (N)on-stationary (E)nvironments. BONE provides a common structure to tackle a variety of problems, including online continual learning, prequential forecasting, and contextual bandits.
BSU Seminar: "Nonparametric causal decomposition of group disparities" Ang Yu, Hong Kong University of Science and Technology We introduce a new nonparametric causal decomposition approach that identifies the mechanisms by which a treatment variable contributes to a group-based outcome disparity. Our approach distinguishes three mechanisms: group differences in: (1) treatment prevalence, (2) average treatment effects, and (3) selection into treatment based on individual-level treatment effects.
CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models Zhijiang Guo (HKUST (GZ) | HKUST) In this talk, I will present CodeScaler, a novel framework designed to overcome the scalability bottlenecks of Reinforcement Learning from Verifiable Rewards (RLVR) in code generation. While traditional RLVR relies heavily on the availability of high-quality unit tests—which are often scarce or unreliable—CodeScaler introduces an execution-free reward model that scales both training and test-time inference.
A Novel Diffusion Model based Approach for Sleep Music Generation Kevin Monteiro, Department of Computer Science and Technology Sleep disorders, particularly insomnia, and mental health conditions affect a significant fraction of adults worldwide, posing seriousmmental and physical health risk. Music therapy offers promising, low-cost, and non-invasive treatment, but current approaches rely heavily on expert-curated playlists, limiting scalability and personalisation. We propose a low-cost generative system leveraging recent advances in diffusion models to synthesize music for therapy. We focus on insomnia and curate a dataset of waveform sleep music to generate audio tailored to sleep.
TBD Daniel Platt, Imperial College London TDB
Representational Geometry of Language Models Matthieu Téhénan (University of Cambridge) Abstract not available
Understanding the Interplay between LLMs' Utilisation of Parametric and Contextual Knowledge Prof Isabelle Augenstein (University of Copenhagen) Language Models (LMs) acquire parametric knowledge from their training process, embedding it within their weights. The increasing scalability of LMs, however, poses significant challenges for understanding a model's inner workings and further for updating or correcting this embedded knowledge without the significant cost of retraining. Moreover, when using these language models for knowledge-intensive language understanding tasks, LMs have to integrate relevant context, mitigating their inherent weaknesses, such as incomplete or outdated knowledge.
Talk by Aaron Mueller (Boston University) Aaron Mueller (Boston University) Abstract not available
C2D3 Computational Biology Annual Symposium 2026 Keynote: Natasha Latysheva (Google DeepMind) We warmly invite you to the C2D3 Computational Biology Annual Symposium 2026. This event is open to everyone in the Computational Biology Community. https://www.c2d3.cam.ac.uk/events/comp-bio-2026 Early Career Researcher: Abstract Submission We are inviting Early Career Researchers to present their research during the symposium. Talks should be 17 minutes each, and a short Q&A will follow. Abstract submission - Deadline 9am 1st April 2026. Registrations Registration is essential. A waitlist will open if capacity is reached. Registrations - Deadline 9am Monday 4th May 2026.
Title to be confirmed Arduin Findeis (University of Cambridge) Abstract not available
The AI Ecosystem as a Reasoning Maze: How Collaborative Intelligence Accelerates Scientific Discovery Yuri Yuri (Oxford) Scientific discovery emerges not from isolated reasoning, but from the intersection of diverse epistemic traditions. This talk proposes that the modern AI ecosystem, a structured network of heterogeneous reasoning agents spanning approximate and rigorous inference, constitutes a new form of collaborative intelligence for scientific inquiry. Drawing on Simon's conception of reasoning as adaptive search, we argue that such ecosystems do not merely accelerate known reasoning pathways, but create conditions under which genuinely novel representations may emerge.
The AI Ecosystem as a Reasoning Maze: How Collaborative Intelligence Accelerates Scientific Discovery Yuri Yuri (Oxford) Scientific discovery emerges not from isolated reasoning, but from the intersection of diverse epistemic traditions. This talk proposes that the modern AI ecosystem, a structured network of heterogeneous reasoning agents spanning approximate and rigorous inference, constitutes a new form of collaborative intelligence for scientific inquiry. Drawing on Simon's conception of reasoning as adaptive search, we argue that such ecosystems do not merely accelerate known reasoning pathways, but create conditions under which genuinely novel representations may emerge.
The AI Ecosystem as a Reasoning Maze: How Collaborative Intelligence Accelerates Scientific Discovery Yuri Yuri (Oxford) Scientific discovery emerges not from isolated reasoning, but from the intersection of diverse epistemic traditions. This talk proposes that the modern AI ecosystem, a structured network of heterogeneous reasoning agents spanning approximate and rigorous inference, constitutes a new form of collaborative intelligence for scientific inquiry. Drawing on Simon's conception of reasoning as adaptive search, we argue that such ecosystems do not merely accelerate known reasoning pathways, but create conditions under which genuinely novel representations may emerge.