Third Workshop on Multimodal AI · 16–17 September 2025 · London, UK
| Name | Title |
|---|---|
| Hazrat Ali | From Pixels to Procedures: Structured Surgical Scene Understanding via Multimodal Large Language Models |
| Sedat Dogan | Early Prediction of Multimodal Cross-Lingual Meme Virality |
| Hanwen Xing | Multiomics Data Integration via Neighbourhood Preservation |
| Zainab Almugbel | Multi-Modal MAML: Revisiting Features Fusion for Discriminative Generalization and Class Distribution |
| Enrico Parisini | Concept-Based Modelling for Multimodal Flows |
| Emmanouil Benetos | Multimodal Music Understanding |
| Qifan Fu | Gesture Space Quantized Mixture of Experts |
| Alessandro Suglia | Pixel-Based Language Models: A Unified Approach to Multimodal AI Agents |
| Lu Gan | A Lightweight Multimodal Audio Scene Classification Framework via Knowledge Distillation |
| Siyi Du | STiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification |
| Konstantin Georgiev | MM-HealthFair: A Novel Framework for Quantifying and Mitigating Healthcare Biases in Multimodal AI Algorithms for Risk Prediction |
| Sabrina McCallum | GPTNT: Evaluating Multimodal Language Model Agents in Time-Critical Collaborative Tasks |
| Stephan Goerttler | Stochastic Graph Heat Modelling for Cross-Modal Connectivity Estimation |
| Adam Wynn | Semi-supervised Speech Confidence Detection using Whisper Embeddings |
| Mohammad Aadil Minhaz | Multimodal AI Security: Mitigating Prompt Attacks on AI Models using AI-Gateway |
| Wenrui Fan | Foundation-Model-Boosted Multimodal Learning for fMRI-based Neuropathic Pain Drug Response Prediction |
| Hanya Tamer Ahmed | Bridging Species with AI: A Cross-Species Deep Learning Model for Fracture Detection and Beyond |
| Junxi Zhang | Decoding Ambiguity: A Multimodal Dataset for Ambiguous Actions in Manufacturing |
| Tadiyos Hailemichael Mamo | Causal Learning for Enhanced Chronic Disease Management and Interventions |
| Minoru Dhananjaya Jayakody Arachchige | Multimodal Inspection of End-of-Life Components |
| Munib Mesinovic | MM-GraphSurv: Interpretable Multi-Modal Graph for Survival Prediction with Electronic Health Records |
| Fiona Young | Crossmodal Contrastive Learning with Pathology and Transcriptomics |
| Angeline Wang | Neural Substrates of Affective Empathy: Interactions between ACC and InC |
| Fan Guo | Enhancing Negotiation Policies via Spatio-Temporal Directed Graphs for Autonomous Interaction |
| Misbah Rafique | Realistic Galaxy Images Through Generative Adversarial Network |
| Mohammod Suvon | Multimodal Latent Fusion of ECG Leads for Early Assessment of Pulmonary Hypertension |
| Carolina Scarton | AI-TRACE: AI-driven mulTimodal and tempoRal disinformAtion analysis models in Continuous data strEams |
| Mingcheng Zhu | From Byte Pair to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction |
| Boyu Chen | Robust Multimodal Autonomous Driving Perception under Occlusions |
| Jingzhi Ruan | A Multi-Scale Tactile-Visual-Text Alignment Framework Driven by Large Models |
| Chenqi Li | Multi-Teacher Distillation for Multimodal Biosignal Foundation Models |
| Chenqi Li | BioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals |
| David Western | Fusion of a Priori Clinical Text Enhances Abnormal EEG Classification |
| Lu Gan | Digital Twins and Multimodal AI for Net Zero Housing |
| Chen Chen | Advancing Cardiac Care through Multi-Modal Data Integration for Precise Scar Mapping |
| Farheen Ramzan | CLAIM: Clinically-Guided LGE Augmentation for Realistic and Diverse Myocardial Scar Synthesis and Segmentation |
| Jessica Fan | Tiered Vibe Mapping (TVM): A Feature-Space Decomposition for Aesthetic Modeling |
| Shibo Li | A Unified Multi-modal Foundation Model for Medical Imaging Synthesis and Diagnosis |
| Hegel Pedroza | Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords, and Scales |
| Wenjing Zhao | Multimodal Approach in Corner Case Generation for Autonomous Driving |
| Teng Gao | Non-causal Economic Model Predictive Control for Wave Energy Converter |
| Vincentius Versandy Wijaya | Non-causal Model Predictive Control for Wave Energy Converters Based on Physics-Informed Neural Networks |
| Kewei Zhu | ReadMOF: Structure-Free Semantic Embeddings from Systematic MOF Nomenclature for Machine Learning Applications |
| Gaoyun Fang | Understanding Multimodal Fusion through Cross-Modal Interaction |
| Ruby Wood | Multimodal AI for Prediction of Response to Immunotherapy in Cancer Patients |
| Rahul Singh Maharjan | FM-OVD: Towards Fast Open-Vocabulary Object Detection with Feature-wise Modulation |
| Harry Findlay | Multimodal Perception and Representation Learning in Human Behaviour Modelling |
| Harshith Yerraguntla | Multimodal Glucose Forecasting with Physics-Informed Neural Networks for Type 1 Diabetes |
| Jason Lo | From Data to Concepts via Wiring Diagrams |
| Zixuan Huang | Multimodal RL-Diffusion Framework for Automated Generation of High-Risk Scenarios in Autonomous Vehicle Safety Testing |
| Tianyi Jiang | Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry |
| Sneha Roychowdhury | Towards a Standardised Framework for Explainable AI in Healthcare using Non-imaging Data: Integrating User-Centric Perspectives, Evaluation Metrics, and Contextual Expertise |
| Mingrui Ye | Can MLLMs be Students’ Art Mentor? A Multi-Dimensional Benchmark Towards Comprehensive Assessment and Pedagogical Feedback |
| Awais Rauf | Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning |
| Awais Rauf | Bridging Domain Gaps in Specialized Fields: Multimodal Foundation Models for Sustainable Agriculture |
| Nasim Mohamed Ismail | Addressing Systematic Bias in Multimodal Integration for Alzheimer’s Disease Classification |
| Noor Ul Ain Zahra | AI-Driven Validation: Predicting NMR Spectra to Assess the Fidelity of AlphaFold Structures |
| Yutong Song | Proactive Multi-Agent Reinforcement Learning for Search and Rescue in Stochastic Ocean Environments |
| L. M. Riza Rizky | Interpretable Multimodal Machine Learning for Identifying Drug Treatment Response Biomarkers in Neuropathic Pain |
| Haolin Wang | Benchmarking Band Gap Prediction for Semiconductor Materials Using Multimodal and Multi-Fidelity Data |
| Li Zhang | Integrating Heterogeneous Data Sources to Enhance Trading Strategies in Commodity Futures Markets |
| Xianyuan Liu | Geometry-Aware Line Graph Transformer Pre-training for Molecular Property Prediction |
| Sina Tabakhi | Missing-Modality-Aware Graph Neural Network for Cancer Classification |
| Xianyuan Liu | Towards Deployment-Centric Multimodal AI Beyond Vision and Language |
| Jiin Woei Lee | Interpretable Multimodal AI for Predicting Early Biological Cell Responses to Biomaterial Implant Coatings |
| Ziming Liu | A City-Scale Multimodal Dataset and Benchmark Suite for AI-Driven Radio Resource Control in Wireless Networks |
| Luigi A. Moretti | A Multimodal Affective Computing Pipeline for Correlating Physiological and Subjective Data Streams in Anxiety Disorders Management |
| Halimat Afolabi | Examining Modality-Dependent Explanations and Reasoning Shifts in Closed Multimodal LLMs for Emotion Recognition |
| Daniel Onah | Benchmarking Machine Learning Ensemble Algorithms for a Classification Task |