Third Workshop on Multimodal AI · 16–17 September 2025 · London, UK
Name | Title |
---|---|
Hazrat Ali | From Pixels to Procedures: Structured Surgical Scene Understanding via Multimodal Large Language Models |
Sedat DOGAN | Early Prediction of Multimodal Cross-Lingual Meme Virality |
Swapnil Bhosale | Holistic Scene Representations for Immersive Audio Synthesis |
Hanwen Xing | Multiomics Data Integration via Neighbourhood Preservation |
Zainab Almugbel | Multi-Modal MAML: Revisiting Features Fusion for Discriminative Generalization and Class Distribution |
Enrico Parisini | Concept-Based Modelling for Multimodal Flows |
Emmanouil Benetos | Multimodal Music Understanding |
Qifan Fu | Gesture Space Quantized Mixture of Experts |
Alessandro Suglia | Pixel-Based Language Models: A Unified Approach to Multimodal AI Agents |
Lu Gan | A Lightweight Multimodal Audio Scene Classification Framework via Knowledge Distillation |
Siyi Du | STiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification |
Konstantin Georgiev | MM-HealthFair: A Novel Framework for Quantifying and Mitigating Healthcare Biases in Multimodal AI Algorithms for Risk Prediction |
Sabrina McCallum | GPTNT: Evaluating Multimodal Language Model Agents in Time-Critical Collaborative Tasks |
Stephan Goerttler | Stochastic Graph Heat Modelling for Cross-Modal Connectivity Estimation |
Adam Wynn | Semi-supervised Speech Confidence Detection using Whisper Embeddings |
Mohammad Aadil Minhaz | Multimodal AI Security: Mitigating Prompt Attacks on AI Models using AI-Gateway |
Wenrui Fan | Foundation-Model-Boosted Multimodal Learning for fMRI-based Neuropathic Pain Drug Response Prediction |
Hanya Tamer Ahmed | Bridging Species with AI: A Cross-Species Deep Learning Model for Fracture Detection and Beyond |
Junxi Zhang | Decoding Ambiguity: A Multimodal Dataset for Ambiguous Actions in Manufacturing |
Tadiyos Hailemichael Mamo | Causal Learning for Enhanced Chronic Disease Management and Interventions |
Minoru Dhananjaya Jayakody Arachchige | Multimodal Inspection of End-of-Life Components |
Xinxing Ren | SimuGen: Multi-Modal Agentic Framework for Constructing Block Diagram-Based Simulation Models |
Munib Mesinovic | MM-GraphSurv: Interpretable Multi-Modal Graph for Survival Prediction with Electronic Health Records |
Fiona Young | Crossmodal Contrastive Learning with Pathology and Transcriptomics |
Angeline Wang | Neural Substrates of Affective Empathy: Interactions between ACC and InC |
Fan Guo | Enhancing Negotiation Policies via Spatio-Temporal Directed Graphs for Autonomous Interaction |
Misbah Rafique | Realistic Galaxy Images Through Generative Adversarial Network |
Mohammod Suvon | Multimodal Latent Fusion of ECG Leads for Early Assessment of Pulmonary Hypertension |
Carolina Scarton | AI-TRACE: AI-driven mulTimodal and tempoRal disinformAtion analysis models in Continuous data strEams |
Mingcheng Zhu | From Byte Pair to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction |
Boyu Chen | Robust Multimodal Autonomous Driving Perception under Occlusions |
Jingzhi Ruan | A Multi-Scale Tactile-Visual-Text Alignment Framework Driven by Large Models |
Chenqi Li | Multi-Teacher Distillation for Multimodal Biosignal Foundation Models |
Chenqi Li | BioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals |
David Western | Fusion of a Priori Clinical Text Enhances Abnormal EEG Classification |
Lu Gan | Digital Twins and Multimodal AI for Net Zero Housing |
Chen Chen | Advancing Cardiac Care through Multi-Modal Data Integration for Precise Scar Mapping |
Farheen Ramzan | CLAIM: Clinically-Guided LGE Augmentation for Realistic and Diverse Myocardial Scar Synthesis and Segmentation |
Jessica Fan | Tiered Vibe Mapping (TVM): A Feature-Space Decomposition for Aesthetic Modeling |
Shibo Li | A Unified Multi-modal Foundation Model for Medical Imaging Synthesis and Diagnosis |
Hegel Pedroza | Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords, and Scales |
Wenjing Zhao | Multimodal Approach in Corner Case Generation for Autonomous Driving |
Teng Gao | Non-causal Economic Model Predictive Control for Wave Energy Converter |
Vincentius Versandy Wijaya | Non-causal Model Predictive Control for Wave Energy Converters Based on Physics-Informed Neural Networks |
Kewei Zhu | ReadMOF: Structure-Free Semantic Embeddings from Systematic MOF Nomenclature for Machine Learning Applications |
Gaoyun Fang | Understanding Multimodal Fusion through Cross-Modal Interaction |
Ruby Wood | Multimodal AI for Prediction of Response to Immunotherapy in Cancer Patients |
Rahul Singh Maharjan | FM-OVD: Towards Fast Open-Vocabulary Object Detection with Feature-wise Modulation |
Harry Findlay | Multimodal Perception and Representation Learning in Human Behaviour Modelling |
Harshith Yerraguntla | Multimodal Glucose Forecasting with Physics-Informed Neural Networks for Type 1 Diabetes |
Jason Lo | From Data to Concepts via Wiring Diagrams |
Zixuan Huang | Multimodal RL-Diffusion Framework for Automated Generation of High-Risk Scenarios in Autonomous Vehicle Safety Testing |
Tianyi Jiang | Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry |
Sneha Roychowdhury | Towards a Standardised Framework for Explainable AI in Healthcare using Non-imaging Data: Integrating User-Centric Perspectives, Evaluation Metrics, and Contextual Expertise |
Mingrui Ye | Can MLLMs be Students’ Art Mentor? A Multi-Dimensional Benchmark Towards Comprehensive Assessment and Pedagogical Feedback |
Awais Rauf | Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning |
Awais Rauf | Bridging Domain Gaps in Specialized Fields: Multimodal Foundation Models for Sustainable Agriculture |
Nasim Mohamed Ismail | Addressing Systematic Bias in Multimodal Integration for Alzheimer’s Disease Classification |
Noor Ul Ain Zahra | AI-Driven Validation: Predicting NMR Spectra to Assess the Fidelity of AlphaFold Structures |
Yutong Song | Proactive Multi-Agent Reinforcement Learning for Search and Rescue in Stochastic Ocean Environmen |
L. M. Riza Rizky | Interpretable Multimodal Machine Learning for Identifying Drug Treatment Response Biomarkers in Neuropathic Pain |
Jiani Chen | Learning-Based Wave Prediction |
Haolin Wang | Benchmarking Band Gap Prediction For Semiconductor Materials Using Multimodal And Multi-Fidelity Data |
Li Zhang | Integrating Heterogeneous Data Sources to Enhance Trading Strategies in Commodity Futures Markets |
Xianyuan Liu | Geometry-Aware Line Graph Transformer Pre-training for Molecular Property Prediction |
Sina Tabakhi | Missing-Modality-Aware Graph Neural Network for Cancer Classification |
Xianyuan Liu | Towards Deployment-Centric Multimodal AI Beyond Vision and Language |
Jiin Woei Lee | Interpretable Multimodal AI for Predicting Early Biological Cell Responses to Biomaterial Implant Coatings |
Zhongtian Sun | Hybrid Framework for Lifelong Medical Imaging |
Ziming Liu | A City-Scale Multimodal Dataset and Benchmark Suite for AI-Driven Radio Resource Control in Wireless Networks |
Luigi A. Moretti | A Multimodal Affective Computing Pipeline for Correlating Physiological and Subjective Data Streams in Anxiety Disorders Management |
Valentin Danchev | Evaluation of Risks of Overreliance on AI Multimodal Models |
Abdul Ghani Zahid | Physics-Guided Domain-Aware Deep Learning for Robust Wireless Modulation Classification |
Gaoyun Fang | Understanding Multimodal Fusion through Cross-Modal Interaction |
Halimat Afolabi | Examining Modality‑Dependent Explanations and Reasoning Shifts in Closed Multimodal LLMs for Emotion Recognition |
Daniel Onah | Benchmarking Machine Learning Ensemble Algorithms for a Classification Task |