Accepted Abstracts

Third Workshop on Multimodal AI · 16–17 September 2025 · London, UK

Name    Title
Hazrat AliFrom Pixels to Procedures: Structured Surgical Scene Understanding via Multimodal Large Language Models
Sedat DOGANEarly Prediction of Multimodal Cross-Lingual Meme Virality
Swapnil BhosaleHolistic Scene Representations for Immersive Audio Synthesis
Hanwen XingMultiomics Data Integration via Neighbourhood Preservation
Zainab AlmugbelMulti-Modal MAML: Revisiting Features Fusion for Discriminative Generalization and Class Distribution
Enrico ParisiniConcept-Based Modelling for Multimodal Flows
Emmanouil BenetosMultimodal Music Understanding
Qifan FuGesture Space Quantized Mixture of Experts
Alessandro SugliaPixel-Based Language Models: A Unified Approach to Multimodal AI Agents
Lu GanA Lightweight Multimodal Audio Scene Classification Framework via Knowledge Distillation
Siyi DuSTiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification
Konstantin GeorgievMM-HealthFair: A Novel Framework for Quantifying and Mitigating Healthcare Biases in Multimodal AI Algorithms for Risk Prediction
Sabrina McCallumGPTNT: Evaluating Multimodal Language Model Agents in Time-Critical Collaborative Tasks
Stephan GoerttlerStochastic Graph Heat Modelling for Cross-Modal Connectivity Estimation
Adam WynnSemi-supervised Speech Confidence Detection using Whisper Embeddings
Mohammad Aadil MinhazMultimodal AI Security: Mitigating Prompt Attacks on AI Models using AI-Gateway
Wenrui FanFoundation-Model-Boosted Multimodal Learning for fMRI-based Neuropathic Pain Drug Response Prediction
Hanya Tamer AhmedBridging Species with AI: A Cross-Species Deep Learning Model for Fracture Detection and Beyond
Junxi ZhangDecoding Ambiguity: A Multimodal Dataset for Ambiguous Actions in Manufacturing
Tadiyos Hailemichael MamoCausal Learning for Enhanced Chronic Disease Management and Interventions
Minoru Dhananjaya Jayakody ArachchigeMultimodal Inspection of End-of-Life Components
Xinxing RenSimuGen: Multi-Modal Agentic Framework for Constructing Block Diagram-Based Simulation Models
Munib MesinovicMM-GraphSurv: Interpretable Multi-Modal Graph for Survival Prediction with Electronic Health Records
Fiona YoungCrossmodal Contrastive Learning with Pathology and Transcriptomics
Angeline WangNeural Substrates of Affective Empathy: Interactions between ACC and InC
Fan GuoEnhancing Negotiation Policies via Spatio-Temporal Directed Graphs for Autonomous Interaction
Misbah RafiqueRealistic Galaxy Images Through Generative Adversarial Network
Mohammod SuvonMultimodal Latent Fusion of ECG Leads for Early Assessment of Pulmonary Hypertension
Carolina ScartonAI-TRACE: AI-driven mulTimodal and tempoRal disinformAtion analysis models in Continuous data strEams
Mingcheng ZhuFrom Byte Pair to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction
Boyu ChenRobust Multimodal Autonomous Driving Perception under Occlusions
Jingzhi RuanA Multi-Scale Tactile-Visual-Text Alignment Framework Driven by Large Models
Chenqi LiMulti-Teacher Distillation for Multimodal Biosignal Foundation Models
Chenqi LiBioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals
David WesternFusion of a Priori Clinical Text Enhances Abnormal EEG Classification
Lu GanDigital Twins and Multimodal AI for Net Zero Housing
Chen ChenAdvancing Cardiac Care through Multi-Modal Data Integration for Precise Scar Mapping
Farheen RamzanCLAIM: Clinically-Guided LGE Augmentation for Realistic and Diverse Myocardial Scar Synthesis and Segmentation
Jessica FanTiered Vibe Mapping (TVM): A Feature-Space Decomposition for Aesthetic Modeling
Shibo LiA Unified Multi-modal Foundation Model for Medical Imaging Synthesis and Diagnosis
Hegel PedrozaGuitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords, and Scales
Wenjing ZhaoMultimodal Approach in Corner Case Generation for Autonomous Driving
Teng GaoNon-causal Economic Model Predictive Control for Wave Energy Converter
Vincentius Versandy WijayaNon-causal Model Predictive Control for Wave Energy Converters Based on Physics-Informed Neural Networks
Kewei ZhuReadMOF: Structure-Free Semantic Embeddings from Systematic MOF Nomenclature for Machine Learning Applications
Gaoyun FangUnderstanding Multimodal Fusion through Cross-Modal Interaction
Ruby WoodMultimodal AI for Prediction of Response to Immunotherapy in Cancer Patients
Rahul Singh MaharjanFM-OVD: Towards Fast Open-Vocabulary Object Detection with Feature-wise Modulation
Harry FindlayMultimodal Perception and Representation Learning in Human Behaviour Modelling
Harshith YerraguntlaMultimodal Glucose Forecasting with Physics-Informed Neural Networks for Type 1 Diabetes
Jason LoFrom Data to Concepts via Wiring Diagrams
Zixuan HuangMultimodal RL-Diffusion Framework for Automated Generation of High-Risk Scenarios in Autonomous Vehicle Safety Testing
Tianyi JiangMulti-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry
Sneha RoychowdhuryTowards a Standardised Framework for Explainable AI in Healthcare using Non-imaging Data: Integrating User-Centric Perspectives, Evaluation Metrics, and Contextual Expertise
Mingrui YeCan MLLMs be Students’ Art Mentor? A Multi-Dimensional Benchmark Towards Comprehensive Assessment and Pedagogical Feedback
Awais RaufEfficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning
Awais RaufBridging Domain Gaps in Specialized Fields: Multimodal Foundation Models for Sustainable Agriculture
Nasim Mohamed IsmailAddressing Systematic Bias in Multimodal Integration for Alzheimer’s Disease Classification
Noor Ul Ain ZahraAI-Driven Validation: Predicting NMR Spectra to Assess the Fidelity of AlphaFold Structures
Yutong SongProactive Multi-Agent Reinforcement Learning for Search and Rescue in Stochastic Ocean Environmen
L. M. Riza RizkyInterpretable Multimodal Machine Learning for Identifying Drug Treatment Response Biomarkers in Neuropathic Pain
Jiani ChenLearning-Based Wave Prediction
Haolin WangBenchmarking Band Gap Prediction For Semiconductor Materials Using Multimodal And Multi-Fidelity Data
Li ZhangIntegrating Heterogeneous Data Sources to Enhance Trading Strategies in Commodity Futures Markets
Xianyuan LiuGeometry-Aware Line Graph Transformer Pre-training for Molecular Property Prediction
Sina TabakhiMissing-Modality-Aware Graph Neural Network for Cancer Classification
Xianyuan LiuTowards Deployment-Centric Multimodal AI Beyond Vision and Language
Jiin Woei LeeInterpretable Multimodal AI for Predicting Early Biological Cell Responses to Biomaterial Implant Coatings
Zhongtian SunHybrid Framework for Lifelong Medical Imaging
Ziming LiuA City-Scale Multimodal Dataset and Benchmark Suite for AI-Driven Radio Resource Control in Wireless Networks
Luigi A. MorettiA Multimodal Affective Computing Pipeline for Correlating Physiological and Subjective Data Streams in Anxiety Disorders Management
Valentin DanchevEvaluation of Risks of Overreliance on AI Multimodal Models
Abdul Ghani ZahidPhysics-Guided Domain-Aware Deep Learning for Robust Wireless Modulation Classification
Gaoyun FangUnderstanding Multimodal Fusion through Cross-Modal Interaction
Halimat AfolabiExamining Modality‑Dependent Explanations and Reasoning Shifts in Closed Multimodal LLMs for Emotion Recognition
Daniel OnahBenchmarking Machine Learning Ensemble Algorithms for a Classification Task