Fig. 1 | Yearly growth of multimodal AI preprints

Fig. 2 | Multimodal AI preprints by modality

Fig. 3 | Multimodal AI preprints by the number of combined modalities

Fig. 4 | Pairwise, triple, quadruple, and quintuple modality combinations

Fig. 5 | Modality pairs

Fig. 6 | Underexplored modality combinations

Citation

Liu, X., Zhang, J., Zhou, S. et al. Towards deployment-centric multimodal AI beyond vision and language. Nature Machine Intelligence 7, 1612-1624 (2025). https://doi.org/10.1038/s42256-025-01116-5

@article{liu2025towards,
  title={Towards deployment-centric multimodal AI beyond vision and language},
  author={Liu, Xianyuan and Zhang, Jiayang and Zhou, Shuo and van der Plas, Thijs L. and Vijayaraghavan, Avish and Grishina, Anastasiia and Zhuang, Mengdie and Schofield, Daniel and Tomlinson, Christopher and others},
  journal={Nature Machine Intelligence},
  volume={7},
  pages={1612--1624},
  year={2025},
  doi={10.1038/s42256-025-01116-5}
}