4M-21 offers a wide range of capabilities, from steerable multimodal generation and multimodal retrieval to robust performance on vision tasks.