V
LLM/Model

Vision-language encoder-decoder model