[Vision Language Models Explained](https://huggingface.co/blog/vlms)