Overview
QwenVL is one of the most advanced multimodal LLMs that you can run locally! In order to comfortably run QwenVL itβs reccomended that you have at least 24GB of VRAM.QwenVL(model: str)
The 
QwenVL class encapsulates the functionality of the QwenVL models, providing methods for loading the model, detecting objects based on textual descriptions, managing resources, and generating responses based on prompts. It supports multiple model types including base, int4, fp16, and bf16, each optimized for different hardware configurations.Specifies which version of the model to load. Valid options are 
"base", "int4", "fp16", and "bf16". The "int4" model requires additional dependencies and setup for quantization support.You have to install special dependencies to use the 
"int4" quantized version of the model, but this does allow you to run the model with only 11GB VRAM.
These instructions are outling in Extra Installation