The DenseCaptioningAgent
generates detailed descriptions of images.
DenseCaptioningAgent
is initialized with 1 optional argument:
MultimodalLLM
models can be found below:DenseCaptioningAgent
designed for a Workflow to detect if workers are wearing personal protective equipment (PPE).