The OCRAgent performs Object Character Recognition (OCR) Tasks.
OCRAgent
The OCRAgent accepts 1 optional argument:
OCRAgent(model(Optional))
The selected model. All supported OCRModel models can be found below:
OCRModel
Show Supported MultimodalLLMs
Supports gpt-4-turbo , gpt-4o .
gpt-4-turbo
gpt-4o
Supports claude-3-opus-20240229 , claude-3-haiku-20240307 , claude-3-sonnet-20240229 .
claude-3-opus-20240229
claude-3-haiku-20240307
claude-3-sonnet-20240229
Supports gemini-pro-vision .
gemini-pro-vision
Say we’re interested in extracting the license plate from this image:
OCRAgent(model=TextractModel())