We're thrilled to introduce Florence-2, a pioneering vision foundation model designed to excel in diverse computer vision and vision-language tasks. Using a unified, prompt-based approach, Florence-2 handles everything from captioning to object detection with simple text instructions. Trained on FLD-5B, a dataset boasting 5.4 billion annotations across 126 million images, it sets new standards in zero-shot and fine-tuning capabilities.
Explore how Florence-2 is revolutionising the field!