Fuyu-8B
A multimodal architecture for AI agents
Featured
111 Votes

Description
Fuyu-8B is a multimodal model capable of... 🖼️ Visual Question Answering 🖼️ Image Captioning 🖼️ Text localization and more!
Fuyu-8B is a multimodal model capable of... 🖼️ Visual Question Answering 🖼️ Image Captioning 🖼️ Text localization and more!