Fuyu-8B

    A multimodal architecture for AI agents

    Featured
    111 Votes
    Fuyu-8B - A multimodal architecture for AI agents media 1

    Description

    Fuyu-8B is a multimodal model capable of... 🖼️ Visual Question Answering 🖼️ Image Captioning 🖼️ Text localization and more!

    Recommended Products