WebMarker

    Mark web pages for use with vision-language models

    Featured
    8 Votes
    WebMarker media 1
    WebMarker media 2
    WebMarker media 3
    WebMarker media 4

    Description

    WebMarker adds visual markings with labels to elements on a web page. This can be used for Set-of-Mark prompting, which improves visual grounding abilities of vision-language models such as GPT-4o, Claude 3.5, and Google Gemini 1.5.

    Categories

    Recommended Products