LLaVA-Mini

LLaVA-Mini:Efficient Image and Video Large Multimodal Models

Vorgestellt

5 Stimmen

Beschreibung

LLaVA-Mini👏is an efficient LMM for image/video understanding using 1 vision token, offering: (1)⏩fast response (40ms per image) (2)🖥️less VRAM usage (support 3-hour video understanding on 24GB GPU).

Kategorien

Code-Editoren Git-Clients

LLaVA-Mini

LLaVA-Mini:Efficient Image and Video Large Multimodal Models

Beschreibung

Kategorien

Tags

Empfohlene Produkte