llava Guide

Name: llava
Author: zechenzhangagi

LLaVA (Large Language and Vision Assistant) enables visual instruction tuning and image-based conversations by combining a CLIP vision encoder with Vicuna/LLaMA language models. It supports multi-turn image chat, visual question answering, and instruction following. Use it for vision-language chatbots or image understanding tasks; it is best suited to conversational image analysis.

8,991 stars by zechenzhangagi

When to use llava

How to use llava

llava is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below. Once added, it is a user-invocable skill that activates when your task matches its description.

Skill source

https://raw.githubusercontent.com/zechenzhangagi/ai-research-skills/main/18-multimodal/llava/SKILL.md
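
As a rough illustration of the "add it to your Claude environment" step, the sketch below downloads the SKILL.md file from the URL above and writes it into a per-skill folder. The helper names `fetch_skill` and `install_skill`, the `skills_root` parameter, and the `<skills_root>/llava/SKILL.md` layout are assumptions made for this example, not something the listing specifies. Check where your Claude environment expects skills before relying on it.

```python
from pathlib import Path
from urllib.request import urlopen

# URL taken from the "Skill source" entry above.
SKILL_URL = (
    "https://raw.githubusercontent.com/zechenzhangagi/"
    "ai-research-skills/main/18-multimodal/llava/SKILL.md"
)


def fetch_skill(url: str = SKILL_URL, timeout: float = 30.0) -> str:
    """Download the SKILL.md text (hypothetical helper, needs network access)."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def install_skill(text: str, skills_root: Path, name: str = "llava") -> Path:
    """Write the skill text to <skills_root>/<name>/SKILL.md and return that path.

    The directory layout is an assumption for illustration only.
    """
    target_dir = skills_root / name
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / "SKILL.md"
    target.write_text(text, encoding="utf-8")
    return target
```

For example, `install_skill(fetch_skill(), Path.home() / ".claude" / "skills")` would place the file under a personal skills folder, assuming that is where your setup looks for skills. The download and the write are kept separate so the write step can be tried with any text and any directory.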

Details

Platform: Claude
Category: Code & Development
Invocation: user-invocable
Model: any
Maintainer: zechenzhangagi
License: MIT
