Skill

openvla-oft

Vision, audio, and multimodal models including CLIP, Whisper, LLaVA, BLIP-2, Segment Anything, Stable Diffusion, AudioCraft, Cosmos Policy, OpenPI, and OpenVLA-OFT. Use when working with images, audio, multimodal tasks, or vision-language-action robot policies.

Installation

Add the marketplace

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs

Install plugins

/plugin

Run these commands in Claude Code to add this plugin to your environment. The marketplace must be added before you can install its plugins.

Details & Metadata

From Plugin

multimodal

View Plugin

From Marketplace

ai-research-skills

Primary

View Marketplace

Author

@Orchestra-Research

View GitHub Profile