inference-serving

Covers production LLM inference with vLLM, TensorRT-LLM, llama.cpp, and SGLang. Use it when deploying models for production inference.

Installation

1. Add the marketplace

/plugin marketplace add tianhao909/AI-Research-SKILLs-cn

2. Install plugins

/plugin

Run these commands in Claude Code to add this plugin to your environment. The marketplace must be added before you can install its plugins.
