model-pruning Guide

Name: model-pruning
Author: davila7

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.

27,615 starsby davila7

When to use model-pruning

How to use model-pruning

model-pruning is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.

Skill source

https://raw.githubusercontent.com/davila7/claude-code-templates/main/cli-tool/components/skills/ai-research/emerging-techniques-model-pruning/SKILL.md

Details

PlatformClaude

CategoryAI & ML

Invocationuser-invocable

Modelany

Maintainerdavila7

LicenseMIT

model-pruning Guide

When to use model-pruning

How to use model-pruning

Details

Resources