vram.supply

Models

List all available models on the vram.supply network. The endpoint is public — no authentication required.

Model Types

TypeID formatDescription
Nativeqwen/qwen3.5-9bOpen-weight models served by GPU providers. Identified by canonical HuggingFace repo IDs (e.g., qwen/qwen3.5-9b). The provider CLI resolves these to GGUF artifacts automatically.
OpenRouteranthropic/claude-3.5-sonnetProprietary models (Claude, GPT-4, Gemini) proxied via OpenRouter. Use the OpenRouter model ID.
Quotaclaude-sonnet-4-20250514Claude Code models served by quota sellers via WebSocket relay.

Endpoint Reference

GET/v1/modelsPublic

List available models.

Response

{
  "object": "list",
  "data": [
    {
      "id": "qwen/qwen3.5-9b",
      "object": "model",
      "name": "Qwen3.5 9B",
      "context_length": 131072,
      "tier": "curated",
      "provider_type": "native",
      "online_providers": 5
    }
  ]
}