Introduction to GPUniq
GPU Meta-Cloud platform — rent GPUs, deploy containers, manage persistent storage, and access LLM models through a unified API and Python SDK.
Overview
GPUniq is a GPU Meta-Cloud platform that aggregates compute capacity from multiple providers worldwide. Access thousands of GPUs through three deployment modes, manage persistent storage volumes, and use LLM models — all through a single API and Python SDK.
Save up to 70% compared to traditional cloud providers. No upfront commitments, pay only for what you use.
GPU Compute
GPUniq offers three ways to rent GPU compute, each suited for different use cases.
GPU Marketplace
Browse 5000+ GPU servers. Pick the exact machine, compare specs, and rent with full control.
GPU Dex-Cloud
Deploy by GPU type. Pick a model, click deploy — the platform finds the best server automatically.
GPU Burst
Scale to 100 GPUs with automatic fallback. Best for distributed training and batch inference.
Platform Features
Persistent Volumes
Storage that survives instance restarts. Upload data, attach to any instance.
LLM API
Claude, GPT-5, Gemini, Grok, and open-source models — one API key. OpenAI-compatible endpoint works with Claude Code, Cursor, and LiteLLM.
Python SDK
Full-featured Python SDK. pip install GPUniq and control everything programmatically.
CLI (gg)
Checkpoint and replay commands on GPU instances. Survive restarts without losing progress.
Quick Start
Create Account
Sign up at gpuniq.com with email or Google account.
Get API Key
Go to LLM API Keys in your dashboard and create a key. It starts with gpuniq_.
Install SDK
pip install GPUniq
Deploy a GPU
from gpuniq import GPUniq
client = GPUniq(api_key="gpuniq_your_key")
# Deploy an RTX 4090 instance
deploy = client.gpu_cloud.deploy(
gpu_name="RTX_4090",
docker_image="pytorch/pytorch:latest",
)
Authentication
GPUniq supports two authentication methods:
- API Key (recommended for SDK and automation): Pass via
X-API-Keyheader - JWT Token (web dashboard): Pass via
Authorization: Bearerheader
from gpuniq import GPUniq
client = GPUniq(api_key="gpuniq_your_key")
curl https://api.gpuniq.com/v1/marketplace/list \
-H "X-API-Key: gpuniq_your_key"
Rate limit: 120 requests/minute per API key. The SDK retries automatically on rate limit.
Next Steps
Last updated today
Built with Documentation.AI