GET /api/v1/health
Public liveness probe. No authentication required.
Both GET and HEAD are supported (HEAD for monitoring tools that default
to it, like Uptime Robot).
Request
Response — 200 OK
Field reference
| Field | Type | Description |
|---|---|---|
status | string | "healthy", "degraded", or "unavailable" |
version | string | Semantic version of the API |
uptime_seconds | float | Seconds since process start |
fhe_engine | string | Engine name and parameter regime |
gpu_available | bool | Server-side GPU presence (cloud API: always false) |
gpu_name | string | null | GPU model name when present |
active_sessions | int | Current TFHE sessions |
active_jobs | int | Current async compute jobs |
ciphertext_cache_usage_pct | float | null | LRU cache fill ratio |
ciphertext_cache_warning | bool | true if cache eviction is imminent |
dependencies | array | Per-dependency status |
Dependency status
| Field | Type | Description |
|---|---|---|
name | string | "redis", "postgres", "stripe" |
status | string | "healthy", "degraded", "unavailable" |
latency_ms | float | null | Probe round-trip time |
error | string | null | Short error message if unavailable |
degraded means the dependency
responded but slower than the threshold (1 s for Redis, 500 ms for Postgres).
Response — 503 Service Unavailable
When at least one critical dependency isunavailable:
GET /api/v1/models
List available FHE-LLM inference models. Public, no auth required.
Request
Response — 200 OK
available: false means the model exists but isn’t currently loaded — it
will be lazy-loaded on first inference (expect ~30 s extra latency).
GET /api/v1/tfhe/pricing
Public. Returns the TFHE gate pricing table.
Request
Response — 200 OK
Monitoring integrations
Uptime Robot
Use HTTP keyword monitor (free plan uses HEAD by default — both work):- URL:
https://api.wavis.xyz/api/v1/health - Keyword:
"status":"healthy" - Interval: 5 minutes
Pingdom / DataDog Synthetics
Kubernetes liveness probe
Reading the JSON in shell
Next Steps
Status page
Public uptime history
Billing API
Quota and budget endpoints