QUESTIONS WE HEAR A LOT
Pricing — the honest answers.
Can I start on Cloud and move to our own GPUs later?
Yes. It's the same runtime. Same OpenAI v1-compatible API. The model catalogue overlaps heavily. Most of our enterprise customers started with a Cloud trial, validated performance, then moved the production workload to their own infrastructure under ARK Tailored or ARK Core.
Why don't you publish a per-GPU price for Tailored and Core?
Because it's not a transaction. Deployments differ by modality mix, model catalogue size, support tier, integration work, and regional context. Publishing a single number would either underquote one customer or overquote another. We'd rather have a 30-minute discovery call and give you a real number.
What counts as "a GPU" in the per-GPU license?
Any physical accelerator attached to a Compute Node and used to serve inference under ARK. We don't meter per vGPU, per MIG partition, or per tensor core — just the physical units you put under ARK's control. Mixed generations (3060 to H100) count as one GPU each.
Do the per-GPU numbers include modalities beyond text?
Text generation is always included. Additional modalities (image, vision, embeddings, speech) are modular add-ons at a fixed per-GPU per-month rate. See the
Enterprise pricing page for the framework.
Is installation and configuration included in the license?
The license covers the software. ARK Core can be self-deployed from documentation at no additional fee, with an optional Install Guidance package if you want a light touch. ARK Tailored is delivered by ARK engineers under a Standard or Full Managed Deployment package. You only pay for the level of hand-holding you actually need.
Will ARK Core ever be fully self-service?
Yes — that's the roadmap. ARK Core is moving toward a signed installer that provisions the runtime, registers a license key, and activates against a per-GPU count with no ARK engineers in the loop. Until it ships, Core is available today with documentation-only self-deploy or an optional guidance package.
What does the SLA cover?
On Cloud, ARK operates the entire stack and is accountable for availability and latency on shared endpoints. On Tailored and Core, ARK's SLA covers software defect response times and resolution on ARK components. Infrastructure uptime is the customer's (or their infrastructure partner's) responsibility. Support tier is determined by GPU license count and is the same software, same update cycle — no feature gating.