Jatevo model catalog

Serverless model cards for the routes behind Jatevo.

Browse every playground-ready and request-access model route with provider logos, pricing markers, deployment type, and capability tags in one place.

Browse Models

Find model routes by provider, capability, or deployment style.

6 models
DeepSeek
Reasoning

DeepSeek V4 Pro

A high-capacity reasoning route for agentic workflows, coding tasks, and production chat traffic through Jatevo.

Input
$1.74
Output
$3.48
Speed
Fast
ServerlessGlobalRequest access
Z.ai
Code

GLM 5.1

GLM 5.1 is Jatevo's dedicated Z.ai route for long-running software, agent, and tool-use sessions.

Input
$1.40
Output
$4.40
Context
200K
ServerlessAPACTry model
NVIDIA
NewReasoning

NVIDIA Nemotron 3 Ultra 550B A55B NVFP4

Nemotron 3 Ultra is a 550B hybrid MoE model from NVIDIA, optimized for demanding multi-agent AI and complex reasoning tasks.

Input
$0.60
Output
$3.60
Speed
59 Tok/s
Serverlessus-central1Try model
Jatevo Inference
NewCode

Kimi K2.7 Code

Kimi K2.7 Code runs on Jatevo Inference for long-context software work, agent execution, and production chat workloads.

Input
$0.75
Output
$3.50
Focus
Agentic
ServerlessGlobalTry model
Alibaba Cloud
Chat

Qwen 3.7 Max

A Qwen Max route for text-only enterprise workflows, exposed through Jatevo with scoped key enforcement.

Input
$1.25
Output
$3.75
Speed
Fast
ServerlessAPACTry model
Cerebras
Chat

Cerebras Fast Chat

A fast hosted chat route for prompt iteration, quick testing, and low-latency playground sessions.

Input
Low
Output
Low
Speed
Ultra-fast
ServerlessGlobalTry model