NVIDIA Free AI Models Test: Only 3 Survive Peak Hours

Background

NVIDIA offers free AI model APIs via integrate.api.nvidia.com. But are they reliable during peak hours? I ran two rounds of tests to find out.

Model	Off-peak	Peak	Avg Response	Verdict
mistral-small-4-119b	3/3	2/3	0.69s	⭐ Fastest
nemotron-3-super-120b	3/3	3/3	10.1s	✅ Most stable
qwen3.5-122b	3/3	2/3	6.9s	✅ Usable
kimi-k2.5	2/3	0/3	Timeout	❌ Dead at peak
deepseek-v3.2	0/3	0/3	Timeout	❌ Always dead
glm-4.7	0/3	0/3	404	❌ Endpoint missing

Only 3 models survive peak hours:

The other three (Kimi, DeepSeek, GLM) are completely unusable during peak hours.