[转载自GitHub]free-llm-api-resources 免费大语言模型 API 资源大全

🎯 免费大语言模型 API 资源大全

项目地址：GitHub - free-llm-api-resources

转载说明：本文转载以便国内开发者查阅，部分免费 API 对国内访问不友好，可能需要科学上网。使用时请注意数据安全，敏感数据请勿上传！

📌 重要提示

⚠️ 请勿滥用：请合理使用这些免费服务，避免过度请求，以防服务被限制或关闭。

🚫 仅限合法服务：本列表不包含任何通过逆向工程等非正规途径获取的 API 服务。

📑 目录

一、完全免费的提供商

OpenRouter
Google AI Studio
NVIDIA NIM
Mistral (La Plateforme)
Mistral (Codestral)
HuggingFace Inference Providers
Vercel AI Gateway
OpenCode Zen
Cerebras
Groq
Cohere
GitHub Models
Cloudflare Workers AI

二、提供试用额度的提供商
Fireworks
Baseten
Nebius
Novita
AI21
Upstage
NLP Cloud
Alibaba Cloud (International) Model Studio
Modal
Inference.net
Hyperbolic
SambaNova Cloud
Scaleway Generative APIs

一、完全免费的提供商

OpenRouter

🔗 官网：https://openrouter.ai
📊 使用限制：
20 次/分钟
50 次/天
充值 $10 后可提升至 1000 次/天
所有模型共享配额
🤖 可用模型：
Gemma 3 系列：12B Instruct / 27B Instruct / 4B Instruct
Hermes 3 Llama 3.1 405B
Llama 系列：3.2 3B Instruct / 3.3 70B Instruct
Mistral Small 3.1 24B Instruct
Qwen 系列：Qwen3-4B / Qwen3-Coder / Qwen3-Next-80B

其他：Dolphin、Nemotron、GLM-4.5-Air 等 20+ 模型

Google AI Studio

🔗 官网：https://aistudio.google.com ⚠️ 数据政策：在英国/瑞士/欧洲经济区以外使用时,数据可能用于训练 📊 模型限制详情：	模型名称	Token限制
Gemini 3 Flash	250,000/分钟	20次/天, 5次/分钟
Gemini 3.1 Flash-Lite	250,000/分钟	500次/天, 15次/分钟
Gemini 2.5 Flash	250,000/分钟	20次/天, 5次/分钟
Gemini 2.5 Flash-Lite	250,000/分钟	20次/天, 10次/分钟
Gemma 3 27B Instruct	15,000/分钟	14,400次/天, 30次/分钟
Gemma 3 12B Instruct	15,000/分钟	14,400次/天, 30次/分钟
Gemma 3 4B Instruct	15,000/分钟	14,400次/天, 30次/分钟
Gemma 3 1B Instruct	15,000/分钟	14,400次/天, 30次/分钟

NVIDIA NIM

🔗 官网：https://build.nvidia.com/explore/discover
📋 使用要求：

✅ 需要手机号验证
⚠️ 模型通常受上下文窗口限制
📊 使用限制：40 次/分钟
🤖 可用模型：多种开源模型

Mistral (La Plateforme)

🔗 官网：https://console.mistral.ai
📋 使用要求：
免费套餐需要同意数据训练
需要手机号验证
📊 使用限制（每个模型）：
1 次/秒
500,000 tokens/分钟
1,000,000,000 tokens/月
🤖 可用模型：Mistral 开源和专有模型

Mistral (Codestral)

🔗 官网：https://codestral.mistral.ai
📋 使用说明：
✅ 目前免费使用
基于月度订阅
需要手机号验证
📊 使用限制：
30 次/分钟
2,000 次/天
🤖 可用模型：Codestral（代码生成专用）

HuggingFace Inference Providers

🔗 官网：HuggingFace 文档
📋 使用说明：
Serverless Inference 限制为小于 10GB 的模型
部分热门模型即使超过 10GB 也支持
📊 使用限制：$0.10/月额度
🤖 可用模型：支持提供商的各种开源模型

Vercel AI Gateway

🔗 官网：Vercel AI Gateway
📋 功能特点：路由到多个支持的提供商
📊 使用限制：$5/月

OpenCode Zen

🔗 官网：https://opencode.ai/docs/zen/
📋 功能特点：
提供 AI 网关和精选模型
免费模型可能使用数据进行改进
🤖 可用模型：
Big Pickle Stealth
MiniMax M2.5 Free

Arcee Large Preview Free

Cerebras

🔗 官网：https://cloud.cerebras.ai/ 📊 模型限制详情：	模型名称	Token限制	请求限制
gpt-oss-120b	60,000/分钟<br>1,000,000/小时<br>1,000,000/天	30次/分钟<br>900次/小时<br>14,400次/天
Llama 3.1 8B	60,000/分钟<br>1,000,000/小时<br>1,000,000/天	30次/分钟<br>900次/小时<br>14,400次/天

Groq

🔗 官网：https://console.groq.com 📊 模型限制详情：	模型名称	Token限制
Allam 2 7B	6,000/分钟	7,000次/天
Llama 3.1 8B	6,000/分钟	14,400次/天
Llama 3.3 70B	12,000/分钟	1,000次/天
Llama 4 Maverick 17B	6,000/分钟	1,000次/天
Llama 4 Scout Instruct	30,000/分钟	1,000次/天
Whisper Large v3/v3 Turbo	7,200音频秒/分钟	2,000次/天
Qwen3-32B	6,000/分钟	1,000次/天

Cohere

🔗 官网：https://cohere.com
📊 使用限制：

20 次/分钟
1,000 次/月
所有模型共享月度配额
🤖 可用模型：
Command 系列：Command-A、Command-R、Command-R+
Aya 系列：Aya-Expanse-32B、Aya-Vision-32B
Tiny-Aya 系列：Earth、Fire、Global、Water

GitHub Models

🔗 官网：GitHub Marketplace Models
📋 使用说明：输入/输出 Token 限制极其严格
📊 使用限制：根据 Copilot 订阅等级确定
🤖 可用模型（部分精选）：
OpenAI 系列：GPT-5、GPT-4o、O1、O3、O4-mini
Llama 系列：Llama-3.3-70B、Llama-4 Maverick、Llama-4 Scout
DeepSeek 系列：DeepSeek-R1、DeepSeek-V3
其他：Mistral Medium 3、Phi-4、Grok 3 等

Cloudflare Workers AI

🔗 官网：Cloudflare Workers AI
📊 使用限制：10,000 neurons/天
🤖 可用模型（部分精选）：
Llama 系列：3.1 8B / 3.2 Vision / 3.3 70B / 4 Scout
DeepSeek 系列：R1 Distill Qwen 32B / Coder 6.7B
Gemma 系列：3 12B Instruct / 7B Instruct
其他：Mistral 7B、Qwen 2.5 Coder、GLM-4.7-Flash 等

二、提供试用额度的提供商

Fireworks

🔗 官网：https://fireworks.ai/
💰 试用额度：$1
🤖 可用模型：多种开源模型

Baseten

🔗 官网：https://app.baseten.co/
💰 试用额度：$30
🤖 可用模型：支持模型库，按计算时间计费

Nebius

🔗 官网：https://tokenfactory.nebius.com/
💰 试用额度：$1
🤖 可用模型：多种开源模型

Novita

🔗 官网：https://novita.ai/
💰 试用额度：$0.5（有效期 1 年）
🤖 可用模型：多种开源模型

AI21

🔗 官网：https://studio.ai21.com/
💰 试用额度：$10（有效期 3 个月）
🤖 可用模型：Jamba 系列模型

Upstage

🔗 官网：https://console.upstage.ai/
💰 试用额度：$10（有效期 3 个月）
🤖 可用模型：Solar Pro/Mini

NLP Cloud

🔗 官网：https://nlpcloud.com/home
💰 试用额度：$15
📋 使用要求：需要手机号验证
🤖 可用模型：多种开源模型

Alibaba Cloud (International) Model Studio

🔗 官网：阿里云模型工作室
💰 试用额度：每个模型 100 万 tokens
🤖 可用模型：多种开源和专有 Qwen 模型

Modal

🔗 官网：https://modal.com
💰 试用额度：
注册即获 $5/月
添加支付方式后提升至 $30/月
计费方式：按计算时间计费

Inference.net

🔗 官网：https://inference.net
💰 试用额度：
$1 基础额度
回复邮件调查可获得 $25
可用模型：多种开源模型

Hyperbolic

🔗 官网：https://app.hyperbolic.ai/
💰 试用额度：$1
🤖 可用模型：
DeepSeek 系列：V3、V3.1、R1
Llama 系列：3.1 405B、3.3 70B、3.2 3B
Qwen 系列：QwQ 32B、Qwen2.5 72B、Qwen3 系列
其他：Pixtral 12B、GPT-OSS 系列

SambaNova Cloud

🔗 官网：https://cloud.sambanova.ai/
💰 试用额度：$5（有效期 3 个月）
🤖 可用模型：
Llama 系列：3.1 8B、3.3 70B、4 Maverick
DeepSeek 系列：V3.1、V3.2、R1
Qwen 系列：Qwen3-235B、Qwen3-32B
其他：Mistral、Whisper、GPT-OSS 等

Scaleway Generative APIs

🔗 官网：Scaleway Generative API
💰 试用额度：1,000,000 free tokens
🤖 可用模型：
Llama 系列：3.1 8B、3.3 70B
DeepSeek R1 Distill Llama 70B
Gemma 3 27B Instruct
Qwen3 系列：235B-A22B、Coder-30B
其他：Mistral Nemo、Pixtral、Whisper 等

💡 使用建议
1. 根据需求选择：不同平台的模型和限制各异,请根据实际需求选择
2. 注意数据安全：切勿上传敏感或私密数据
3. 合理使用：避免滥用免费资源,共同维护良好的生态环境
4. 及时关注更新：各平台政策和限制可能随时调整,建议定期查看官方文档
  
  最后更新时间：2026年2月
  原文来源：GitHub - free-llm-api-resources

[转载自GitHub]free-llm-api-resources 免费大语言模型 API 资源大全

🎯 免费大语言模型 API 资源大全

📌 重要提示

📑 目录

一、完全免费的提供商

二、提供试用额度的提供商

Scaleway Generative APIs

一、完全免费的提供商

OpenRouter

其他：Dolphin、Nemotron、GLM-4.5-Air 等 20+ 模型

Google AI Studio

NVIDIA NIM

⚠️ 模型通常受上下文窗口限制 📊 使用限制：40 次/分钟 🤖 可用模型：多种开源模型

Mistral (La Plateforme)

1,000,000,000 tokens/月 🤖 可用模型：Mistral 开源和专有模型

Mistral (Codestral)

2,000 次/天 🤖 可用模型：Codestral（代码生成专用）

HuggingFace Inference Providers

部分热门模型即使超过 10GB 也支持 📊 使用限制：$0.10/月额度 🤖 可用模型：支持提供商的各种开源模型

Vercel AI Gateway

🔗 官网：Vercel AI Gateway 📋 功能特点：路由到多个支持的提供商 📊 使用限制：$5/月

OpenCode Zen

Arcee Large Preview Free

Cerebras

Groq

Cohere

Tiny-Aya 系列：Earth、Fire、Global、Water

GitHub Models

其他：Mistral Medium 3、Phi-4、Grok 3 等

Cloudflare Workers AI

其他：Mistral 7B、Qwen 2.5 Coder、GLM-4.7-Flash 等

二、提供试用额度的提供商

Fireworks

🔗 官网：https://fireworks.ai/ 💰 试用额度：$1 🤖 可用模型：多种开源模型

Baseten

🔗 官网：https://app.baseten.co/ 💰 试用额度：$30 🤖 可用模型：支持模型库，按计算时间计费

Nebius

🔗 官网：https://tokenfactory.nebius.com/ 💰 试用额度：$1 🤖 可用模型：多种开源模型

Novita

🔗 官网：https://novita.ai/ 💰 试用额度：$0.5（有效期 1 年） 🤖 可用模型：多种开源模型

AI21

🔗 官网：https://studio.ai21.com/ 💰 试用额度：$10（有效期 3 个月） 🤖 可用模型：Jamba 系列模型

Upstage

🔗 官网：https://console.upstage.ai/ 💰 试用额度：$10（有效期 3 个月） 🤖 可用模型：Solar Pro/Mini

NLP Cloud

🔗 官网：https://nlpcloud.com/home 💰 试用额度：$15 📋 使用要求：需要手机号验证 🤖 可用模型：多种开源模型

Alibaba Cloud (International) Model Studio

🔗 官网：阿里云模型工作室 💰 试用额度：每个模型 100 万 tokens 🤖 可用模型：多种开源和专有 Qwen 模型

Modal

计费方式：按计算时间计费

Inference.net

可用模型：多种开源模型

Hyperbolic

其他：Pixtral 12B、GPT-OSS 系列

SambaNova Cloud

其他：Mistral、Whisper、GPT-OSS 等

Scaleway Generative APIs

其他：Mistral Nemo、Pixtral、Whisper 等

💡 使用建议

及时关注更新：各平台政策和限制可能随时调整,建议定期查看官方文档

关于升产大队

⚠️ 模型通常受上下文窗口限制
📊 使用限制：40 次/分钟
🤖 可用模型：多种开源模型

1,000,000,000 tokens/月
🤖 可用模型：Mistral 开源和专有模型

2,000 次/天
🤖 可用模型：Codestral（代码生成专用）

部分热门模型即使超过 10GB 也支持
📊 使用限制：$0.10/月额度
🤖 可用模型：支持提供商的各种开源模型

🔗 官网：Vercel AI Gateway
📋 功能特点：路由到多个支持的提供商
📊 使用限制：$5/月

🔗 官网：https://fireworks.ai/
💰 试用额度：$1
🤖 可用模型：多种开源模型

🔗 官网：https://app.baseten.co/
💰 试用额度：$30
🤖 可用模型：支持模型库，按计算时间计费

🔗 官网：https://tokenfactory.nebius.com/
💰 试用额度：$1
🤖 可用模型：多种开源模型

🔗 官网：https://novita.ai/
💰 试用额度：$0.5（有效期 1 年）
🤖 可用模型：多种开源模型

🔗 官网：https://studio.ai21.com/
💰 试用额度：$10（有效期 3 个月）
🤖 可用模型：Jamba 系列模型

🔗 官网：https://console.upstage.ai/
💰 试用额度：$10（有效期 3 个月）
🤖 可用模型：Solar Pro/Mini

🔗 官网：https://nlpcloud.com/home
💰 试用额度：$15
📋 使用要求：需要手机号验证
🤖 可用模型：多种开源模型

🔗 官网：阿里云模型工作室
💰 试用额度：每个模型 100 万 tokens
🤖 可用模型：多种开源和专有 Qwen 模型