POST
/api/proxy/chat/completions
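Core
Calls an AI model for chat completion. Supports streaming and non-streaming responses, and automatically deducts the cost from the account balance.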
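Request parameters
| Parameter | Type | Required | Description |
| model | string | No | Model ID; defaults to cloud-llm |
| messages | array | Yes | List of conversation messages, each with a role and content |
| stream | boolean | No | Whether to stream the response; defaults to false |
| max_tokens | integer | No | Maximum number of tokens to generate; defaults to 512 |
| temperature | float | No | Sampling temperature, range (0, 2); defaults to 0.7 |
| top_p | float | No | Nucleus (top_p) sampling, range (0, 1] |
| frequency_penalty | float | No | Frequency penalty, range [-2, 2] |
| presence_penalty | float | No | Presence penalty, range [-2, 2] |
| stop | string/array | No | Stop sequence(s) |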
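The ranges in the table can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: `validate_params` is a hypothetical helper, and it assumes out-of-range values should be treated as errors, which the table itself does not state.

```python
def validate_params(params):
    """Return a list of problems found in a request body (empty if none).

    Hypothetical client-side helper; ranges are copied from the parameter table.
    """
    errors = []
    if not isinstance(params.get("messages"), list):
        errors.append("messages is required and must be a list")

    # (name, lower, upper, lower_inclusive, upper_inclusive)
    ranges = [
        ("temperature", 0, 2, False, False),
        ("top_p", 0, 1, False, True),
        ("frequency_penalty", -2, 2, True, True),
        ("presence_penalty", -2, 2, True, True),
    ]
    for name, lo, hi, lo_inc, hi_inc in ranges:
        if name not in params:
            continue  # optional parameter, left to the server default
        value = params[name]
        above = value >= lo if lo_inc else value > lo
        below = value <= hi if hi_inc else value < hi
        if not (above and below):
            errors.append(f"{name}={value} is outside the documented range")
    return errors


ok_body = {"messages": [{"role": "user", "content": "Hello"}], "temperature": 0.7, "top_p": 1}
bad_body = {"temperature": 2}  # no messages, and 2 is excluded by the open range (0, 2)
print(validate_params(ok_body))
print(validate_params(bad_body))
```

Returning a list of problems rather than raising on the first one lets a caller report every issue at once.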
Format of the messages parameter
[
  {"role": "system", "content": "You are a helpful assistant"},
  {"role": "user", "content": "Hello"},
  {"role": "assistant", "content": "Hello! How can I help you?"}
]
Response example (non-streaming)
{
  "success": true,
  "id": "chatcmpl-abc123",
  "model": "cloud-llm",
  "choices": [{
    "index": 0,
    "message": {"role": "assistant", "content": "Hello! I am an AI assistant..."},
    "finish_reason": "stop"
  }],
  "billing": {
    "input_tokens": 10,
    "output_tokens": 25,
    "total_tokens": 35,
    "cost": 0.00011,
    "balance": 99.99989,
    "currency": "CNY"
  }
}
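Reading the fields of a response body shaped like the example above might look like the following sketch. `summarize_response` is a hypothetical helper, and treating a falsy `success` flag as a failure is an assumption, since the error format is not documented in this section.

```python
import json


def summarize_response(body):
    """Extract the reply text and billing info from a non-streaming response dict."""
    if not body.get("success", False):
        # Assumption: a falsy "success" flag marks a failed call.
        raise ValueError("request was not successful")
    choice = body["choices"][0]
    billing = body["billing"]
    return {
        "reply": choice["message"]["content"],
        "finish_reason": choice["finish_reason"],
        "total_tokens": billing["total_tokens"],
        "cost": billing["cost"],
        "balance": billing["balance"],
        "currency": billing["currency"],
    }


# Sample body mirroring the documented response example.
sample = json.loads("""
{
  "success": true,
  "id": "chatcmpl-abc123",
  "model": "cloud-llm",
  "choices": [{
    "index": 0,
    "message": {"role": "assistant", "content": "Hello! I am an AI assistant..."},
    "finish_reason": "stop"
  }],
  "billing": {
    "input_tokens": 10, "output_tokens": 25, "total_tokens": 35,
    "cost": 0.00011, "balance": 99.99989, "currency": "CNY"
  }
}
""")
summary = summarize_response(sample)
print(summary["reply"], summary["cost"], summary["currency"])
```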
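Supported models
| Model ID | Model name | Input price | Output price |
| cloud-llm | DeepSeek-V3 | ¥1.00/million tokens | ¥4.00/million tokens |
| glm-4 | GLM-4 | ¥1.60/million tokens | ¥1.60/million tokens |
| glm-4-flash | GLM-4-Flash | ¥0.16/million tokens | ¥0.16/million tokens |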
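The prices above can be turned into a per-call estimate. The sketch below assumes cost is linear in the token counts, with no rounding or minimum charge; `estimate_cost` and `PRICES` are illustrative names, not part of the API. Under that assumption the numbers agree with the billing block in the response example: 10 input tokens and 25 output tokens on cloud-llm come to ¥0.00011.

```python
# Prices in CNY per million tokens, copied from the model table: (input, output).
PRICES = {
    "cloud-llm": (1.00, 4.00),
    "glm-4": (1.60, 1.60),
    "glm-4-flash": (0.16, 0.16),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Estimate the cost in CNY of one call from its token counts."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


cost = estimate_cost("cloud-llm", 10, 25)
print(f"{cost:.5f}")  # prints 0.00011
```

The `cost` field returned by the server remains the authoritative figure; an estimate like this is only useful for budgeting before a call.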
Example call (curl)
curl -X POST https://nebulai.top/api/proxy/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "cloud-llm",
    "messages": [{"role": "user", "content": "Hello"}],
    "temperature": 0.7,
    "max_tokens": 512
  }'
Example call (Python)
import requests

API_BASE = "https://nebulai.top/api/proxy"
API_KEY = "sk-your-api-key-here"  # replace with your own key

headers = {
    "Authorization": f"Bearer {API_KEY}",
    "Content-Type": "application/json",
}
data = {
    "model": "cloud-llm",
    "messages": [{"role": "user", "content": "Hello"}],
    "temperature": 0.7,
    "max_tokens": 1024,
}

response = requests.post(
    f"{API_BASE}/chat/completions",
    headers=headers,
    json=data,
    timeout=60,
)
response.raise_for_status()  # stop on HTTP errors before parsing the body
result = response.json()

print("AI reply:", result["choices"][0]["message"]["content"])
print("Cost:", result["billing"]["cost"])
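The endpoint also supports streaming via `stream`, but this section does not show the streamed format. The sketch below is therefore built on a loudly stated assumption: that chunks arrive as OpenAI-style server-sent events, that is, `data: {...}` lines carrying `choices[0].delta.content`, terminated by `data: [DONE]`. `iter_stream_content` is a hypothetical helper; verify the actual wire format before relying on it.

```python
import json


def iter_stream_content(lines):
    """Yield text fragments from an iterable of streamed lines.

    ASSUMPTION: OpenAI-style server-sent events; not confirmed by this document.
    """
    for raw in lines:
        line = raw.decode("utf-8") if isinstance(raw, bytes) else raw
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and other SSE fields
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # assumed end-of-stream marker
        chunk = json.loads(payload)
        choices = chunk.get("choices") or []
        if not choices:
            continue
        text = choices[0].get("delta", {}).get("content")
        if text:
            yield text


# Offline demonstration with hand-written chunks in the assumed format.
fake_stream = [
    b'data: {"choices": [{"index": 0, "delta": {"content": "Hel"}}]}',
    b"",
    b'data: {"choices": [{"index": 0, "delta": {"content": "lo"}}]}',
    b"data: [DONE]",
]
reply = "".join(iter_stream_content(fake_stream))
print(reply)  # prints Hello
```

With `requests`, this would be used by adding `"stream": True` to the request body, passing `stream=True` to `requests.post`, and feeding `response.iter_lines()` to the function.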