Zai Chat Completion
zai_chat_completionCreate a chat completion model that generates AI replies for given conversation messages.
How it works ↓Pricing
Per call
$0.02
Model
flat
Pay only for what you use. No subscriptions.
Inputs
max_tokens
integerdo_sample
booleanthinking
objecttools
arraytool_stream
booleantop_p
numberresponse_format
objectstop
arraystream
booleanuser_id
stringtemperature
numbermessages *
arraytool_choice
stringmodel *
stringrequest_id
stringInput Parameters
Cost per run
Execution cost$0.02
About Zai Chat Completion
Create a chat completion model that generates AI replies for given conversation messages. It supports multimodal inputs (text, images, audio, video, file), offers configurable parameters (like temperature, max tokens, tool use), and supports both streaming and non-streaming output modes.
Frequently Asked Questions
Create a chat completion model that generates AI replies for given conversation messages. It supports multimodal inputs (text, images, audio, video, file), offers configurable parameters (like temperature, max tokens, tool use), and supports both streaming and non-streaming output modes.
Zai Chat Completion costs $0.02 per call on xpay. No subscription, no minimums. Pay only for the calls you make. New accounts get $5 in free credits.
Connect the Z.ai API MCP endpoint to your client — Claude Code: claude mcp add --transport http zai "https://zai.mcp.xpay.sh/mcp?key=YOUR_XPAY_KEY"; Cursor/Windsurf/Cline/VS Code: same URL in mcp.json. The agent will see zai_chat_completion as a callable tool with the input schema and run it directly. (Unified across all providers: https://mcp.xpay.sh/mcp?key=YOUR_XPAY_KEY, then xpay_run with toolPath zai/zai_chat_completion.)
Yes — that's exactly what xpay is for. You don't need a Z.ai API account or API key. Sign up at xpay.tools (Google or email), get $5 free credit, and run Zai Chat Completion immediately. Billing flows through your xpay balance.
Zai Chat Completion accepts 15 input parameters: max_tokens, do_sample, thinking, tools, tool_stream, top_p…. See the input schema and runnable form on this page for details and to test live.

