xpay tools logo
  • Tools
  • Collections
  • Skills
  • Docs
  • xpay.sh
Tool Skill
Baseten Model APIs logo

Baseten Chat Completions

baseten_chat_completions

Create a chat completion using OpenAI-compatible API. **Supported Models:** - `deepseek-ai/DeepSeek-V3-0324` - DeepSeek V3 0324 (164k context) 🧠 - `deepseek-ai/DeepSeek-V3.1` - DeepSeek V3.1 (164k context) 🧠 - `zai-org/GLM-4.6` - GLM 4.6 (200k context) 🧠 - `zai-org/GLM-4.7` - GLM 4.7 (200k context) 🧠 - `moonshotai/Kimi-K2-Instruct-0905` - Kimi K2 0905 (128k context) - `moonshotai/Kimi-K2-Thinking` - Kimi K2 Thinking (262k context) 🧠 always-on - `moonshotai/Kimi-K2.5` - Kimi K2.5 (262k context) - `openai/gpt-oss-120b` - OpenAI GPT OSS 120B (128k context) 🧠 = Reasoning model. Use `reasoning_effort` param (low/medium/high) to control thinking depth. Response includes `reasoning_content` field with chain-of-thought. Supports streaming, tool calling, structured outputs.

$0.01/call
Flat rate
Baseten Model APIs
Raw SKILL.mdTool Page

When to Use

Use this tool when you need to create a chat completion using openai-compatible api.

supported models:

  • deepseek-ai/deepseek-v3-0324 - deepseek v3 0324 (164k context) 🧠
  • deepseek-ai/deepseek-v3.1 - deepseek v3.1 (164k context) 🧠
  • zai-org/glm-4.6 - glm 4.6 (200k context) 🧠
  • zai-org/glm-4.7 - glm 4.7 (200k context) 🧠
  • moonshotai/kimi-k2-instruct-0905 - kimi k2 0905 (128k context)
  • moonshotai/kimi-k2-thinking - kimi k2 thinking (262k context) 🧠 always-on
  • moonshotai/kimi-k2.5 - kimi k2.5 (262k context)
  • openai/gpt-oss-120b - openai gpt oss 120b (128k context)

🧠 = reasoning model. use reasoning_effort param (low/medium/high) to control thinking depth. response includes reasoning_content field with chain-of-thought.

supports streaming, tool calling, structured outputs.. This is part of the Baseten Model APIs provider on xpay✦.

MCP Connection

Connect to xpay✦ to access this tool (and 0+ others):

{
  "mcpServers": {
    "xpay": {
      "url": "https://mcp.xpay.sh/mcp?key=YOUR_API_KEY"
    }
  }
}

For Claude Code:

claude mcp add --transport http xpay "https://mcp.xpay.sh/mcp?key=YOUR_API_KEY"

How to Execute

Use the xpay✦ meta-tools to run this tool:

  1. xpay_details — Get full input schema: xpay_details("baseten/baseten_chat_completions")
  2. xpay_run — Execute: xpay_run("baseten/baseten_chat_completions", { ...inputs })

Input Parameters

ParameterTypeRequiredDescription
top_logprobsnumberNoTop logprobs to return (0-20)
reasoning_effortstringNoReasoning depth for supported models (low/medium/high). Default: medium. Supported on: DeepSeek V3.1, DeepSeek V3 0324, GLM 4.7, GLM 4.6, Kimi K2 Thinking
logit_biasobjectNoToken ID to bias map (-100 to 100)
seednumberNoRandom seed
badstringNoWords to avoid
skip_special_tokensbooleanNoRemove special tokens
documentsstringNoDocuments for RAG
presence_penaltynumberNoPenalize by presence
echobooleanNoPrepend last message to output
top_p_minnumberNoMin dynamic top_p
early_stoppingbooleanNoStop when n candidates found
toolsstringNoFunctions model can call
logprobsbooleanNoReturn log probabilities
top_pnumberNoNucleus sampling 0-1
frequency_penaltynumberNoPenalize tokens by frequency (default: 0)
response_formatobjectNoResponse format type
truncate_prompt_tokensnumberNoTruncate prompt to N tokens
best_ofnumberNoCandidates to generate (only 1)
streambooleanNoStream responses
top_knumberNoTop-K sampling
disaggregated_paramsobjectNoAdvanced distributed inference params
temperaturenumberNoSampling temperature 0-4
tool_choicestringNoTool calling mode
modelstringYesModel slug (e.g., deepseek-ai/DeepSeek-V3.1)
ignore_eosbooleanNoContinue past EOS token
chat_templatestringNoCustom Jinja template
max_tokensnumberNoMax tokens (default: 4096)
add_generation_promptbooleanNoAdd generation prompt from template
nnumberNoNumber of completions (only 1)
min_tokensnumberNoMinimum tokens before stopping
min_pnumberNoMin probability threshold
spaces_between_special_tokensbooleanNoAdd spaces between special tokens
chat_template_argsobjectNoChat template arguments
stopstringNoStop sequences
parallel_tool_callsbooleanNoParallel tool calls
include_stop_str_in_outputbooleanNoInclude stop string in output
messagesstringYesConversation messages with role and content
bad_token_idsstringNoToken IDs to avoid
stream_optionsobjectNoStream options
userstringNoEnd-user identifier
repetition_penaltynumberNoRepetition penalty
length_penaltynumberNoLength penalty for beam search
stop_token_idsstringNoToken IDs that stop generation
add_special_tokensbooleanNoAdd special tokens like BOS

Pricing

  • Cost: $0.01/call
  • Balance check: Use xpay_balance to check remaining credits
  • Get your API key at xpay.tools — $5 free credits included

Related Skills

  • Baseten Model APIs (all tools) — 1 tools

Links

  • Tool page: https://xpay.tools/baseten/baseten-chat-completions/
  • Provider: https://xpay.tools/baseten/
  • All tools: https://xpay.tools/explore
How to Execute
// 1. Get full schema
xpay_details("baseten/baseten_chat_completions")
// 2. Execute
xpay_run("baseten/baseten_chat_completions", { ...inputs })
Input Parameters
ParameterTypeRequiredDescription

top_logprobs

numberNo

Top logprobs to return (0-20)

reasoning_effort

stringNo

Reasoning depth for supported models (low/medium/high). Default: medium. Supported on: DeepSeek V3.1, DeepSeek V3 0324, GLM 4.7, GLM 4.6, Kimi K2 Thinking

logit_bias

objectNo

Token ID to bias map (-100 to 100)

seed

numberNo

Random seed

bad

stringNo

Words to avoid

skip_special_tokens

booleanNo

Remove special tokens

documents

stringNo

Documents for RAG

presence_penalty

numberNo

Penalize by presence

echo

booleanNo

Prepend last message to output

top_p_min

numberNo

Min dynamic top_p

early_stopping

booleanNo

Stop when n candidates found

tools

stringNo

Functions model can call

logprobs

booleanNo

Return log probabilities

top_p

numberNo

Nucleus sampling 0-1

frequency_penalty

numberNo

Penalize tokens by frequency (default: 0)

response_format

objectNo

Response format type

truncate_prompt_tokens

numberNo

Truncate prompt to N tokens

best_of

numberNo

Candidates to generate (only 1)

stream

booleanNo

Stream responses

top_k

numberNo

Top-K sampling

disaggregated_params

objectNo

Advanced distributed inference params

temperature

numberNo

Sampling temperature 0-4

tool_choice

stringNo

Tool calling mode

model

string
Yes

Model slug (e.g., deepseek-ai/DeepSeek-V3.1)

ignore_eos

booleanNo

Continue past EOS token

chat_template

stringNo

Custom Jinja template

max_tokens

numberNo

Max tokens (default: 4096)

add_generation_prompt

booleanNo

Add generation prompt from template

n

numberNo

Number of completions (only 1)

min_tokens

numberNo

Minimum tokens before stopping

min_p

numberNo

Min probability threshold

spaces_between_special_tokens

booleanNo

Add spaces between special tokens

chat_template_args

objectNo

Chat template arguments

stop

stringNo

Stop sequences

parallel_tool_calls

booleanNo

Parallel tool calls

include_stop_str_in_output

booleanNo

Include stop string in output

messages

string
Yes

Conversation messages with role and content

bad_token_ids

stringNo

Token IDs to avoid

stream_options

objectNo

Stream options

user

stringNo

End-user identifier

repetition_penalty

numberNo

Repetition penalty

length_penalty

numberNo

Length penalty for beam search

stop_token_ids

stringNo

Token IDs that stop generation

add_special_tokens

booleanNo

Add special tokens like BOS

Install Skill
Claude Code
claude /install-skill https://xpay.tools/skills/baseten/baseten-chat-completions/SKILL.md
CLI
npx @xpaysh/cli install baseten/baseten-chat-completions
Manual
curl -o SKILL.md https://xpay.tools/skills/baseten/baseten-chat-completions/SKILL.md
Pricing

Cost

$0.01/call

Model

Flat rate

Provider

Baseten Model APIs

xpay tools logo

Marketplace for AI Capabilities. Run agents, tools & prompts with pay-per-use micropayments.

or ask your AI app
Popular Tools
FirecrawlTavilyExaHunterPerplexityRecraftIdeogramFlux
Collections
AI Image & MediaContent & ResearchDev ToolsFinanceLead GenWeb ScrapingCompetitive Intel
Developers
DocumentationAPI ReferenceQuickstartSDKsSkillsskill.mdllms.txt
Resources
The Agentic EconomyAgentic CommerceAgent Frameworksx402 ProtocolBlog
Company
About xpayxpay.shGitHubDiscordTwitter / X

© 2026 Agentically Inc. All rights reserved.
Privacy PolicyTerms of Service