xpay tools logo
  • Providers
  • Collections
  • Explore
  • Docs
  • xpay.sh
  1. Home
  2. ›
  3. Providers
  4. ›
  5. Baseten Model APIs

Baseten Model APIs logo

Baseten Model APIs

Community
1 tool

$0.01/call

OpenAI-compatible inference API for high-performance LLMs. Drop-in replacement for OpenAI SDK - just change base_url and api_key. **Supported Models:** | Model | Slug | Context | |-------|------|--------| | DeepSeek V3 0324 | `deepseek-ai/DeepSeek-V3-0324` | 164k | | DeepSeek V3.1 | `deepseek-ai/DeepSeek-V3.1` | 164k | | GLM 4.6 (Zhipu) | `zai-org/GLM-4.6` | 200k | | GLM 4.7 (Zhipu) | `zai-org/GLM-4.7` | 200k | | Kimi K2 0905 | `moonshotai/Kimi-K2-Instruct-0905` | 128k | | Kimi K2 Thinking | `moonshotai/Kimi-K2-Thinking` | 262k | | Kimi K2.5 | `moonshotai/Kimi-K2.5` | 262k | | OpenAI GPT OSS 120B | `openai/gpt-oss-120b` | 128k | **Features:** Chat completions, streaming, tool calling, structured outputs, reasoning modes. **Pricing:** ~$0.60/1M tokens (varies by model)

​
​
Baseten Chat Completions
baseten_chat_completions

Create a chat completion using OpenAI-compatible API. **Supported Models:** - `deepseek-ai/DeepSeek-V3-0324` - DeepSeek V3 0324 (164k context) 🧠 - `deepseek-ai/DeepSeek-V3.1` - DeepSeek V3.1 (164k context) 🧠 - `zai-org/GLM-4.6` - GLM 4.6 (200k context) 🧠 - `zai-org/GLM-4.7` - GLM 4.7 (200k context) 🧠 - `moonshotai/Kimi-K2-Instruct-0905` - Kimi K2 0905 (128k context) - `moonshotai/Kimi-K2-Thinking` - Kimi K2 Thinking (262k context) 🧠 always-on - `moonshotai/Kimi-K2.5` - Kimi K2.5 (262k context) - `openai/gpt-oss-120b` - OpenAI GPT OSS 120B (128k context) 🧠 = Reasoning model. Use `reasoning_effort` param (low/medium/high) to control thinking depth. Response includes `reasoning_content` field with chain-of-thought. Supports streaming, tool calling, structured outputs.


flat

$0.01/call

Connect Baseten Model APIs tools
​
​

Cursor

Claude Code

Claude Desktop

Windsurf

VS Code

Cline

Roo Code

ChatGPT

Gemini CLI

Amazon Q

Goose

Augment

n8n

API / cURL

AI SDK

TypeScript SDK

Add to ~/.cursor/mcp.json
{
  "mcpServers": {
    "baseten": {
      "url": "https://baseten.mcp.xpay.sh/mcp?key=YOUR_API_KEY"
    }
  }
}
Or connect all tools

Access all tools (including Baseten Model APIs) through a single MCP connection.

https://mcp.xpay.sh/mcp?key=***
{
  "mcpServers": {
    "xpay": {
      "url": "https://mcp.xpay.sh/mcp?key=YOUR_API_KEY"
    }
  }
}
Agent Discovery

Machine-readable catalogs for LLM agents and automation.

llms.txt — Human-readable tool catalog
curl https://xpay.tools/llms.txt
agents.txt — Structured JSON for agents
curl https://xpay.tools/agents.txt
skill.md — Claude Code skill
curl https://xpay.tools/skill.md
Pricing

Pay per tool call. No subscriptions.

1 Baseten Model APIs tool available
Frequently Asked Questions

Baseten Model APIs is an API provider available on xpay with 1 tool. OpenAI-compatible inference API for high-performance LLMs. Drop-in replacement for OpenAI SDK - just change base_url and api_key. **Supported Models:** | Model | Slug | Context | |-------|------|--------| | DeepSeek V3 0324 | `deepseek-ai/DeepSeek-V3-0324` | 164k | | DeepSeek V3.1 | `deepseek-ai/DeepSeek-V3.1` | 164k | | GLM 4.6 (Zhipu) | `zai-org/GLM-4.6` | 200k | | GLM 4.7 (Zhipu) | `zai-org/GLM-4.7` | 200k | | Kimi K2 0905 | `moonshotai/Kimi-K2-Instruct-0905` | 128k | | Kimi K2 Thinking | `moonshotai/Kimi-K2-Thinking` | 262k | | Kimi K2.5 | `moonshotai/Kimi-K2.5` | 262k | | OpenAI GPT OSS 120B | `openai/gpt-oss-120b` | 128k | **Features:** Chat completions, streaming, tool calling, structured outputs, reasoning modes. **Pricing:** ~$0.60/1M tokens (varies by model) All tools are pay-per-call with no monthly fees.

Baseten Model APIs tools are priced at $0.01/call. You only pay when you use a tool — there are no subscriptions or monthly fees. New accounts get $5 in free credits to try any tool.

Add xpay's MCP server to your AI client (Claude Code, Cursor, VS Code, Windsurf, or Cline) with the endpoint: https://mcp.xpay.sh/mcp?key=YOUR_API_KEY. Once connected, use xpay_discover to find Baseten Model APIs tools, then xpay_run to execute them. For Claude Code: claude mcp add --transport http xpay "https://mcp.xpay.sh/mcp?key=YOUR_API_KEY"

Baseten Model APIs provides 1 tool: baseten chat completions. Each tool can be called independently and has its own pricing.

No. With xpay, one API key gives you access to all 1 Baseten Model APIs tools plus 1000+ tools from 80+ other providers. You don't need to sign up for Baseten Model APIs's API directly — xpay handles authentication, billing, and rate limiting.

Yes. Every new xpay account gets $5 in free credits. You can use these credits to try any Baseten Model APIs tool — no credit card required. Sign up at xpay.tools to get started.

xpay tools logo

Marketplace for AI Capabilities. Run agents, tools & prompts with pay-per-use micropayments.

Product
ExploreCollectionsBundles
Resources
Documentationxpay.shGitHub

© 2026 Agentically Inc. All rights reserved.Microtransactions happen via Stablecoins