Olostep Start Crawl SKILL.md — Install for Claude Code, Cursor & Cline | Olostep API

Skill

Olostep Start Crawl

olostep_start_crawl

Starts a new crawl. You receive an ID to track its progress. The operation may take 1-10 minutes, depending on the site and on the `max_depth` and `max_pages` parameters.

$0.10/call

Flat rate

Olostep API


When to Use

Use this tool when you need to start a new crawl. You receive an ID to track its progress. The operation may take 1-10 minutes, depending on the site and on the `max_depth` and `max_pages` parameters. This is part of the Olostep API provider on xpay✦.

MCP Connection

Connect to xpay✦ to access this tool (and 11+ others):

{
  "mcpServers": {
    "xpay": {
      "url": "https://mcp.xpay.sh/mcp?key=YOUR_API_KEY"
    }
  }
}

For Claude Code:

claude mcp add --transport http xpay "https://mcp.xpay.sh/mcp?key=YOUR_API_KEY"

How to Execute

Use the xpay✦ meta-tools to run this tool:

xpay_details — Get full input schema: xpay_details("olostep/olostep_start_crawl")
xpay_run — Execute: xpay_run("olostep/olostep_start_crawl", { ...inputs })
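
As a rough illustration of the second step, the sketch below assembles the input object in Python and checks the two required parameters before it is handed to `xpay_run`. The helper name `build_crawl_inputs` and the example site are hypothetical; only the parameter names come from the Input Parameters table, and how the object is actually passed to `xpay_run` depends on your MCP client.

```python
import json

# Parameter names are taken from the Input Parameters table; the function
# name and the example values are illustrative assumptions only.
REQUIRED = ("start_url", "max_pages")
OPTIONAL = (
    "include_urls", "exclude_urls", "max_depth", "top_n", "search_query",
    "include_external", "include_subdomain", "webhook_url", "timeout",
)


def build_crawl_inputs(**params):
    """Return a dict of crawl inputs, rejecting missing or unknown keys."""
    missing = [name for name in REQUIRED if name not in params]
    if missing:
        raise ValueError(f"missing required parameter(s): {', '.join(missing)}")
    unknown = [name for name in params if name not in REQUIRED + OPTIONAL]
    if unknown:
        raise ValueError(f"unknown parameter(s): {', '.join(unknown)}")
    return dict(params)


inputs = build_crawl_inputs(
    start_url="https://example.com",  # hypothetical site
    max_pages=50,
    include_urls=["/blog/**"],
    exclude_urls=["/careers/**"],
    max_depth=2,
)

# This JSON is what would take the place of { ...inputs } in xpay_run.
print(json.dumps(inputs, indent=2))
```

Validating locally is a design choice rather than a requirement: it catches a missing `start_url` or `max_pages` before a billed call is made.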

Input Parameters

Parameter	Type	Required	Description
`include_urls`	array	No	URL path patterns to include in the crawl, using glob syntax. Defaults to `/**`, which includes all URLs. Use patterns like `/blog/**` to crawl specific sections (e.g., only blog pages), `/products/*.html` for product pages, or multiple patterns for different sections. Supports standard glob features like `*` (any characters) and `**` (recursive matching).
`top_n`	number	No	An optional number that limits the crawl to the top N most relevant links on every page, ranked according to the search query.
`webhook_url`	string	No	An optional endpoint that receives a POST request when the crawl is completed. The body of the request will be the same as the response of the `v1/crawls/{crawl_id}` endpoint.
`max_depth`	number	No	Maximum depth of the crawl. Useful to extract only up to n-degree of links.
`include_external`	boolean	No	Crawl first-degree external links.
`start_url`	string	Yes	The starting point of the crawl.
`include_subdomain`	boolean	No	Include subdomains of the website. `false` by default.
`exclude_urls`	array	No	URL path names in glob pattern to exclude. For example: `/careers/**`. Excluded URLs will supersede included URLs.
`max_pages`	number	Yes	Maximum number of pages to crawl. Recommended for most use cases like crawling an entire website.
`search_query`	string	No	An optional search query to find specific links and also sort the results by relevance.
`timeout`	number	No	End the crawl after n seconds, keeping the pages completed up to that point. The crawl may run about 10 seconds past the provided timeout.
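
To make the interaction between `include_urls` and `exclude_urls` concrete, here is a minimal sketch of glob-based path filtering in which exclusions take precedence. It is not Olostep's implementation: the function names are hypothetical, and the matching semantics (`**` crossing `/`, `*` staying within one path segment) are an assumption based on the descriptions in the table.

```python
import re


def glob_to_regex(pattern):
    """Translate a path glob into a compiled regex.

    Assumed semantics: '**' matches any characters including '/',
    while '*' matches any characters except '/'.
    """
    parts = []
    i = 0
    while i < len(pattern):
        if pattern.startswith("**", i):
            parts.append(".*")
            i += 2
        elif pattern[i] == "*":
            parts.append("[^/]*")
            i += 1
        else:
            parts.append(re.escape(pattern[i]))
            i += 1
    return re.compile("".join(parts))


def is_crawlable(path, include_urls=("/**",), exclude_urls=()):
    """Return True if path matches an include pattern and no exclude pattern."""
    if any(glob_to_regex(p).fullmatch(path) for p in exclude_urls):
        return False  # excluded URLs supersede included URLs
    return any(glob_to_regex(p).fullmatch(path) for p in include_urls)


include = ["/blog/**", "/products/*.html", "/careers/**"]
exclude = ["/careers/**"]
for path in ["/blog/2024/post", "/careers/engineer", "/products/widget.html"]:
    print(path, is_crawlable(path, include, exclude))
```

In this sketch `/careers/engineer` is rejected even though it matches an include pattern, which mirrors the rule that excluded URLs supersede included ones.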

Pricing

Cost: $0.10/call
Balance check: Use xpay_balance to check remaining credits
Get your API key at xpay.tools — $5 free credits included

Related Skills

Olostep API (all tools) — 12 tools
Olostep Create Scrape — $0.02/call
Olostep Create Answer — $0.10/call
Olostep Maps — $0.02/call
Olostep Start Batch — $0.02/call
Olostep Batch Items — $0.02/call

Links

Tool page: https://xpay.tools/olostep/olostep-start-crawl/
Provider: https://xpay.tools/olostep/
All tools: https://xpay.tools/explore

Other Olostep API Skills

Olostep Create Scrape — $0.02
Olostep Create Answer — $0.10
Olostep Maps — $0.02
Olostep Start Batch — $0.02
Olostep Batch Items — $0.02
Olostep Crawl Info — $0.02
Olostep Crawl Pages — $0.02
Olostep Get Answer — $0.02

Install Skill

Claude Code

claude /install-skill https://xpay.tools/skills/olostep/olostep-start-crawl/SKILL.md

CLI

npx @xpaysh/cli install olostep/olostep-start-crawl

Manual

curl -o SKILL.md https://xpay.tools/skills/olostep/olostep-start-crawl/SKILL.md
