OpenClaw + Sats4AI

What Your OpenClaw Agent Can Do with Sats4AI

One entry in your config gives your agent 10+ AI tools. Here's what it can actually build with them, and what each workflow costs.

10+ AI tools · No API key · Pay per call · OpenAI-compatible

Setup

Add one entry to your openclaw.json and your agent gets every tool listed below:

openclaw.json
{
  "mcpServers": {
    "sats4ai": {
      "url": "https://sats4ai.com/api/mcp"
    }
  }
}

No API key. No account. No billing setup. The agent pays per call with Bitcoin Lightning.
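
If your openclaw.json already lists other MCP servers, the entry needs to be merged in rather than pasted over them. A minimal sketch of that merge, assuming the config is plain JSON with a top-level "mcpServers" object as shown above (the "other-server" entry is hypothetical and only there to show that existing servers survive):

```python
import json

# The entry from the setup snippet above.
SATS4AI_ENTRY = {"url": "https://sats4ai.com/api/mcp"}


def add_sats4ai(config: dict) -> dict:
    """Return a copy of `config` with the sats4ai MCP server registered."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))  # keep any existing servers
    servers["sats4ai"] = dict(SATS4AI_ENTRY)
    merged["mcpServers"] = servers
    return merged


# Hypothetical existing config with one unrelated server.
existing = {"mcpServers": {"other-server": {"url": "https://example.com/mcp"}}}
print(json.dumps(add_sats4ai(existing), indent=2))
```

The function copies instead of mutating, so the original config stays untouched until you decide to write the result back to disk.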

Utilities

File Conversion

100 sats/file

Convert between 200+ file formats. Documents, images, audio, video, ebooks, spreadsheets. Batch-convert files as part of any pipeline — up to 1 GB per file.

Example prompts:

  • "Convert this DOCX report to PDF for sharing"
  • "Batch convert these WAV recordings to MP3"
  • "Turn this HEIC photo into a JPG before analyzing it"

SMS Worldwide

from 5 sats

Send SMS to any phone number in 200+ countries. Alerts, notifications, OTP codes, appointment reminders, or AI-generated outreach.

Example prompts:

  • "Send an SMS alert when this server goes down"
  • "Text this delivery update to the customer"
  • "Send appointment reminders to these numbers"

Automated Phone Calls

varies by country

Make phone calls with a custom spoken message. Combine with voice cloning for personalized automated calls in any language.

Example prompts:

  • "Call this number and deliver an appointment reminder"
  • "Generate a voice message in my cloned voice and call this number"

PDF Convert & Merge

200 sats

Convert PDFs to DOCX, HTML, Markdown, ODT, and more. Merge multiple PDFs into one. Prepare documents for downstream AI analysis or editing.

Example prompts:

  • "Convert this locked PDF to DOCX so I can edit it"
  • "Merge these 5 reports into one PDF for the client"
  • "Convert this PDF to Markdown before analyzing it"

Analysis & Text

AI Chat / Text Generation

per character

Frontier-class reasoning with Kimi K2.5 (100 chars/sat) or fast responses for simpler tasks (333 chars/sat). No minimum. File and image attachments supported. Reason, code, analyze, write.
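
Per-character pricing is easy to estimate up front. A rough sketch, assuming the bill is the character count divided by the chars/sat rate and rounded up to a whole sat; the rounding rule and the tier labels ("kimi-k2.5", "fast") are assumptions for illustration, not documented Sats4AI behavior:

```python
import math

# Rates quoted above, in characters per sat. The keys are illustrative labels.
CHARS_PER_SAT = {"kimi-k2.5": 100, "fast": 333}


def chat_cost_sats(char_count: int, tier: str) -> int:
    """Estimate the cost of a chat call in sats (assumes rounding up)."""
    return math.ceil(char_count / CHARS_PER_SAT[tier])


for tier in CHARS_PER_SAT:
    print(tier, chat_cost_sats(1_500, tier), "sats")
```

At these rates, 1,500 billed characters come to 15 sats on Kimi K2.5 and 5 sats on the fast tier.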

Example prompts:

  • "Analyze this codebase and suggest architectural improvements"
  • "Write product descriptions for these 50 items"
  • "Review this contract and flag potential issues"

Image Analysis / Vision

21 sats

Describe, analyze, and extract information from images. Product photos, screenshots, diagrams, documents — anything visual.

Example prompts:

  • "Describe this screenshot and extract any visible text"
  • "Analyze these product photos and write alt text for accessibility"
  • "Read this whiteboard photo and convert to structured notes"

Image Editing

200–450 sats

Edit images with natural language. Remove objects, change backgrounds, adjust styles, add elements. Supports up to 4K resolution and up to 14 reference images.

Example prompts:

  • "Remove the background from all product photos in this folder"
  • "Change the sky to sunset in this real estate photo"
  • "Add a logo watermark to these 20 images"

OCR / Text Extraction

10 sats/page

Extract text from PDFs, scanned documents, and images. Process invoices, receipts, contracts, handwritten notes — 30+ languages.

Example prompts:

  • "Extract all text from these scanned invoices and create a spreadsheet"
  • "Digitize this handwritten recipe"
  • "Convert this PDF report to editable text before summarizing it"

Voice & Audio

Text to Speech

300 sats

Convert any text to natural-sounding speech. Narrate articles, create voiceovers for videos, or build audio notifications. Works with custom cloned voices.

Example prompts:

  • "Read this article aloud and save as MP3"
  • "Generate a voiceover for this video script in my cloned voice"
  • "Create audio versions of these 10 product descriptions"

Voice Cloning

7,500 sats (one-time)

Clone any voice from an audio sample between 10 seconds and 5 minutes long. You get a permanent Voice ID that can be reused for text-to-speech as often as you like, with no further cloning fee.

Example prompts:

  • "Clone my voice from this recording, then narrate all my blog posts in my voice"
  • "Create a branded voice for our product tutorials"

Speech Transcription

10 sats/min

Audio and video to text with timestamps. Meetings, podcasts, interviews, lectures — up to 60 minutes per file.

Example prompts:

  • "Transcribe this podcast episode and create show notes"
  • "Convert this meeting recording to a summary with action items"
  • "Transcribe these 20 customer interviews and find common themes"

Content Creation

Image Generation

100–200 sats

Generate images from text prompts. Blog thumbnails, social posts, product mockups, concept art — created on demand without touching a design tool.

Example prompts:

  • "Generate a hero image for my blog post about remote work"
  • "Create 5 product mockup variations for my landing page"
  • "Make concept art for a fantasy RPG character"

Video Generation

300–550 sats/sec

Text-to-video or image-to-video. Short clips for ads, social content, explainer videos, or product demos. Optional audio track included.

Example prompts:

  • "Turn this product photo into a 5-second promo video"
  • "Generate a cinematic intro clip from this description"
  • "Create a short animated ad for my app"

Music Generation

100 sats

Generate original songs with lyrics and vocals. Background music for videos, jingles for podcasts, or full tracks for creative projects.

Example prompts:

  • "Compose a 30-second upbeat jingle for my podcast intro"
  • "Generate background music for this explainer video"
  • "Write a lofi hip-hop track about coding at night"

3D Model Generation

350 sats

Image-to-3D in ~30 seconds. Converts reference images into GLB files compatible with Unity, Unreal, Blender, Three.js, and 3D printers.

Example prompts:

  • "Generate a 3D model from this concept art for our game prototype"
  • "Turn this product photo into a 3D asset for the website"
  • "Create a printable 3D model from this sketch"

Multi-Tool Pipelines

The real power is chaining tools together. Your agent can combine multiple services in a single instruction, and each step is billed separately at its per-call price.

Image Gen → 3D Model → Vision Review

Content-to-3D Pipeline

Generate concept art from a text prompt (200 sats) → convert it to a 3D model (350 sats) → analyze the result with vision (21 sats). A reviewed 3D asset from a text prompt for ~571 sats.
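
The ~571 sats figure is just the three per-call prices added up. A small sketch of that sum, using the prices quoted above; the dictionary keys are illustrative labels, not Sats4AI tool names:

```python
# Per-call prices in sats, taken from the pipeline description above.
PRICES_SATS = {
    "image_generation": 200,  # top of the 100-200 sats range
    "3d_model": 350,
    "vision_review": 21,
}


def pipeline_cost(steps: list[str]) -> int:
    """Sum the fixed per-call prices for a chain of tool calls."""
    return sum(PRICES_SATS[step] for step in steps)


print(pipeline_cost(["image_generation", "3d_model", "vision_review"]))
```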

Transcription → AI Chat → TTS Intro

Podcast Production Pipeline

Transcribe an interview (10 sats/min) → AI writes show notes and a summary (15 sats) → generate a voiced intro in your cloned voice (300 sats). Full podcast post-production.

Image Gen → Video Gen → Music Gen

Full Video Production

Generate a hero image (200 sats) → animate it into a video clip (300+ sats/sec) → compose a background track (100 sats). A complete video with custom music from one instruction.

OCR → AI Chat → SMS Alert

Document Processing

Extract text from a scanned PDF (10 sats/page) → analyze and summarize with AI (15 sats) → send the summary via SMS (from 5 sats). Automate document intake end-to-end.
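
Because OCR is billed per page, this pipeline's cost scales with document length. A minimal sketch of a lower-bound estimate, assuming the 15-sat chat step and the 5-sat SMS floor quoted above (real SMS pricing starts at 5 sats and may be higher for some destinations):

```python
def document_pipeline_min_sats(pages: int, chat_sats: int = 15, sms_sats: int = 5) -> int:
    """Lower-bound cost in sats: per-page OCR, plus one summary, plus one SMS."""
    if pages < 1:
        raise ValueError("pages must be at least 1")
    ocr_sats = 10 * pages  # 10 sats/page
    return ocr_sats + chat_sats + sms_sats


print(document_pipeline_min_sats(12))  # a hypothetical 12-page scan
```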

Voice Clone → TTS → Phone Call

Voice Automation

Clone a voice once (7,500 sats one-time) → generate a personalized message in that voice (300 sats) → deliver it as an automated phone call (varies). Personalized voice outreach at scale.
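
Since the cloning fee is paid once, the effective cost per message drops as volume grows. A rough sketch of that amortization, using the 7,500-sat and 300-sat prices above and leaving out the phone call itself, whose price varies by country:

```python
def cost_per_message_sats(messages: int, clone_fee: int = 7_500, tts_fee: int = 300) -> float:
    """Average sats per voiced message, spreading the one-time clone fee across all of them."""
    if messages < 1:
        raise ValueError("messages must be at least 1")
    return clone_fee / messages + tts_fee


for n in (1, 10, 100):
    print(n, cost_per_message_sats(n))
```

At 100 messages the clone fee adds 75 sats to each one, so text-to-speech dominates the bill.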

How the agent pays automatically

Payment is handled by the L402 protocol — no human needs to approve each call.

  1. Agent calls a tool (e.g., generate_image)
  2. Server replies with HTTP 402 + a Lightning invoice
  3. Agent pays the invoice, resends with the payment proof
  4. Server returns the result

This happens in milliseconds. Your agent's wallet handles it all.
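
The four steps above can be sketched without touching the network. The code below is an in-memory simulation, not Sats4AI's real API: FakeServer, its placeholder macaroon, and the "lnbc-demo-" invoice string are all hypothetical. The header shapes follow the general L402 convention (a WWW-Authenticate challenge carrying a macaroon and an invoice, answered with "Authorization: L402 macaroon:preimage"); whether Sats4AI uses exactly these headers is an assumption.

```python
import hashlib
import os
import re


class FakeServer:
    """In-memory stand-in for an L402-protected endpoint (hypothetical)."""

    def __init__(self):
        self._preimage = os.urandom(32)  # secret revealed only on payment
        self.payment_hash = hashlib.sha256(self._preimage).hexdigest()
        self.macaroon = "demo-macaroon"  # placeholder token

    def handle(self, headers: dict):
        """Return (status, headers, body) for a request with the given headers."""
        match = re.fullmatch(r"L402 ([^:]+):([0-9a-f]{64})", headers.get("Authorization", ""))
        if match:
            macaroon, preimage_hex = match.groups()
            paid = hashlib.sha256(bytes.fromhex(preimage_hex)).hexdigest() == self.payment_hash
            if macaroon == self.macaroon and paid:
                return 200, {}, "result"
        # A real invoice would be a BOLT11 string; this one is a placeholder.
        challenge = f'L402 macaroon="{self.macaroon}", invoice="lnbc-demo-{self.payment_hash}"'
        return 402, {"WWW-Authenticate": challenge}, ""

    def settle(self, invoice: str) -> str:
        """Pretend Lightning settled the invoice and hand back the preimage."""
        assert invoice.endswith(self.payment_hash)
        return self._preimage.hex()


def call_with_l402(server: FakeServer, pay_invoice):
    """Run the four-step flow; pay_invoice(invoice) must return the preimage as hex."""
    status, headers, body = server.handle({})                  # 1. call the tool
    if status != 402:
        return status, body
    challenge = headers["WWW-Authenticate"]                    # 2. 402 + invoice
    fields = dict(re.findall(r'(\w+)="([^"]*)"', challenge))
    preimage = pay_invoice(fields["invoice"])                  # 3. pay, keep the proof
    auth = f'L402 {fields["macaroon"]}:{preimage}'
    status, _, body = server.handle({"Authorization": auth})   #    resend with proof
    return status, body                                        # 4. result


server = FakeServer()
print(call_with_l402(server, server.settle))
```

The payment proof is the invoice's preimage: the server only has to check that its SHA-256 hash matches the payment hash it issued, so it can verify the payment without keeping any per-customer account.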

Give your OpenClaw agent a wallet

Sats4AI is the service side. The wallet side is open: install the Alby Bitcoin Payments Skill and your agent gets a Nostr Wallet Connect interface it can drive itself. The bundled @getalby/cli fetch command auto-detects L402, X402, and MPP — so it pays any Sats4AI endpoint with one line.

# 1. install the skill (Claude Code, Gemini CLI, Roo Code, OpenClaw)
npx skills add getAlby/payments-skill

# 2. connect any NWC wallet (Alby Hub, etc.) — or grab a test wallet
curl -X POST "https://faucet.nwc.dev?balance=10000"

# 3. pay any Sats4AI L402 endpoint, one command
npx @getalby/cli fetch "https://sats4ai.com/api/translate-text" \
  --method POST \
  --body '{"text":"hello world","target_language":"es"}'

No API key, no account, no SDK wiring. Self-custodial wallet on one side, Lightning-paid AI tools on the other.

What a Typical Agent Session Costs

A multi-tool autonomous session using several Sats4AI tools:

Item                                  Cost
5 image generations                   1,000 sats
20 AI chat messages (Kimi K2.5)       ~300 sats
1 video clip (5 seconds)              ~2,000 sats
30 min transcription                  300 sats
1 voice clone + 3 TTS generations     8,400 sats
10 file conversions                   1,000 sats
Total                                 ~13,000 sats (~$8)

An entire multi-tool agent session for the price of a coffee. No subscription, no commitment — just pay for what you use.
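
The dollar figure moves with the BTC/USD exchange rate, so it is worth recomputing for your own budget. A minimal sketch of the conversion, where the $64,000 rate is a hypothetical value chosen for illustration, not a quoted price:

```python
SATS_PER_BTC = 100_000_000  # 1 BTC = 100 million sats


def sats_to_usd(sats: int, btc_usd: float) -> float:
    """Convert an amount in sats to US dollars at the given BTC/USD rate."""
    return sats * btc_usd / SATS_PER_BTC


# Hypothetical exchange rate; substitute the current one.
print(round(sats_to_usd(10_000, 64_000), 2))
```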

Give Your Agent Superpowers

One line of config. 10+ AI tools. Pay-per-use with Lightning. No account, no API key, no credit card.