Skip to main content
OpenClaw + Sats4AI

What Your OpenClaw Agent Can Do with Sats4AI

One line in your config gives your agent 10+ AI tools. Here's what it can actually build with them — and what each workflow costs.

10+ AI toolsNo API keyPay per callOpenAI-compatible

Setup

Add one entry to your openclaw.json and your agent gets every tool listed below:

openclaw.json
{
  "mcpServers": {
    "sats4ai": {
      "url": "https://sats4ai.com/api/mcp"
    }
  }
}

No API key. No account. No billing setup. The agent pays per call with Bitcoin Lightning.

Utilities

File Conversion

100 sats/file

Convert between 200+ file formats. Documents, images, audio, video, ebooks, spreadsheets. Batch-convert files as part of any pipeline — up to 1 GB per file.

Example prompts:

  • • "Convert this DOCX report to PDF for sharing"
  • • "Batch convert these WAV recordings to MP3"
  • • "Turn this HEIC photo into a JPG before analyzing it"
Full guide →

SMS Worldwide

from 5 sats

Send SMS to any phone number in 200+ countries. Alerts, notifications, OTP codes, appointment reminders, or AI-generated outreach.

Example prompts:

  • • "Send an SMS alert when this server goes down"
  • • "Text this delivery update to the customer"
  • • "Send appointment reminders to these numbers"
Full guide →

Automated Phone Calls

varies by country

Make phone calls with a custom spoken message. Combine with voice cloning for personalized automated calls in any language.

Example prompts:

  • • "Call this number and deliver an appointment reminder"
  • • "Generate a voice message in my cloned voice and call this number"
Full guide →

PDF Convert & Merge

200 sats

Convert PDFs to DOCX, HTML, Markdown, ODT, and more. Merge multiple PDFs into one. Prepare documents for downstream AI analysis or editing.

Example prompts:

  • • "Convert this locked PDF to DOCX so I can edit it"
  • • "Merge these 5 reports into one PDF for the client"
  • • "Convert this PDF to Markdown before analyzing it"
Full guide →
Analysis & Text

AI Chat / Text Generation

per character

Frontier-class reasoning with Kimi K2.5 (100 chars/sat) or fast responses for simpler tasks (333 chars/sat). No minimum. File and image attachments supported. Reason, code, analyze, write.

Example prompts:

  • • "Analyze this codebase and suggest architectural improvements"
  • • "Write product descriptions for these 50 items"
  • • "Review this contract and flag potential issues"
Full guide →

Image Analysis / Vision

21 sats

Describe, analyze, and extract information from images. Product photos, screenshots, diagrams, documents — anything visual.

Example prompts:

  • • "Describe this screenshot and extract any visible text"
  • • "Analyze these product photos and write alt text for accessibility"
  • • "Read this whiteboard photo and convert to structured notes"
Full guide →

Image Editing

200–450 sats

Edit images with natural language. Remove objects, change backgrounds, adjust styles, add elements — up to 4K resolution with 14 reference images.

Example prompts:

  • • "Remove the background from all product photos in this folder"
  • • "Change the sky to sunset in this real estate photo"
  • • "Add a logo watermark to these 20 images"
Full guide →

OCR / Text Extraction

10 sats/page

Extract text from PDFs, scanned documents, and images. Process invoices, receipts, contracts, handwritten notes — 30+ languages.

Example prompts:

  • • "Extract all text from these scanned invoices and create a spreadsheet"
  • • "Digitize this handwritten recipe"
  • • "Convert this PDF report to editable text before summarizing it"
Full guide →
Voice & Audio

Text to Speech

300 sats

Convert any text to natural-sounding speech. Narrate articles, create voiceovers for videos, or build audio notifications. Works with custom cloned voices.

Example prompts:

  • • "Read this article aloud and save as MP3"
  • • "Generate a voiceover for this video script in my cloned voice"
  • • "Create audio versions of these 10 product descriptions"
Full guide →

Voice Cloning

7,500 sats (one-time)

Clone any voice from a 10-second to 5-minute audio sample. Get a permanent Voice ID that works unlimited times on text-to-speech at no extra cloning fee.

Example prompts:

  • • "Clone my voice from this recording, then narrate all my blog posts in my voice"
  • • "Create a branded voice for our product tutorials"
Full guide →

Speech Transcription

10 sats/min

Audio and video to text with timestamps. Meetings, podcasts, interviews, lectures — up to 1 GB files.

Example prompts:

  • • "Transcribe this podcast episode and create show notes"
  • • "Convert this meeting recording to a summary with action items"
  • • "Transcribe these 20 customer interviews and find common themes"
Full guide →
Content Creation

Image Generation

100–200 sats

Generate images from text prompts. Blog thumbnails, social posts, product mockups, concept art — created on demand without touching a design tool.

Example prompts:

  • • "Generate a hero image for my blog post about remote work"
  • • "Create 5 product mockup variations for my landing page"
  • • "Make concept art for a fantasy RPG character"
Full guide →

Video Generation

300–550 sats/sec

Text-to-video or image-to-video. Short clips for ads, social content, explainer videos, or product demos. Optional audio track included.

Example prompts:

  • • "Turn this product photo into a 5-second promo video"
  • • "Generate a cinematic intro clip from this description"
  • • "Create a short animated ad for my app"
Full guide →

Music Generation

100 sats

Generate original songs with lyrics and vocals. Background music for videos, jingles for podcasts, or full tracks for creative projects.

Example prompts:

  • • "Compose a 30-second upbeat jingle for my podcast intro"
  • • "Generate background music for this explainer video"
  • • "Write a lofi hip-hop track about coding at night"
Full guide →

3D Model Generation

350 sats

Image-to-3D in ~30 seconds. Converts reference images into GLB files compatible with Unity, Unreal, Blender, Three.js, and 3D printers.

Example prompts:

  • • "Generate a 3D model from this concept art for our game prototype"
  • • "Turn this product photo into a 3D asset for the website"
  • • "Create a printable 3D model from this sketch"
Full guide →

Multi-Tool Pipelines

The real power is chaining tools together. Your agent can combine multiple services in a single instruction — each step costs a few sats.

Image Gen3D ModelVision Review

Content-to-3D Pipeline

Generate concept art from a text prompt (200 sats) → convert it to a 3D model (350 sats) → analyze the result with vision (21 sats). A reviewed 3D asset from a text prompt for ~571 sats.

TranscriptionAI ChatTTS Intro

Podcast Production Pipeline

Transcribe an interview (10 sats/min) → AI writes show notes and a summary (15 sats) → generate a voiced intro in your cloned voice (300 sats). Full podcast post-production.

Image GenVideo GenMusic Gen

Full Video Production

Generate a hero image (200 sats) → animate it into a video clip (300+ sats/sec) → compose a background track (300 sats). A complete video with custom music from one instruction.

OCRAI ChatSMS Alert

Document Processing

Extract text from a scanned PDF (10 sats/page) → analyze and summarize with AI (15 sats) → send the summary via SMS (from 5 sats). Automate document intake end-to-end.

Voice CloneTTSPhone Call

Voice Automation

Clone a voice once (7,500 sats one-time) → generate a personalized message in that voice (300 sats) → deliver it as an automated phone call (varies). Personalized voice outreach at scale.

How the agent pays automatically

Payment is handled by the L402 protocol — no human needs to approve each call.

  1. 1. Agent calls a tool (e.g., generate_image)
  2. 2. Server replies with HTTP 402 + a Lightning invoice
  3. 3. Agent pays the invoice, resends with the payment proof
  4. 4. Server returns the result

This happens in milliseconds. Your agent's wallet handles it all.

What a Typical Agent Session Costs

A multi-tool autonomous session using several Sats4AI tools:

5 image generations1,000 sats
20 AI chat messages (Kimi K2.5)~300 sats
1 video clip (5 seconds)~2,000 sats
30 min transcription300 sats
1 voice clone + 3 TTS generations8,400 sats
10 file conversions500 sats
Total~12,500 sats (~$8)

An entire multi-tool agent session for the price of a coffee. No subscription, no commitment — just pay for what you use.

Give Your Agent Superpowers

One line of config. 10+ AI tools. Pay-per-use with Lightning. No account, no API key, no credit card.