What Your OpenClaw Agent Can Do with Sats4AI
One entry in your config gives your agent 10+ AI tools. Here's what it can actually build with them, and what each workflow costs.
Setup
Add one entry to your openclaw.json and your agent gets every tool listed below:
{
  "mcpServers": {
    "sats4ai": {
      "url": "https://sats4ai.com/api/mcp"
    }
  }
}
No API key. No account. No billing setup. The agent pays per call with Bitcoin Lightning.
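If your openclaw.json already lists other servers, the entry has to be merged in rather than pasted over the file. The following is a minimal Python sketch of that merge. The helper names (`add_sats4ai`, `update_config_file`) and the example "other" server are made up for illustration; only the `mcpServers` key and the URL come from the config above.

```python
import json
from pathlib import Path

# The entry shown in the Setup section above.
SATS4AI_ENTRY = {"url": "https://sats4ai.com/api/mcp"}


def add_sats4ai(config: dict) -> dict:
    """Return a copy of `config` with the sats4ai server added under mcpServers."""
    updated = dict(config)
    servers = dict(updated.get("mcpServers", {}))  # keep any existing servers
    servers["sats4ai"] = dict(SATS4AI_ENTRY)
    updated["mcpServers"] = servers
    return updated


def update_config_file(path: Path) -> None:
    """Read `path` if it exists, add the entry, and write the result back."""
    config = json.loads(path.read_text()) if path.exists() else {}
    path.write_text(json.dumps(add_sats4ai(config), indent=2) + "\n")


if __name__ == "__main__":
    # In-memory demo with a hypothetical pre-existing server entry.
    existing = {"mcpServers": {"other": {"url": "https://example.com/mcp"}}}
    print(json.dumps(add_sats4ai(existing), indent=2))
```

Calling `update_config_file(Path("openclaw.json"))` would apply the same merge to a file on disk; where that file lives depends on your installation.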
File Conversion
Price: 100 sats/file
Convert between 200+ file formats: documents, images, audio, video, ebooks, spreadsheets. Batch-convert files as part of any pipeline, up to 1 GB per file.
Example prompts:
- "Convert this DOCX report to PDF for sharing"
- "Batch convert these WAV recordings to MP3"
- "Turn this HEIC photo into a JPG before analyzing it"
SMS Worldwide
Price: from 5 sats
Send SMS to any phone number in 200+ countries: alerts, notifications, OTP codes, appointment reminders, or AI-generated outreach.
Example prompts:
- "Send an SMS alert when this server goes down"
- "Text this delivery update to the customer"
- "Send appointment reminders to these numbers"
Automated Phone Calls
Price: varies by country
Make phone calls with a custom spoken message. Combine with voice cloning for personalized automated calls in any language.
Example prompts:
- "Call this number and deliver an appointment reminder"
- "Generate a voice message in my cloned voice and call this number"
PDF Convert & Merge
Price: 200 sats
Convert PDFs to DOCX, HTML, Markdown, ODT, and more. Merge multiple PDFs into one. Prepare documents for downstream AI analysis or editing.
Example prompts:
- "Convert this locked PDF to DOCX so I can edit it"
- "Merge these 5 reports into one PDF for the client"
- "Convert this PDF to Markdown before analyzing it"
AI Chat / Text Generation
Price: per character
Frontier-class reasoning with Kimi K2.5 (100 chars/sat) or fast responses for simpler tasks (333 chars/sat). No minimum. File and image attachments supported. Reason, code, analyze, write.
Example prompts:
- "Analyze this codebase and suggest architectural improvements"
- "Write product descriptions for these 50 items"
- "Review this contract and flag potential issues"
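To make the per-character pricing concrete, here is a small Python sketch that turns a character count into sats. The two rates come from the description above. The tier labels, the function name, and rounding up to a whole sat are assumptions, and the page does not say whether input, output, or both are counted.

```python
# Rates quoted above; the dictionary keys are hypothetical labels.
CHARS_PER_SAT = {"kimi-k2.5": 100, "fast": 333}


def chat_cost_sats(num_chars: int, tier: str) -> int:
    """Estimate the cost of `num_chars` characters, rounded up to a whole sat.

    Rounding up is an assumption; the service may bill differently.
    """
    rate = CHARS_PER_SAT[tier]
    return -(-num_chars // rate)  # integer ceiling division


if __name__ == "__main__":
    print(chat_cost_sats(1500, "kimi-k2.5"))  # 15
    print(chat_cost_sats(1500, "fast"))       # 5
```

Under these assumptions a 1,500-character exchange costs about 15 sats on the Kimi K2.5 tier and about 5 sats on the fast tier.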
Image Analysis / Vision
Price: 21 sats
Describe, analyze, and extract information from images: product photos, screenshots, diagrams, documents, anything visual.
Example prompts:
- "Describe this screenshot and extract any visible text"
- "Analyze these product photos and write alt text for accessibility"
- "Read this whiteboard photo and convert to structured notes"
Image Editing
Price: 200–450 sats
Edit images with natural language. Remove objects, change backgrounds, adjust styles, add elements. Up to 4K resolution with 14 reference images.
Example prompts:
- "Remove the background from all product photos in this folder"
- "Change the sky to sunset in this real estate photo"
- "Add a logo watermark to these 20 images"
OCR / Text Extraction
Price: 10 sats/page
Extract text from PDFs, scanned documents, and images. Process invoices, receipts, contracts, and handwritten notes in 30+ languages.
Example prompts:
- "Extract all text from these scanned invoices and create a spreadsheet"
- "Digitize this handwritten recipe"
- "Convert this PDF report to editable text before summarizing it"
Text to Speech
Price: 300 sats
Convert any text to natural-sounding speech. Narrate articles, create voiceovers for videos, or build audio notifications. Works with custom cloned voices.
Example prompts:
- "Read this article aloud and save as MP3"
- "Generate a voiceover for this video script in my cloned voice"
- "Create audio versions of these 10 product descriptions"
Voice Cloning
Price: 7,500 sats (one-time)
Clone any voice from an audio sample between 10 seconds and 5 minutes long. You get a permanent Voice ID that you can reuse with text-to-speech as often as you like, with no further cloning fee.
Example prompts:
- "Clone my voice from this recording, then narrate all my blog posts in my voice"
- "Create a branded voice for our product tutorials"
Speech Transcription
Price: 10 sats/min
Audio and video to text with timestamps. Meetings, podcasts, interviews, lectures. Files up to 1 GB.
Example prompts:
- "Transcribe this podcast episode and create show notes"
- "Convert this meeting recording to a summary with action items"
- "Transcribe these 20 customer interviews and find common themes"
Image Generation
Price: 100–200 sats
Generate images from text prompts. Blog thumbnails, social posts, product mockups, and concept art, created on demand without touching a design tool.
Example prompts:
- "Generate a hero image for my blog post about remote work"
- "Create 5 product mockup variations for my landing page"
- "Make concept art for a fantasy RPG character"
Video Generation
Price: 300–550 sats/sec
Text-to-video or image-to-video. Short clips for ads, social content, explainer videos, or product demos. Optional audio track included.
Example prompts:
- "Turn this product photo into a 5-second promo video"
- "Generate a cinematic intro clip from this description"
- "Create a short animated ad for my app"
Music Generation
Price: 100 sats
Generate original songs with lyrics and vocals. Background music for videos, jingles for podcasts, or full tracks for creative projects.
Example prompts:
- "Compose a 30-second upbeat jingle for my podcast intro"
- "Generate background music for this explainer video"
- "Write a lofi hip-hop track about coding at night"
3D Model Generation
Price: 350 sats
Image-to-3D in ~30 seconds. Converts reference images into GLB files compatible with Unity, Unreal, Blender, Three.js, and 3D printers.
Example prompts:
- "Generate a 3D model from this concept art for our game prototype"
- "Turn this product photo into a 3D asset for the website"
- "Create a printable 3D model from this sketch"
Multi-Tool Pipelines
The real power is chaining tools together. Your agent can combine multiple services in a single instruction, and the total cost is simply the sum of the per-step prices.
Content-to-3D Pipeline
Generate concept art from a text prompt (200 sats) → convert it to a 3D model (350 sats) → analyze the result with vision (21 sats). A reviewed 3D asset from a text prompt for ~571 sats.
Podcast Production Pipeline
Transcribe an interview (10 sats/min) → AI writes show notes and a summary (15 sats) → generate a voiced intro in your cloned voice (300 sats). Full podcast post-production.
Full Video Production
Generate a hero image (200 sats) → animate it into a video clip (300+ sats/sec) → compose a background track (100 sats). A complete video with custom music from one instruction.
Document Processing
Extract text from a scanned PDF (10 sats/page) → analyze and summarize with AI (15 sats) → send the summary via SMS (from 5 sats). Automate document intake end-to-end.
Voice Automation
Clone a voice once (7,500 sats one-time) → generate a personalized message in that voice (300 sats) → deliver it as an automated phone call (varies). Personalized voice outreach at scale.
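The pipeline totals above are plain sums of per-step prices, so they are easy to estimate before running anything. The Python sketch below does that for three of the pipelines, using only prices quoted in this document. The `Step` type and function names are illustrative, the 15-sat text step is the document's own estimate, and SMS is taken at its "from 5 sats" floor, so real totals may be higher.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    sats_per_unit: int
    units: int = 1  # pages, minutes, or 1 for flat-priced tools

    @property
    def cost(self) -> int:
        return self.sats_per_unit * self.units


def pipeline_cost(steps: list[Step]) -> int:
    """Total cost of a pipeline in sats: the sum of its steps."""
    return sum(step.cost for step in steps)


def content_to_3d() -> list[Step]:
    return [Step("image generation", 200), Step("3D model", 350), Step("vision review", 21)]


def podcast(minutes: int) -> list[Step]:
    return [Step("transcription", 10, minutes), Step("show notes", 15), Step("voiced intro", 300)]


def document_intake(pages: int) -> list[Step]:
    # SMS is priced "from 5 sats"; 5 is a lower bound, not a quote.
    return [Step("OCR", 10, pages), Step("summary", 15), Step("SMS", 5)]


if __name__ == "__main__":
    print(pipeline_cost(content_to_3d()))      # 571
    print(pipeline_cost(podcast(45)))          # 765
    print(pipeline_cost(document_intake(12)))  # 140
```

The one-time 7,500-sat voice cloning fee is left out of the podcast estimate on purpose: it is paid once, not per episode.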
How the agent pays automatically
Payment is handled by the L402 protocol — no human needs to approve each call.
1. Agent calls a tool (e.g., generate_image)
2. Server replies with HTTP 402 + a Lightning invoice
3. Agent pays the invoice, resends with the payment proof
4. Server returns the result
This happens in milliseconds. Your agent's wallet handles it all.
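The four steps can be sketched as a small offline simulation in Python. Nothing here touches the network: `FakeLightning`, `FakeServer`, and `FakeWallet` are made-up stand-ins, the hex token stands in for a real macaroon, and the price is arbitrary. The header shapes follow the general L402 convention (a `WWW-Authenticate` challenge carrying a macaroon and an invoice, answered by `Authorization: L402 <macaroon>:<preimage>`); the exact headers Sats4AI sends are an assumption.

```python
import hashlib
import re
import secrets
from dataclasses import dataclass, field


@dataclass
class Response:
    status: int
    headers: dict = field(default_factory=dict)
    body: str = ""


class FakeLightning:
    """Toy payment network: paying an invoice reveals its preimage."""

    def __init__(self):
        self._preimages = {}

    def create_invoice(self, amount_sats: int):
        preimage = secrets.token_bytes(32)
        payment_hash = hashlib.sha256(preimage).hexdigest()
        invoice = f"fakeinvoice-{amount_sats}-{payment_hash}"
        self._preimages[invoice] = preimage.hex()
        return invoice, payment_hash

    def settle(self, invoice: str) -> str:
        return self._preimages[invoice]


class FakeWallet:
    def __init__(self, network: FakeLightning):
        self.network = network

    def pay(self, invoice: str) -> str:
        return self.network.settle(invoice)  # proof of payment (hex preimage)


class FakeServer:
    PRICE_SATS = 100  # arbitrary price for the simulation

    def __init__(self, network: FakeLightning):
        self.network = network
        self._pending = {}  # token -> expected payment hash

    def handle(self, tool: str, headers: dict) -> Response:
        auth = headers.get("Authorization", "")
        match = re.fullmatch(r"L402 ([0-9a-f]+):([0-9a-f]{64})", auth)
        if match:
            token, preimage = match.groups()
            digest = hashlib.sha256(bytes.fromhex(preimage)).hexdigest()
            if self._pending.get(token) == digest:
                return Response(200, body=f"result of {tool}")  # step 4
        # Step 2: no valid proof, so answer 402 with a fresh invoice.
        invoice, payment_hash = self.network.create_invoice(self.PRICE_SATS)
        token = secrets.token_hex(8)
        self._pending[token] = payment_hash
        challenge = f'L402 macaroon="{token}", invoice="{invoice}"'
        return Response(402, headers={"WWW-Authenticate": challenge})


def call_tool(server: FakeServer, wallet: FakeWallet, tool: str) -> Response:
    first = server.handle(tool, {})  # step 1: unpaid call
    if first.status != 402:
        return first
    pattern = r'L402 macaroon="([^"]+)", invoice="([^"]+)"'
    macaroon, invoice = re.fullmatch(pattern, first.headers["WWW-Authenticate"]).groups()
    preimage = wallet.pay(invoice)  # step 3: pay, then retry with the proof
    return server.handle(tool, {"Authorization": f"L402 {macaroon}:{preimage}"})


if __name__ == "__main__":
    network = FakeLightning()
    result = call_tool(FakeServer(network), FakeWallet(network), "generate_image")
    print(result.status, result.body)  # 200 result of generate_image
```

The point of the design is that the server never needs to know who the agent is: a preimage that hashes to the invoice's payment hash is the only credential.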
What a Typical Agent Session Costs
A multi-tool autonomous session using several Sats4AI tools:
An entire multi-tool agent session for the price of a coffee. No subscription, no commitment — just pay for what you use.
Give Your Agent Superpowers
One config entry. 10+ AI tools. Pay-per-use with Lightning. No account, no API key, no credit card.