NodeTool is the open-source creative AI workspace. Models from major providers, including FAL, KIE, OpenAI, Anthropic, Gemini, and Replicate, are called with your own keys and wired into one node-based canvas. Image, video, audio, and text live on the same surface, with editing tools such as masks, inpaint, outpaint, relight, upscale, layers, and compositing built in. NodeTool runs as a desktop app on macOS, Windows, and Linux, or in the browser via NodeTool Cloud.

How is NodeTool different from ComfyUI?

ComfyUI is a Stable Diffusion power tool with engineer-first UX. NodeTool is the full creative workspace — every modality on one canvas, with the editing tools creatives actually use. NodeTool also supports a much wider model roster across providers and modalities, called with your own keys at provider prices.

How is NodeTool different from Weavy or other closed SaaS canvases?

Closed canvases lock you into a credit system and a curated model roster. NodeTool is open source and BYOK. You bring your own API keys to every provider, pay providers directly at provider prices, and own your workflows and files. Cloud is just our managed hosting of the same open-source code you can run yourself.

How does pricing work?

NodeTool Studio is free to download and use. NodeTool Cloud is a subscription for managed hosting. In both editions, you bring your own API keys to every provider and pay those providers directly at their list prices. NodeTool does not run model inference for you on its own servers, does not issue proprietary credits, and does not mark up model calls.

What models does NodeTool support?

Frontier models including Flux, Seedance, Wan, Veo, Kling, Hailuo, Qwen Image, Whisper, ElevenLabs, and Suno, called through providers like FAL, KIE, OpenAI, Anthropic, Gemini, Replicate, Together, Groq, Mistral, OpenRouter, and HuggingFace. Local inference is supported via MLX, Ollama, llama.cpp, vLLM, and LM Studio.

NodeTool is built for independent generative artists, motion designers, AI-native illustrators, technical art directors, ComfyUI power users frustrated with the UX, and small creative studios, brand teams, and post-production shops that work with AI every day.

Is NodeTool open source?

Yes. Both Studio and Cloud share the same AGPL-3.0 codebase. There is no closed-source layer and no "pro tier" hiding the good features. You can self-host any time.

Audio To Image

Name: NodeTool
Author: NodeTool

Transform spoken descriptions into images with this workflow. Record or upload audio, which is transcribed by Whisper and then visualized by Stable Diffusion. Perfect for quickly generating images from verbal ideas without typing.

Audio To Image — example output from the NodeTool workflow

The workflow

[Workflow editor view: Audio Input (Audio) → Automatic Speech Recognition (Text) → Text To Image (Prompt)]

Nodes in this workflow

3 nodes · 3 types

Audio Input: nodetool.input.AudioInput
Automatic Speech Recognition: nodetool.text.AutomaticSpeechRecognition
Text To Image: nodetool.image.TextToImage
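
To make the wiring concrete, the three node types listed above can be modeled as a small directed graph. The sketch below is illustrative only and does not use NodeTool's actual API or file format: the node ids (`audio_in`, `asr`, `text_to_image`), the handle names, and the `execution_order` helper are all assumptions. Only the three type strings come from this page. It uses Python's standard `graphlib` module to work out an order in which each node runs after the node that feeds it.

```python
from graphlib import TopologicalSorter

# Node ids are hypothetical labels; the type strings are the ones listed on this page.
NODES = {
    "audio_in": "nodetool.input.AudioInput",
    "asr": "nodetool.text.AutomaticSpeechRecognition",
    "text_to_image": "nodetool.image.TextToImage",
}

# Each edge is (source node, source output, target node, target input).
# The handle names ("output", "audio", "text", "prompt") are assumptions.
EDGES = [
    ("audio_in", "output", "asr", "audio"),
    ("asr", "text", "text_to_image", "prompt"),
]


def execution_order(nodes, edges):
    """Return node ids in an order where every node runs after its inputs."""
    deps = {node_id: set() for node_id in nodes}
    for source, _out, target, _inp in edges:
        deps[target].add(source)
    # TopologicalSorter takes a mapping of node -> predecessors.
    return list(TopologicalSorter(deps).static_order())


if __name__ == "__main__":
    for node_id in execution_order(NODES, EDGES):
        print(node_id, "->", NODES[node_id])
```

Rewiring the workflow, for example placing a text-editing step between transcription and image generation, would amount to adding one node and replacing one edge in this model.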

How to run it

01. Download NodeTool Studio
Install the free desktop app for macOS, Windows, or Linux. It runs on your own machine, and no account is required to start.

02. Open the Audio To Image template
Browse the built-in template library inside Studio and open this workflow onto the canvas. Every node is already wired up.

03. Add your keys
Connect the providers this workflow uses (Audio Input, Text To Image). Bring your own keys; you pay the provider directly.

04. Run and remix
Hit Run to execute the graph and watch results stream in. Swap models, edit prompts, or rewire nodes to make it yours.