Skip to content

Tools, prompts, and resources

The MCP server advertises live schemas via tools/list, prompts/list, and resources/list on connect, so your client always sees authoritative parameter shapes. The tables below summarize what's exposed so you know what to look for and which REST recipe each tool maps to.

Tools

Image generation

ToolPurpose
generate_imageText-to-image and image-edit. Engines: sdcpp, seedream, flux1-kontext, openai, gemini, grok, google, wan. Returns a resource link to each generated blob.
upscale_imageRepeated 2× upscale (1–3 passes → up to 8×).
convert_imageFormat conversion (jpeg / png / webp / gif) with optional resize.

Behavior maps directly to the image recipes — see Flux 2, SDXL, Image upscaling, and Image conversion for parameter and output details.

Video generation

ToolPurpose
generate_videoText-to-video and image-to-video. Engines: kling-v3, kling, haiper, veo3, wan, minimax, vidu, sora, grok, lightricks.
extract_video_framesSample frames at a configurable rate; perceptual-hash deduplication filters near-identical frames.
upscale_videoFlashVSR 2–4× upscaling.

See WAN, Kling, Veo 3, and Video upscaling for matching REST recipes.

Audio

ToolPurpose
transcribe_audioSpeech-to-text with optional word-level timestamps.
text_to_speechTTS with selectable speakers (aiden, dylan, eric, ryan, serena, sohee, vivian).

See Transcription and Text-to-speech.

Music

ToolPurpose
generate_musicACE Step 1.5. Supports structured lyrics with section markers like [Verse], [Chorus], [Bridge]. Returns MP3 audio or WebM with cover image.

See ACE-Step music generation.

Media analysis

ToolPurpose
caption_mediaGenerate a descriptive caption for an image or video.
rate_mediaNSFW level, blocked status, content labels. Optional sub-analyses for age classification, face recognition, AI detection, and anime recognition.
tag_mediaWD-style tagging with confidence scores and content-rating distribution.

Language models

ToolPurpose
chat_completionOpenRouter passthrough — any model from OpenAI, Anthropic, Google, Meta, Mistral, DeepSeek, Qwen, etc. Supports multi-turn system / user / assistant messages.

See Chat completion for the model ID format.

Prompt utilities

ToolPurpose
enhance_promptAnalyze and rewrite a generation prompt for a target ecosystem (sd1, sdxl, flux, ltx2). Returns the improved prompt with issues and recommendations.

See Prompt enhancement.

Discovery

ToolPurpose
find_modelsNatural-language model search across image, video, audio, and chat catalogs. Accepts queries like "fast cheap chat model" or a metrics ID like image/flux1-kontext/pro.

Workflow management

ToolPurposeAuth
submit_workflowSubmit raw workflow JSON — same shape as POST /v2/consumer/workflows. Use when a specific tool doesn't cover your case.optional
get_workflowStatus and output by workflow ID.optional
cancel_workflowCancel a running workflow.optional
list_workflowsRecent workflows for the authenticated user. Supports take, tags, excludeFailed.required

Prompts

The server ships three built-in MCP prompts that return ready-to-use guidance for multi-step pipelines. Clients can list and invoke them like any MCP prompt.

PromptInputWhat it returns
image_generation_guideintent (e.g. "photorealistic product photo", "anime character", "fast draft")Engine comparison table, quick recommendations, parameter tips.
video_creation_pipelineintent (e.g. "product showcase", "music video clip", "talking head")Recommended pipeline (image → video → upscale), engine selection matrix, example tool sequence.
content_analysis_pipelinemediaUrlStepwise plan: caption → tag → rate, with notes on when to use each.

Resources

URI templateMIMEBehavior
spine://blobs/{blobId}application/octet-streamImages are inlined as base64 content. Videos and audio return a 5-minute signed download URL. Returns an error if the blob does not exist.

Tools that produce media include resource links pointing at this URI template, so MCP clients can render outputs inline without a separate download step.

Capabilities advertised on initialize

json
{
  "protocolVersion": "2024-11-05",
  "capabilities": {
    "logging": {},
    "prompts": { "listChanged": true },
    "resources": { "listChanged": true },
    "tools": { "listChanged": true }
  },
  "serverInfo": {
    "name": "civitai-orchestration",
    "title": "Civitai Orchestration MCP Server",
    "description": "Generate images, videos, audio, and more via the Civitai Orchestration platform"
  }
}

Civitai Developer Documentation