Venice AI CLI Reference

Command-line interface for Venice AI. Chat with AI models, generate images, create audio and video, manage embeddings, and more — all from your terminal.

Installation

Prerequisites

Python 3.13+
A Venice AI API key (venice.ai)

Install with pip

pip install venice-ai[cli]

Install with Poetry

poetry add venice-ai[cli]

Development Install

git clone https://github.com/sethbang/venice-ai.git
cd venice-ai
poetry install --all-extras
poetry run venice --version

Quick Start

# Set your API key
export VENICE_API_KEY="your-api-key"

# Or run the interactive setup wizard
venice configure

# Chat with an AI model
venice chat start "What is quantum computing?"

# Generate an image
venice image generate "A sunset over mountains"

# List available models
venice models

Global Options

These options apply to the top-level venice command and affect all subcommands.

Option	Description
`--version`	Show version information and exit
`--plain`	Plain text output — no colors, panels, or emojis (see Plain Mode)
`--config <path>`	Path to a custom config file (default: `~/.venice/config.yaml`) — see note below
`--help`	Show help text for any command

venice --version
venice --plain chat start "Hello"
venice --config /path/to/config.yaml models

--config <path> is resolved for all subcommands: the chosen file's api.key and api.base_url are applied to every client the CLI constructs. The VENICE_API_KEY environment variable still takes precedence over api.key in the file. venice configure reads from and writes back to the same --config path, so venice --config ./team.yaml configure edits that file rather than the default ~/.venice/config.yaml.

Configuration

Interactive Setup

Run the configuration wizard to set your API key, default models, generation parameters, and output settings:

venice configure

The wizard walks through:

API key — enter or update your Venice AI API key
Default models — choose default models for chat, image, text-to-speech, speech-to-text, embedding, and video (text-to-video / image-to-video) from the live API
Generation parameters — set default temperature and max tokens
Output settings — configure the default image save directory
Features — enable or disable response streaming

Config File

Configuration is stored at ~/.venice/config.yaml:

api:
  key: your-api-key-here          # optional if VENICE_API_KEY env var is set
  base_url: https://api.venice.ai/api/v1

defaults:
  chat_model: venice-uncensored
  image_model: hidream
  max_completion_tokens: 2048
  temperature: 0.7

output:
  format: markdown
  images_dir: ~/Pictures/venice

features:
  streaming: true
  cost_tracking: true   # reserved for a future feature; not read by any CLI command yet

Environment Variables

Variable	Description
`VENICE_API_KEY`	Your Venice AI API key (takes precedence over config file)

Shell Completion

Generate shell completion scripts for tab-completion of commands and options.

# Bash
venice completion bash >> ~/.bashrc

# Zsh
venice completion zsh >> ~/.zshrc

# Fish
venice completion fish > ~/.config/fish/completions/venice.fish

Chat

Chat with AI models via streaming or one-shot messages. Supports interactive sessions, conversation history, piped input, Venice-specific parameters, and JSON output.

`venice chat start`

Start an interactive chat session or send a single message.

venice chat start [MESSAGE] [OPTIONS]

Arguments

Argument	Description
`MESSAGE`	Optional message to send. If omitted, starts an interactive session.

Options

Option	Short	Default	Description
`--model`	`-m`	runtime *	AI model to use
`--select-model`	`-S`	—	Interactively select a model from the API
`--system`	`-s`	—	System prompt to set AI personality
`--temperature`	`-t`	`0.7` ²	Temperature for response generation
`--max-completion-tokens`		`2048` ¹	Maximum tokens in response
`--stream / --no-stream`		`--stream`	Enable or disable streaming
`--show-thinking / --hide-thinking`		`--hide-thinking`	Show reasoning process for reasoning models
`--animation`	`-a`	`smooth`	Animation style: `none`, `smooth`, `word`, `char`, `line`, `typewriter`
`--animation-speed`		`0.03`	Animation speed in seconds (lower = faster)
`--show-stats / --hide-stats`		`--hide-stats`	Show streaming statistics after response
`--top-p`		—	Top-p (nucleus) sampling parameter (0.0–1.0)
`--web-search`		—	Enable web search: `auto`, `on`, `off`
`--character`		—	Character slug for character-driven chat
`--reasoning-effort`		—	Reasoning effort tier: `none`, `minimal`, `low`, `medium`, `high`, `xhigh`, `max` (per-model support)
`--strip-thinking / --no-strip-thinking`		—	Strip thinking/reasoning from response
`--disable-thinking`		`false`	Disable thinking/reasoning entirely
`--json-output`		`false`	Output the full API response as JSON
`--save`		`false`	Save conversation after session ends
`--continue-from`		—	Continue a previous conversation by ID (use `last` for most recent)

* No static default — resolved at runtime: --model wins, else a matching defaults.chat_model in your config if set, else the live API-recommended model.

¹ The --max-completion-tokens flag has no built-in default (the click option defaults to unset); 2048 reflects the example configuration. The value actually used falls back to defaults.max_completion_tokens in your ~/.venice/config.yaml when the flag is omitted.

² The --temperature flag has no built-in default (the click option defaults to unset); 0.7 reflects the example configuration. The value actually used falls back to defaults.temperature in your ~/.venice/config.yaml when the flag is omitted.

Examples

# Interactive session
venice chat start

# Single message
venice chat start "Explain the theory of relativity"

# Choose a specific model
venice chat start --model qwen3-235b "Summarize this article"

# System prompt with custom temperature
venice chat start --system "You are a Python tutor" --temperature 0.3

# Show reasoning process
venice chat start --show-thinking "Solve: what is 23 * 47?"

# Typewriter animation with stats
venice chat start --animation typewriter --show-stats "Write a haiku"

# Web search enabled
venice chat start --web-search on "What happened in tech news today?"

# Chat as a character
venice chat start --character alan-watts "What is the meaning of life?"

# Reasoning model with effort control
venice chat start --model deepseek-r1-0528 --reasoning-effort high "Prove P != NP"

# Non-streaming JSON output (for scripting)
venice chat start --no-stream --json-output "What is 2+2?"

# Save and resume conversations
venice chat start --save "Tell me about quantum computing"
venice chat start --continue-from last "What are its applications?"

# Pipe input from stdin
echo "Summarize this text" | venice chat start
cat article.txt | venice chat start --model llama-3.3-70b "Summarize:"

`venice chat history`

View and manage saved conversations.

venice chat history [OPTIONS]

Option	Default	Description
`--json`	`false`	Output conversation list as JSON
`--delete <id>`	—	Delete a conversation by ID

# List saved conversations
venice chat history

# Export as JSON
venice chat history --json

# Delete a conversation
venice chat history --delete abc12345

Image

Full-featured image generation, upscaling, editing, background removal, batch generation, style management, and presets.

`venice image generate`

Generate an image from a text prompt.

venice image generate [PROMPT] [OPTIONS]

Options

Option	Short	Default	Description
`--model`	`-m`	runtime *	Image model to use
`--size`	`-s`	`1024x1024`	Image size in WxH format (64–4096px per dimension)
`--num-images`	`-n`	`1`	Number of images to generate (1–4)
`--output`	`-o`	auto-generated	Output filename (without extension)
`--save-dir`		`~/Pictures/venice`	Directory to save images
`--steps`		model default	Inference steps (1–50, higher = better quality)
`--cfg-scale`		model default	CFG scale (0–20, higher = stricter prompt adherence)
`--seed`		random	Random seed for reproducibility
`--style-preset`	`-sp`	—	Style preset to apply (see `venice image list-styles`)
`--lora-strength`		—	LoRA strength 0–100
`--format`		`png`	Output format: `jpeg`, `png`, `webp`
`--aspect-ratio`		—	Aspect ratio for the output (e.g. `1:1`, `16:9`)
`--resolution`		—	Resolution tier: `1K`, `2K`, `4K`
`--quality`		—	Output quality for quality-aware models: `low`, `medium`, `high`
`--enable-web-search / --no-enable-web-search`		—	Allow the image model to incorporate web search results
`--safe-mode / --no-safe-mode`		—	Blur adult content
`--hide-watermark`		`false`	Hide the Venice watermark
`--embed-exif`		`false`	Embed generation metadata in EXIF
`--return-binary`		`false`	Return raw binary data (faster)
`--show-timing / --no-show-timing`		`--show-timing`	Show generation timing
`--preset`	`-p`	—	Load a saved or built-in preset
`--interactive`	`-i`	`false`	Launch the interactive wizard
`--open`		`false`	Open image after saving

* No static default — resolved at runtime: --model wins, else a matching defaults.image_model in your config if set, else the live API-recommended model.

Examples

# Basic generation
venice image generate "A serene mountain landscape at sunset"

# Specific model and size
venice image generate "Cyberpunk city" --model flux-dev --size 1280x720

# Fine-tuned generation
venice image generate "Portrait of a scientist" \
  --steps 40 --cfg-scale 7.5 --style-preset photographic \
  --format png --embed-exif

# Reproducible generation with seed
venice image generate "Cyberpunk cityscape" --seed 12345

# Aspect ratio, high-resolution tier, and quality
venice image generate "Editorial product shot" \
  --aspect-ratio 16:9 --resolution 4K --quality high

# Let the model pull in web search results
venice image generate "Today's top headline as an infographic" --enable-web-search

# Multiple images
venice image generate "Abstract art" --num-images 3 --save-dir ./art

# Using a preset
venice image generate "Fantasy landscape" --preset photorealistic

# Interactive wizard
venice image generate --interactive

# Open in viewer after generation
venice image generate "Cute cat" --open

`venice image upscale`

Upscale an image using AI.

venice image upscale <INPUT_FILE> [OPTIONS]

Option	Short	Default	Description
`--scale`		—	Scale factor (e.g., `2.0` for 2x)
`--enhance / --no-enhance`		—	Apply AI enhancement during upscaling
`--enhance-creativity`		—	Enhancement creativity (0.0–1.0)
`--enhance-prompt`		—	Style prompt for enhancement
`--replication`		—	Replication factor (0.0–1.0)
`--output`	`-o`	auto-generated	Output file path
`--save-dir`		`.`	Directory to save the result
`--open`		`false`	Open image after saving

venice image upscale photo.jpg --scale 2
venice image upscale photo.png --scale 4 --enhance --save-dir ./upscaled
venice image upscale logo.png --output logo_hd.png --open

`venice image edit`

Edit an image using a text prompt.

venice image edit <INPUT_FILE> --prompt <INSTRUCTION> [OPTIONS]

Option	Short	Default	Description
`--prompt`	`-p`	(required)	Edit instruction describing what to change
`--model`	`-m`	—	Edit model to use
`--output`	`-o`	auto-generated	Output file path
`--save-dir`		`.`	Directory to save the result
`--format`		input format	Filename-extension hint: `png`, `jpeg`, `webp`
`--aspect-ratio`		—	Aspect ratio for the output (e.g. `1:1`, `16:9`)
`--resolution`		—	Resolution tier: `1K`, `2K`, `4K`
`--output-format`		resolution-inferred	Output format sent to the API: `jpeg`, `png`, `webp`
`--safe-mode / --no-safe-mode`		—	Blur adult content (server default on)
`--open`		`false`	Open image after saving

--output-format is sent to the API and sets the real encoding; --format only hints the saved filename extension. When both are given, --output-format wins for the extension too.

venice image edit photo.jpg --prompt "Add a rainbow to the sky"
venice image edit portrait.png --prompt "Change hair color to red"
venice image edit scene.jpg -p "Make it nighttime" --format png --open
venice image edit photo.jpg -p "Editorial retouch" \
  --aspect-ratio 16:9 --resolution 4K --output-format jpeg
venice image edit photo.jpg -p "Add a rainbow" --no-safe-mode

`venice image multi-edit`

Edit an image using up to three layered inputs. The first image is the base; additional images are layered on top, guided by a single prompt.

venice image multi-edit --prompt <INSTRUCTION> --image <BASE_FILE> [OPTIONS]

Option	Short	Default	Description
`--prompt`	`-p`	(required)	Edit instruction describing the changes
`--image`	`-i`	(required)	Base image (first layer)
`--image-2`		—	Second layer image
`--image-3`		—	Third layer image
`--model`	`-m`	—	Edit model to use
`--output`	`-o`	auto-generated	Output file path
`--save-dir`		`.`	Directory to save the result
`--aspect-ratio`		—	Aspect ratio for the output (e.g. `1:1`, `16:9`)
`--resolution`		—	Resolution tier: `1K`, `2K`, `4K`
`--output-format`		input format	Output format: `jpeg`, `png`, `webp`
`--quality`		—	Output quality for quality-aware models: `low`, `medium`, `high`
`--open`		`false`	Open image after saving

venice image multi-edit --prompt "Combine these into one scene" --image base.png
venice image multi-edit -p "Blend the overlay onto the base" -i base.png --image-2 overlay.png
venice image multi-edit -p "Composite three layers" -i base.png --image-2 mid.png --image-3 top.png \
  --output-format png --quality high

`venice image remove-bg`

Remove the background from an image (returns transparent PNG by default).

venice image remove-bg <INPUT_FILE> [OPTIONS]

Option	Short	Default	Description
`--output`	`-o`	auto-generated	Output file path
`--save-dir`		`.`	Directory to save the result
`--format`		`png`	Output format: `png` (recommended), `jpeg`, `webp`
`--open`		`false`	Open image after saving

venice image remove-bg photo.jpg
venice image remove-bg product.png --save-dir ./cutouts --open
venice image remove-bg headshot.jpg --output headshot_nobg.png

`venice image batch`

Generate multiple images from a file of prompts (one prompt per line).

venice image batch --prompts-file <FILE> [OPTIONS]

Option	Short	Default	Description
`--prompts-file`	`-f`	(required)	File containing prompts, one per line
`--model`	`-m`	runtime *	Image model to use
`--size`	`-s`	`1024x1024`	Image size in WxH format
`--save-dir`		`~/Pictures/venice`	Directory to save images
`--steps`		model default	Inference steps
`--cfg-scale`		model default	CFG scale
`--seed`		random	Random seed
`--style-preset`	`-sp`	—	Style preset for all images
`--format`		`png`	Output format: `jpeg`, `png`, `webp`
`--safe-mode / --no-safe-mode`		—	Blur adult content
`--hide-watermark`		`false`	Hide Venice watermark
`--embed-exif`		`false`	Embed generation metadata in EXIF

* No static default — resolved at runtime: --model wins, else a matching defaults.image_model in your config if set, else the live API-recommended model.

# Create a prompts file
cat > prompts.txt << 'EOF'
A serene mountain lake at dawn
A bustling Tokyo street at night
A peaceful English garden in spring
EOF

# Generate all with consistent style
venice image batch --prompts-file prompts.txt --style-preset cinematic --steps 30

`venice image list-styles`

List all available style presets from the API.

venice image list-styles

Common styles include: photographic, cinematic, anime, fantasy-art, digital-art, 3d-model, pixel-art, and more.

`venice image presets`

Launch the interactive preset manager. Create, view, and delete image generation presets.

venice image presets

The interactive menu offers:

List all presets — view your custom presets
View built-in presets — see the 5 built-in configurations
Save current config as preset — create a new preset interactively
Delete a preset — remove a custom preset

Built-in Presets

Preset	Steps	CFG	Format	Description
`photorealistic`	30	7.5	png	High-quality photorealistic images
`artistic`	25	9.0	webp	Artistic and creative styles
`quick`	15	7.0	webp	Fast generation with good quality
`high-quality`	50	8.0	png	Maximum quality (slower)
`creative`	20	5.0	webp	More creative freedom, less prompt adherence

Custom presets are stored in ~/.venice/presets/ as JSON files.

# Apply a preset during generation
venice image generate "Portrait photo" --preset photorealistic

# Override specific preset values
venice image generate "Abstract art" --preset artistic --steps 40

Audio

Text-to-speech and speech-to-text capabilities.

`venice audio speak`

Convert text to speech.

venice audio speak [TEXT] [OPTIONS]

Option	Short	Default	Description
`--model`	`-m`	runtime *	TTS model to use
`--voice`	`-v`	`af_heart`	Voice to use
`--format`		`mp3`	Output format: `mp3`, `wav`, `opus`, `flac`, `aac`, `pcm`
`--speed`		`1.0`	Speech speed (0.25–4.0)
`--output`	`-o`	`speech_<timestamp>.<format>`	Output file path
`--save-dir`		`.`	Directory to save the audio file

* No static default — resolved at runtime: --model wins, else a matching defaults.tts_model in your config if set, else the live API-recommended model.

# Basic text-to-speech
venice audio speak "Hello, welcome to Venice AI!"

# Custom voice and format
venice audio speak "Good morning" --voice af_heart --format wav

# Adjust speed and save location
venice audio speak "This is a fast narration" --speed 1.5 --output narration.mp3

# Pipe text from stdin
echo "Read this aloud" | venice audio speak
cat script.txt | venice audio speak --output script_audio.mp3

`venice audio transcribe`

Transcribe audio to text.

venice audio transcribe <FILE> [OPTIONS]

Option	Short	Default	Description
`--model`	`-m`	runtime *	STT model to use
`--language`	`-l`	auto-detect	Language code (e.g., `en`, `es`, `fr`)
`--timestamps`		`false`	Include word-level timestamps
`--format`		`text`	Output format: `text`, `json`
`--output`	`-o`	stdout	Save transcription to file

* No static default — resolved at runtime: --model wins, else a matching defaults.stt_model in your config if set, else the live API-recommended model.

# Basic transcription
venice audio transcribe recording.mp3

# With language hint and timestamps
venice audio transcribe meeting.wav --language en --timestamps

# Save to file
venice audio transcribe interview.mp3 --output transcript.txt

# JSON output
venice audio transcribe audio.mp3 --format json

`venice audio voices`

List available TTS voices. Wraps Audio.get_voices(). Supports filtering by gender, region, and model.

venice audio voices
venice audio voices --gender female
venice audio voices --region af --json
venice audio voices --model tts-kokoro

Video

Generate videos from text prompts or animate existing images. Video generation is asynchronous — jobs are queued and polled until complete.

`venice video generate`

Generate a video from a text prompt.

venice video generate <PROMPT> [OPTIONS]

Option	Short	Default	Description
`--model`	`-m`	runtime *	Video model to use
`--duration`		`5s`	Duration (e.g., `5s`, `10s`)
`--resolution`		—	Output resolution (e.g., `720p`, `1080p`)
`--aspect-ratio`		`16:9`	Aspect ratio (e.g., `16:9`, `9:16`, `1:1`)
`--negative-prompt`		—	Content to avoid in the video
`--audio / --no-audio`		—	Generate audio if the model supports it (omitted from the request unless set)
`--reference-image-urls`		—	Reference image URL for character/style consistency (repeatable, up to 9)
`--reference-video-urls`		—	Reference video URL for R2V models (repeatable, up to 3)
`--reference-audio-urls`		—	Reference audio donor URL for R2V models (repeatable, up to 3)
`--end-image-url`		—	End-frame image URL for models that support transitions
`--output`	`-o`	`video_<timestamp>.mp4`	Output file path
`--save-dir`		`.`	Directory to save the video
`--no-poll`		`false`	Queue the job and return the job ID without waiting
`--open`		`false`	Open the video after saving

* No static default — resolved at runtime: --model wins, else a matching defaults.video_t2v_model in your config if set, else the live API-recommended model.

The CLI exposes the flat reference/transition fields above. The structured elements[] (per-segment prompts) and scene_image_urls parameters remain SDK-only — pass them through client.video.run()/submit().

# Basic video generation
venice video generate "A sunset over the ocean with gentle waves"

# Custom duration and aspect ratio
venice video generate "A cat playing piano" --duration 10s --aspect-ratio 1:1

# Cinematic high-res
venice video generate "Cinematic drone shot over mountains" --resolution 1080p

# Queue without waiting
venice video generate "Quick preview scene" --no-poll

# Reference images for character consistency, plus audio
venice video generate "She walks toward the camera" \
  --audio --reference-image-urls https://example.com/ref1.png

# Transition to an end frame
venice video generate "Morning to night timelapse" \
  --end-image-url https://example.com/night.png

`venice video from-image`

Animate an image into a video. The prompt describes the desired motion, not the image content.

venice video from-image <INPUT_FILE> [OPTIONS]

Option	Short	Default	Description
`--prompt`	`-p`	`""`	Motion prompt to guide animation
`--model`	`-m`	runtime *	Image-to-video model
`--duration`		`5s`	Duration (e.g., `5s`, `10s`)
`--resolution`		—	Output resolution
`--audio / --no-audio`		—	Generate audio if the model supports it (omitted from the request unless set)
`--reference-image-urls`		—	Reference image URL for character/style consistency (repeatable, up to 9)
`--reference-video-urls`		—	Reference video URL for R2V models (repeatable, up to 3)
`--reference-audio-urls`		—	Reference audio donor URL for R2V models (repeatable, up to 3)
`--end-image-url`		—	End-frame image URL for models that support transitions
`--output`	`-o`	`video_<timestamp>.mp4`	Output file path
`--save-dir`		`.`	Directory to save the video
`--no-poll`		`false`	Queue without waiting
`--open`		`false`	Open the video after saving

* No static default — resolved at runtime: --model wins, else a matching defaults.video_i2v_model in your config if set, else the live API-recommended model.

As with generate, structured elements[] and scene_image_urls parameters remain SDK-only.

# Animate an image with default motion
venice video from-image landscape.jpg

# Describe the motion
venice video from-image photo.png --prompt "Gentle breeze through the trees"

# Longer duration
venice video from-image portrait.jpg --duration 10s --open

`venice video status`

Check the status of a video generation job (useful with --no-poll).

venice video status <JOB_ID> [OPTIONS]

Option	Short	Default	Description
`--model`	`-m`	—	Model used when the job was queued

No default is applied here; pass the same model the job was queued with.

venice video status abc123
venice video status abc123 --model wan-2.6-image-to-video

Characters

Browse and inspect pre-configured AI personalities and specialized assistants.

`venice characters list`

List available AI characters.

venice characters list [OPTIONS]

Option	Default	Description
`--json`	`false`	Output as JSON
`--search`	—	Free-text search across name, description, tags, and hashtags. Server-side — the API searches the full catalog rather than filtering a locally fetched first page.
`--sort-by`	—	Sort mode: `featured`, `highestRating`, `highlyRated`, `highlyRatedAndRecent`, `imports`, `mostRecent`, `ratingCount`
`--sort-order`	—	Sort order applied to `--sort-by`: `asc`, `desc`
`--tags`	—	Filter by tag name (repeatable)
`--categories`	—	Filter by category name (repeatable)
`--limit`	—	Number of characters to return
`--offset`	—	Number of characters to skip (for paging)
`--adult / --no-adult`	—	Filter by the adult-content flag
`--pro / --no-pro`	—	Filter to characters using pro models
`--web-enabled / --no-web-enabled`	—	Filter to web-enabled characters
`--model-id`	—	Filter by model ID

venice characters list
venice characters list --search coding
venice characters list --sort-by highlyRated --limit 20
venice characters list --tags philosophy --web-enabled
venice characters list --categories education
venice characters list --json

`venice characters info`

Show detailed information about a specific character.

venice characters info <SLUG> [OPTIONS]

Option	Default	Description
`--json`	`false`	Output as JSON

The human-readable view shows the slug, model, description, tags, adult/web flags, import count, and timestamps. --json emits the full record — id, author, featured, isOwner, shareUrl, photoUrl, createdAt/updatedAt, and a nested stats object (imports, averageRating, ratingCount, ratingSum, userRating).

venice characters info alan-watts
venice characters info alan-watts --json

Use a character in chat with --character:

venice chat start --character alan-watts "What is the meaning of life?"

`venice characters reviews`

Show public reviews for a character.

venice characters reviews <SLUG> [OPTIONS]

Option	Default	Description
`--json`	`false`	Output as JSON
`--page`	—	1-indexed page number
`--page-size`	—	Reviews per page

The summary line reports the average rating and total review count; each review shows its rating, creation date, and message.

venice characters reviews alan-watts
venice characters reviews alan-watts --page 2 --page-size 50
venice characters reviews alan-watts --json

Embeddings

Generate text embeddings for semantic analysis, similarity search, and clustering.

venice embeddings [TEXT] [OPTIONS]

Option	Short	Default	Description
`--model`	`-m`	runtime *	Embedding model to use
`--encoding-format`		`float`	Encoding format: `float`, `base64`
`--dimensions`		model default	Number of output dimensions
`--json`		`false`	Output full JSON response with embedding vector
`--output`	`-o`	stdout	Save embeddings to file

* No static default — resolved at runtime: --model wins, else a matching defaults.embedding_model in your config if set, else the live API-recommended model.

# Generate embeddings
venice embeddings "The quick brown fox"

# Full JSON output with vector
venice embeddings "Hello world" --json

# Save to file
venice embeddings "Some text" --output embeddings.json

# Pipe text from stdin
echo "Embed this text" | venice embeddings

# Custom dimensions
venice embeddings "Search query" --dimensions 256

Models

List, filter, compare, and inspect available AI models. The models command supports extensive filtering by type, capabilities, price, and status.

venice models [OPTIONS]

Display Options

Option	Short	Default	Description
`--verbose`	`-v`	`false`	Show detailed information for all models
`--json`		`false`	Output as JSON for scripting
`--currency`		`both`	Currency to display: `usd`, `diem`, `both`
`--no-legend`		`false`	Hide the capability icon legend

Type Filtering

Option	Short	Description
`--type`	`-t`	Filter by type — can be repeated. Choices: `text`, `image`, `embedding`, `tts`, `asr`, `music`, `upscale`, `inpaint`, `video`. An unknown value errors instead of returning nothing.

Capability Filtering

Option	Description
`--function-calling`	Models with function calling support
`--vision`	Models with vision capabilities
`--reasoning`	Models with reasoning support
`--web-search`	Models with web search
`--code`	Models optimized for code
`--response-schema`	Models supporting response schemas

Trait Filtering

Option	Description
`--trait <value>`	Filter by trait (repeatable)

Price Filtering

Option	Description
`--max-input <price>`	Maximum input price per 1M tokens (USD)
`--max-output <price>`	Maximum output price per 1M tokens (USD)
`--max-gen <price>`	Maximum generation price (for images)
`--budget <price>`	Maximum average price (input+output)/2

Status Filtering

Option	Description
`--beta`	Show only beta models
`--no-beta`	Exclude beta models
`--online`	Show only online models

Search & Detail

Option	Description
`--search <query>`	Search in model names and descriptions
`--id <model-id>`	Show details for a specific model
`--compare <id1,id2,...>`	Side-by-side comparison of models (comma-separated)

Sorting

Option	Default	Values
`--sort`	`name`	`name`, `id`, `price-asc`, `price-desc`, `context`, `created`

Examples

# List all models
venice models

# Text models only, sorted by price
venice models --type text --sort price-asc

# Image models
venice models --type image

# Models with vision and function calling
venice models --vision --function-calling

# Budget-friendly models under $1/M tokens
venice models --budget 1.0

# Exclude beta models
venice models --no-beta

# Search for a model
venice models --search "llama"

# Detailed view of a specific model
venice models --id qwen3-235b

# Compare two models side-by-side
venice models --compare "qwen3-235b,llama-3.3-70b"

# JSON output for scripting
venice models --json

# Verbose listing with full per-model detail
venice models --verbose

`venice models resolve`

Auto-pick a model by type and capabilities. Wraps Models.resolve() and the ten typed resolve_*() helpers (resolve_chat, resolve_image, resolve_embedding, etc.). Useful in scripts when you want "the cheapest model that supports vision + function calling" without hardcoding a specific model ID.

venice models resolve --type chat --function-calling
venice models resolve --type image
venice models resolve --type chat --vision --min-context-tokens 32000

`venice models get`

Show the full model record for a single ID. Wraps Models.get(). Pass --json for machine-readable output.

venice models get llama-3.3-70b
venice models get qwen3-235b --json

`venice models capabilities`

Show the structured Capabilities for a single model (Models.get_capabilities()). Includes function-calling, vision, reasoning, web-search, code, response-schema, and TEE/e2ee flags.

venice models capabilities llama-3.3-70b
venice models capabilities qwen3-235b --json

Account

View account billing information and usage statistics.

`venice account balance`

Show current account balance and credits.

venice account balance [OPTIONS]

Option	Default	Description
`--json`	`false`	Output as JSON

venice account balance
venice account balance --json

`venice account usage`

Show usage statistics for your account. Displays billing entries, costs, and a breakdown by product SKU.

venice account usage [OPTIONS]

Option	Default	Description
`--json`	`false`	Output as JSON
`--start-date`	—	Start date (`YYYY-MM-DD` or ISO 8601)
`--end-date`	—	End date (`YYYY-MM-DD` or ISO 8601)
`--currency`	—	Filter entries by currency: `USD`, `DIEM`, `VCU`, `BUNDLED_CREDITS`

venice account usage
venice account usage --start-date 2025-01-01 --end-date 2025-02-01
venice account usage --currency USD
venice account usage --json

`venice account keys get`

Retrieve a single API key by ID. Wraps client.api_keys.retrieve(). The top-level venice api-keys group covers list/create/delete; the account keys subgroup adds the single-key retrieve / update surface.

venice account keys get key_abc123
venice account keys get key_abc123 --json

`venice account keys update`

Update an existing API key. Wraps client.api_keys.update(). At least one of --description, --expiry, --limit-usd, --limit-diem, --limit-vcu, or --limit-period must be provided.

Option	Description
`--description`	New description for the key
`--expiry`	New expiration (ISO 8601, e.g. `2026-12-31`)
`--limit-usd`	USD consumption limit
`--limit-diem`	DIEM consumption limit
`--limit-vcu`	VCU consumption limit (legacy alias for diem)
`--limit-period`	Period over which the consumption limit resets: `EPOCH`, `MONTH`, `LIFETIME`
`--json`	Output JSON

venice account keys update key_abc123 --description "Production"
venice account keys update key_abc123 --expiry 2026-12-31
venice account keys update key_abc123 --limit-usd 50 --limit-diem 5
venice account keys update key_abc123 --limit-period MONTH

`venice account keys rate-limits`

Show the current rate limits (per-model RPM / RPD / TPM) for your API key. Wraps client.api_keys.get_rate_limits(). The header reports your API tier, whether access is permitted, and when the next epoch begins.

Option	Default	Description
`--json`	`false`	Output as JSON

venice account keys rate-limits
venice account keys rate-limits --json

`venice account keys rate-limit-logs`

Show the most recent rate-limit violations for your account (up to 50). Wraps client.api_keys.get_rate_limit_logs(). Each entry lists a timestamp, model, limit type, and tier.

Option	Default	Description
`--json`	`false`	Output as JSON

venice account keys rate-limit-logs
venice account keys rate-limit-logs --json

API Keys

Manage your Venice AI API keys. List, create, and delete keys for your account.

`venice api-keys list`

List all API keys.

venice api-keys list [OPTIONS]

Option	Default	Description
`--json`	`false`	Output as JSON

venice api-keys list
venice api-keys list --json

`venice api-keys create`

Create a new API key. The secret key is only shown once — save it immediately.

venice api-keys create --name <NAME> [OPTIONS]

Option	Default	Description
`--name` (alias: `--description`)	(required)	Name / description for the new key
`--type`	`INFERENCE`	API key type: `INFERENCE`, `ADMIN`
`--limit-usd`	—	USD consumption limit
`--limit-diem`	—	DIEM consumption limit
`--limit-vcu`	—	VCU consumption limit (legacy alias for `--limit-diem`; folds into `diem` unless `--limit-diem` is also given)
`--limit-period`	—	Period over which the consumption limit resets: `EPOCH`, `MONTH`, `LIFETIME`
`--expiry`	—	Expiration date (ISO 8601, e.g. `2026-12-31` or `2026-12-31T23:59:00Z`)
`--json`	`false`	Output as JSON

venice api-keys create --name "Production Key"
venice api-keys create --name "Admin" --type ADMIN
venice api-keys create --name "Capped" --limit-usd 50 --limit-period MONTH
venice api-keys create --name "Expiring" --expiry 2026-12-31
venice api-keys create --name "CI/CD Pipeline" --json

`venice api-keys delete`

Delete an API key. Prompts for confirmation before deletion.

venice api-keys delete <KEY_ID> [OPTIONS]

Option	Short	Default	Description
`--yes`	`-y`	`false`	Skip the confirmation prompt (for scripting / CI)

venice api-keys delete key_abc123

# Non-interactive deletion (no confirmation prompt)
venice api-keys delete key_abc123 --yes

Lint

venice lint <path> walks Python source files and flags v1 / OpenAI-style / non-idiomatic patterns that won't work or work sub-optimally on Venice SDK v2 — AsyncVeniceClient imports, hardcoded model IDs, max_tokens= kwargs, PaymentRequiredError.payment_instructions accesses, and a dozen others.

Output is in flake8-compatible path:line:col: CODE message format, suitable for editor integrations and CI pipelines.

Quick examples

# Lint a directory tree
venice lint src/

# Lint a single file
venice lint path/to/script.py

# Treat informational findings (V401) as errors — exit 1 if any are present
venice lint --strict src/

# Filter to specific rule codes
venice lint --code V100 --code V200 src/

Exit codes

0 — clean, or only informational findings (without --strict).
1 — at least one error-level finding (or any finding with --strict).

Rule codes

Code	Severity	Pattern
`V000`	error	could not parse (syntax error)
`V100`	error	`AsyncVeniceClient` import or call (gone in v2 — use `VeniceClient`)
`V101`	error	`client.image.generate(...)` (gone — use `client.image.create(...)`)
`V102`	error	`client.audio.generate_music(...)` (gone — use `client.music.run(...)`)
`V103`	error	`client.audio.generate_speech(...)` (gone — use `client.audio.create_speech(...)`)
`V104`	error	`client.video.queue(...)` (gone — use `client.video.run/submit(...)`)
`V105`	error	`client.video.complete(...)` (gone — use `client.video.cancel(...)`)
`V106`	error	`client.embeddings.generate(...)` (gone — use `client.embeddings.create(...)`)
`V107`	error	`client.audio.transcribe(audio=...)` (kwarg renamed — use `file=`)
`V200`	error	`max_tokens=` kwarg (removed — use `max_completion_tokens=`)
`V300`	error	hardcoded model string on `model=` (use `client.models.resolve_*()`)
`V401`	informational	`client.chat.completions.create(stream=True)` (works, but bypasses the v2 streaming idiom)
`V501`	error	`BudgetManager(limit=...)` (use `daily_usd=` / `monthly_usd=`)
`V502`	error	`tracker.total` accessor (real attr is `total_cost_usd`)
`V503`	error	`tracker.calls` accessor (use `len(tracker.requests)` or `total_tokens`)
`V601`	error	`e.payment_instructions` access (real attr is `e.body`)

The lint runs entirely on AST — no execution of user code. Use it as a pre-commit hook or in CI to catch v1 leftovers before they reach production.

Health

venice health runs a sequence of small checks against the Venice API and reports each one. Use it after configuring a fresh environment, when something feels off, or as a CI smoke test.

Quick examples

# Default checks: API key, models catalog, billing balance
venice health

# Add a tiny embedding ping (small cost, typically <$0.001)
venice health --full

# Add an x402 prepaid-ledger balance read using a wallet env var
venice health --wallet                                  # default env var: VENICE_X402_TEST_PRIVATE_KEY
venice health --wallet --wallet-env MY_WALLET_KEY       # custom env var

# Combine
venice health --full --wallet

Default checks

Check	What it does
`api_key`	Verifies a key is reachable via `VENICE_API_KEY` env var or saved config
`models.list`	Fetches the `text` model catalog (auth + connectivity sanity)
`billing.get_balance`	Reads the account's USD / DIEM balance

Optional checks (opt-in via flags)

Flag	Adds	Notes
`--full`	`embeddings.create(input="ping")`	Small cost; verifies the embedding endpoint
`--wallet`	`client.x402.balance(auth=X402Auth(...))`	Requires the `[x402]` extra and a wallet private key in `--wallet-env` (default: `VENICE_X402_TEST_PRIVATE_KEY`)

Exit codes

0 — every check passed.
1 — at least one check failed (the failing line is printed to stderr).

Sample output

ℹ️  venice health
  ✓ api_key                  present (vn_a…fde9)
  ✓ models.list              78 text models reachable
  ✓ billing.get_balance      USD $35.71774095 / DIEM 0.7503080828059601
  ✓ embeddings.create        text-embedding-bge-m3 (3 tokens)
  ✓ x402.balance             0xecB6627A… $4.99814435

✅ All 5 checks passed.

The --plain flag substitutes plain-text equivalents for the emojis / colors.

Plain Mode

The --plain flag disables all Rich formatting — colors, panels, emojis, and spinners — producing clean text suitable for piping, logging, and scripting.

Plain mode affects every command:

# Plain chat output
venice --plain chat start "Hello"

# Plain model listing
venice --plain models --type text

# Plain image generation
venice --plain image generate "A mountain" --output mountain

In plain mode:

Status messages use INFO:, ERROR:, SUCCESS: prefixes instead of emojis
Tables are rendered as aligned text columns
Progress spinners are replaced by text status messages
Markdown content is printed as raw text

Piping & Scripting

The CLI is designed for use in scripts and pipelines. Combine --plain, --json, and stdin piping for automation.

Piping Input via stdin

Chat, audio speak, and embeddings support piped input:

# Pipe text to chat
echo "Summarize this in one sentence" | venice chat start

# Pipe a file to chat
cat README.md | venice chat start "Summarize this document"

# Pipe text to TTS
echo "Hello world" | venice audio speak --output greeting.mp3

# Pipe text to embeddings
echo "Semantic search query" | venice embeddings --json

JSON Output for Scripting

Most commands support --json for machine-readable output:

# Get model list as JSON
venice models --json | jq '.[] | .id'

# Get account balance as JSON
venice account balance --json | jq '.balances.usd'

# Get character list as JSON
venice characters list --json | jq '.[].slug'

# Get API keys as JSON
venice api-keys list --json

# Chat with JSON response
venice chat start --no-stream --json-output "What is 2+2?"

Combining Plain Mode with Pipelines

# Extract just the response text
venice --plain chat start "What is the capital of France?" 2>/dev/null | grep "^Assistant:" | cut -d: -f2-

# Batch process a list of prompts
while IFS= read -r prompt; do
  venice --plain image generate "$prompt" --save-dir ./batch-output
done < prompts.txt

# Generate embeddings for multiple texts
cat sentences.txt | while IFS= read -r line; do
  venice --plain embeddings "$line" --json >> all_embeddings.jsonl
done

Usage in CI/CD

# Verify API key works
venice --plain account balance || exit 1

# Generate assets in a build pipeline
venice --plain image generate "App icon" --size 512x512 --output icon --save-dir ./assets
venice --plain audio speak "Welcome to our app" --output welcome.mp3 --save-dir ./assets

Troubleshooting

API Key Not Found

export VENICE_API_KEY="your-key"
# or
venice configure

Module Not Found

pip install venice-ai[cli]

Command Not Found

# If using Poetry
poetry run venice --help

# Or activate the virtual environment
poetry shell
venice --help

Model Not Found

Run venice models to see available models. Use venice models --type text or venice models --type image to filter by type. Model IDs must match exactly.

Installation​

Prerequisites​

Install with pip​

Install with Poetry​

Development Install​

Quick Start​

Global Options​

Configuration​

Interactive Setup​

Config File​

Environment Variables​

Shell Completion​

Chat​

venice chat start​

Arguments​

Options​

Examples​

venice chat history​

Image​

venice image generate​

Options​

Examples​

venice image upscale​

venice image edit​

venice image multi-edit​

venice image remove-bg​

venice image batch​

venice image list-styles​

venice image presets​

Built-in Presets​

Audio​

venice audio speak​

venice audio transcribe​

venice audio voices​

Video​

venice video generate​

venice video from-image​

venice video status​

Characters​

venice characters list​

venice characters info​

venice characters reviews​

Embeddings​

Models​

Display Options​

Type Filtering​

Capability Filtering​

Trait Filtering​

Price Filtering​

Status Filtering​

Search & Detail​

Sorting​

Examples​

venice models resolve​

venice models get​

venice models capabilities​

Account​

venice account balance​

venice account usage​

venice account keys get​

venice account keys update​

venice account keys rate-limits​

venice account keys rate-limit-logs​

API Keys​

venice api-keys list​

venice api-keys create​

venice api-keys delete​

Lint​

Quick examples​

Exit codes​

Rule codes​

Health​

Quick examples​

Default checks​

Optional checks (opt-in via flags)​

Exit codes​

Sample output​

Plain Mode​

Piping & Scripting​

Piping Input via stdin​

Installation

Prerequisites

Install with pip

Install with Poetry

Development Install

Quick Start

Global Options

Configuration

Interactive Setup

Config File

Environment Variables

Shell Completion

Chat

`venice chat start`

Arguments

Options

Examples

`venice chat history`

Image

`venice image generate`

Options

Examples

`venice image upscale`

`venice image edit`

`venice image multi-edit`

`venice image remove-bg`

`venice image batch`

`venice image list-styles`

`venice image presets`

Built-in Presets

Audio

`venice audio speak`

`venice audio transcribe`

`venice audio voices`

Video

`venice video generate`

`venice video from-image`

`venice video status`

Characters

`venice characters list`

`venice characters info`

`venice characters reviews`

Embeddings

Models

Display Options

Type Filtering

Capability Filtering

Trait Filtering

Price Filtering

Status Filtering

Search & Detail

Sorting

Examples

`venice models resolve`

`venice models get`

`venice models capabilities`

Account

`venice account balance`

`venice account usage`

`venice account keys get`

`venice account keys update`

`venice account keys rate-limits`

`venice account keys rate-limit-logs`

API Keys

`venice api-keys list`

`venice api-keys create`

`venice api-keys delete`

Lint

Quick examples

Exit codes

Rule codes

Health

Quick examples

Default checks

Optional checks (opt-in via flags)

Exit codes

Sample output

Plain Mode

Piping & Scripting

Piping Input via stdin