Models and Parameters

All Models

The Models view aggregates every model from every discovered service into a single searchable list. Use the search bar to filter by name.

Favorites and Aliases

Star a model to pin it in the chat dropdown. You can also create aliases — short names that map to a specific service/model pair:

@fast  →  ollama/llama3
@smart →  lmstudio/deepseek-r1

Aliases can be typed directly into the model dropdown search.
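
Under the hood, an alias is just a table lookup. Here is a minimal Python sketch, assuming aliases are stored as a mapping from "@name" to a (service, model) pair; the storage format is an assumption for illustration, not the app's actual schema:

ALIASES = {
    "@fast": ("ollama", "llama3"),
    "@smart": ("lmstudio", "deepseek-r1"),
}

def resolve_alias(query: str) -> tuple[str, str] | None:
    # Return the (service, model) pair for a known alias, else None.
    return ALIASES.get(query.strip())

print(resolve_alias("@fast"))  # ('ollama', 'llama3')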

Manual Endpoints

Add backends that are not auto-discovered. Click Add Endpoint and provide:

Field      Description
Name       Display name for the service
URL        Base URL (e.g., http://192.168.1.50:11434)
API type   openai, ollama, or anthropic

Manual endpoints appear alongside discovered services.
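
Before saving an endpoint, it can help to check that it is actually reachable. A hedged sketch follows: the probe paths are the standard list-models routes for the openai and ollama API types, but the validation function itself is illustrative, not the app's code:

import requests

PROBE_PATHS = {
    "openai": "/v1/models",  # OpenAI-compatible servers (incl. LM Studio)
    "ollama": "/api/tags",   # Ollama's local model listing
}

def probe_endpoint(base_url: str, api_type: str, timeout: float = 3.0) -> bool:
    path = PROBE_PATHS.get(api_type)
    if path is None:
        # e.g., anthropic, which requires an API key to list models;
        # skip the unauthenticated probe in this sketch.
        return True
    try:
        resp = requests.get(base_url.rstrip("/") + path, timeout=timeout)
        return resp.ok
    except requests.RequestException:
        return False

print(probe_endpoint("http://192.168.1.50:11434", "ollama"))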

Settings Overlay

Click the gear icon on any model or service to open the settings overlay. Settings have two scopes:

  • Global — applies to all services unless overridden
  • Per-service — overrides global for a specific backend
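
Resolution is a straightforward merge: per-service values win, and anything not overridden falls back to global. A minimal sketch, assuming settings are plain key-value dictionaries (the override shown is hypothetical):

GLOBAL_SETTINGS = {"temperature": 0.7, "max_tokens": 4096, "top_p": 1.0}
PER_SERVICE = {
    "ollama/llama3": {"temperature": 0.2},  # hypothetical override
}

def effective_settings(service_model: str) -> dict:
    merged = dict(GLOBAL_SETTINGS)                      # start from global defaults
    merged.update(PER_SERVICE.get(service_model, {}))   # per-service wins
    return merged

print(effective_settings("ollama/llama3"))
# {'temperature': 0.2, 'max_tokens': 4096, 'top_p': 1.0}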

Sampling Parameters

Parameter           Range            Default
temperature         0.0 – 2.0        0.7
max_tokens          1 – model max    4096
top_p               0.0 – 1.0        1.0
top_k               1 – 500          40
frequency_penalty   -2.0 – 2.0       0.0
presence_penalty    -2.0 – 2.0       0.0
seed                integer
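
For an OpenAI-compatible backend, these values typically travel in the body of a chat-completions request. A sketch using the example endpoint from earlier; note that not every backend honors every field (top_k, for instance, is common on local servers but absent from the official OpenAI API):

import requests

payload = {
    "model": "llama3",
    "messages": [{"role": "user", "content": "Write a haiku about rain."}],
    "temperature": 0.7,
    "max_tokens": 4096,
    "top_p": 1.0,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
    "seed": 42,  # fixed seed for reproducible sampling, where supported
}
resp = requests.post("http://192.168.1.50:11434/v1/chat/completions", json=payload)
print(resp.json()["choices"][0]["message"]["content"])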

Stop Sequences

Add one or more strings that cause the model to stop generating. Type each sequence and press Enter to add it.
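
On the wire, the sequences usually travel as a list of strings. In an OpenAI-style request body they might look like this (the field name follows the OpenAI convention; the sequences are illustrative):

payload = {
    "model": "llama3",
    "messages": [{"role": "user", "content": "List three colors."}],
    "stop": ["###", "\n\nUser:"],  # generation halts at the first match
}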

System Prompt

Set a system prompt at global or per-service scope. The per-service prompt fully replaces the global one when set.

You are a helpful coding assistant. Always include type annotations.
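
A sketch of that replacement semantics: the per-service prompt, when present, wins outright (no concatenation), and the winner becomes the first message of the chat. The per-service prompt below is hypothetical:

GLOBAL_PROMPT = "You are a helpful coding assistant. Always include type annotations."
PER_SERVICE_PROMPTS = {
    "lmstudio/deepseek-r1": "You are a terse code reviewer.",  # hypothetical override
}

def build_messages(service_model: str, user_text: str) -> list[dict]:
    # Per-service fully replaces global -- the two are never combined.
    system = PER_SERVICE_PROMPTS.get(service_model, GLOBAL_PROMPT)
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": user_text},
    ]

print(build_messages("ollama/llama3", "Refactor this function.")[0])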

Response Format

Choose between:

  • Text — default free-form output
  • JSON schema — paste a JSON schema to constrain model output to that structure
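
Here is what a schema-constrained request can look like against an OpenAI-compatible backend. The response_format shape follows OpenAI's structured-outputs convention; other API types spell this differently (Ollama's native API, for example, takes the schema in a format field):

import requests

schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "tags": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["title", "tags"],
}
payload = {
    "model": "llama3",
    "messages": [{"role": "user", "content": "Summarize this repo as JSON."}],
    "response_format": {
        "type": "json_schema",
        "json_schema": {"name": "summary", "schema": schema},
    },
}
resp = requests.post("http://192.168.1.50:11434/v1/chat/completions", json=payload)
print(resp.json()["choices"][0]["message"]["content"])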

Ollama-Specific Parameters

When an Ollama service is selected, additional parameters are available:

Parameter        Description
keep_alive       How long to keep the model loaded after last use (e.g., 5m, 24h)
mirostat         Mirostat sampling mode (0 = disabled, 1 = Mirostat, 2 = Mirostat 2.0)
repeat_penalty   Penalty for repeated tokens (1.0 = no penalty)
repeat_last_n    Lookback window, in tokens, for the repeat penalty
min_p            Minimum probability threshold for sampling
tfs_z            Tail-free sampling parameter (1.0 = disabled)
typical_p        Locally typical sampling threshold
num_ctx          Context window size in tokens
num_batch        Batch size for prompt processing
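
These map onto Ollama's native chat API: keep_alive is a top-level field, while the sampling knobs go under options. A sketch with illustrative values:

import requests

payload = {
    "model": "llama3",
    "messages": [{"role": "user", "content": "Hello!"}],
    "stream": False,
    "keep_alive": "24h",       # keep the model in memory for a day
    "options": {
        "mirostat": 0,         # 0 disables Mirostat sampling
        "repeat_penalty": 1.1,
        "repeat_last_n": 64,
        "min_p": 0.05,
        "tfs_z": 1.0,          # 1.0 disables tail-free sampling
        "typical_p": 1.0,
        "num_ctx": 8192,       # context window in tokens
        "num_batch": 512,
    },
}
resp = requests.post("http://192.168.1.50:11434/api/chat", json=payload)
print(resp.json()["message"]["content"])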