Neural Inverse is Open Source →
DocsLLM Providers (BYOLLM)

LLM Providers (BYOLLM)

Neural Inverse never routes your code through Neural Inverse servers. You bring your own API keys — they are stored locally and used directly from your machine.

Supported Providers

Cloud Providers

ProviderSetup
AnthropicAPI key from console.anthropic.com
OpenAIAPI key from platform.openai.com
Google GeminiAPI key from aistudio.google.com
xAI (Grok)API key from x.ai
DeepSeekAPI key from platform.deepseek.com
MistralAPI key from console.mistral.ai
GroqAPI key from console.groq.com
OpenRouterAPI key from openrouter.ai
GitHub ModelsGitHub PAT with models:read scope
Fireworks AIAPI key from fireworks.ai
CerebrasAPI key from cloud.cerebras.ai
AWS BedrockAWS credentials + region (default: us-east-1)
Google Vertex AIGCP project + region (default: us-west2)
Microsoft AzureAzure resource name + API key + API version

Local / Self-Hosted Providers

ProviderDefault endpointNotes
Ollamahttp://localhost:11434Models auto-detected
vLLMhttp://localhost:8000Models auto-detected
LM Studiohttp://localhost:1234Models auto-detected
LiteLLMCustom endpointOpenAI-compatible proxy
OpenAI-CompatibleCustom endpointAny OpenAI-format API, custom headers supported

Default Models

Neural Inverse ships with a curated default model list per provider. You can add any model string not in the list.

Anthropic: claude-opus-4-6, claude-sonnet-4-6, claude-haiku-4-5-20251001, claude-3-7-sonnet-latest, claude-3-5-sonnet-latest, claude-3-5-haiku-latest

OpenAI: gpt-5.4, gpt-5.4-mini, gpt-5.4-nano, gpt-5.1-codex, o3, o4-mini

Google Gemini: gemini-3.1-pro-preview, gemini-3-flash-preview, gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite, gemini-2.0-flash

xAI: grok-4.20, grok-4-1-fast-reasoning, grok-3, grok-3-mini

DeepSeek: deepseek-chat, deepseek-reasoner

GitHub Models: openai/gpt-4.1, openai/gpt-4.1-mini, openai/gpt-4.1-nano, openai/o4-mini, openai/o3-mini, deepseek/deepseek-r1, meta/llama-4-scout-17b-16e-instruct, mistralai/mistral-small-2503, xai/grok-3-mini

Fireworks AI: accounts/fireworks/models/llama-v3p3-70b-instruct, accounts/fireworks/models/deepseek-r1, accounts/fireworks/models/qwen3-235b-a22b, accounts/fireworks/models/qwen3-32b, accounts/fireworks/models/gemma-4-31b-it, accounts/fireworks/models/gpt-oss-120b, accounts/fireworks/models/gpt-oss-20b

Cerebras: llama3.1-8b, gpt-oss-120b, qwen-3-235b-a22b-instruct-2507

Ollama / vLLM / LM Studio: Models are auto-detected from your running server — no manual list needed.

Adding a Provider

  1. Open Settings (gear icon or Cmd+, / Ctrl+,).
  2. Go to Neural Inverse > LLM Providers.
  3. Select the provider and enter your API key or endpoint.
  4. Click Verify to test the connection.

For local providers (Ollama, vLLM, LM Studio), start your server first — Neural Inverse polls the endpoint to discover available models automatically.

Selecting a Model per Feature

Neural Inverse lets you assign different models to different features:

  • Chat — the model used in the sidebar chat panel
  • Autocomplete — the model used for inline code completions
  • Ctrl+K — the model used for inline edit (Cmd+K / Ctrl+K)
  • Apply — the model used when applying suggested changes
  • Power Mode — the model used by Power Mode agents

Each feature has an independent model selection. You can use a fast local model for autocomplete and a powerful cloud model for Power Mode.

OpenAI-Compatible Endpoints

For any provider that exposes an OpenAI-format API (LiteLLM proxy, custom inference server, etc.):

  1. Select OpenAI Compatible as the provider.
  2. Enter the base endpoint URL.
  3. Enter the API key (leave blank if not required).
  4. Optionally add custom request headers as JSON (e.g. for authentication headers).

Air-Gapped / Offline Setup

For environments with no internet access:

  1. Run Ollama or vLLM on a local server within your network.
  2. Point Neural Inverse at the local endpoint.
  3. All LLM traffic stays within your network boundary.

No Neural Inverse cloud dependency is required after initial installation.


Was this page helpful?