Suggested LLMs
โจ Default Model โ DeepSeek V3.2โ

The central model responsible for generating spoken lines, in-character dialogue, combat banter, memory processing, and action decisions. Acts like a Dungeon Master, guiding the scene and ensuring the world feels alive. This is the workhorse model in SkyrimNet โ chosen for its balanced creativity, flexible roleplay, no restrictions, adherance to instructions and cost efficiency.
Why it works well here:
- Handles long, context-heavy prompts without losing detail.
- Maintains consistent personality and tone for roleplay.From friendly, to tame to hostile, just depends on the npc bios and context.
- Strong decision making for action selection
- fully uncensored, allowing and roleplaying any sort of theme, if it makes sense for the character personna.
- Cost-effective enough to be used frequently for all main gameplay interactions.
Alternative Models:

Gemma4 31b , a "small" gem of a model. For its cost it delivers very good, immersive roleplay, with high creativity. ( arguably as good or better than many , more expensive models). As of SkyrimNet Beta20 , the only caveat is that you may encounter some rate limiting by some providers, due to high demand. In that case, locate and exclude the provider. Gemma4 31 can handle several dialogue features at once (if those are enabled), like narration, TTS emotion tags , inline npc "thoughts" and embedded action intents. Its extremely cost effective and the ideal model for a budget but still immersive experience.

Gemma4 26 , does pretty much anything the slighly bigger 31b model does, but faster and cheaper.

anthropic/claude-4.5-sonnet - a high quality roleplay model, offering realistic nuance, great prose and overall top notch experience. The price is its major shortcoming, with costs averaging $1 per hour of play (just this model alone), so have that in mind. It also tends to be verbose, though that can be managed through prompting.
๐ Model Rotation (Only for Dialogue Model)โ
SkyrimNet supports model rotation for dialogue generation.
You can list multiple models in the model name field, separated by commas. After each conversation turn, the system automatically switches to the next model in the list (round-robin).
- Applies only to the
Default(Conversation) model type - Adds response variety
- Lets stronger models periodically keep weaker ones on-track
Example: openrouter/deepseek-v3.2, anthropic/claude-3.7-sonnet, anthropic/claude-4.5
Each NPC response advances to the next model, then loops back to the start.
๐ง Memory Generation Model โ current Qwen/qwen3-235b-a22b-2507 instructโ

Qwen/qwen3-235b-a22b-2507 instruct , generates detailed, vivid memories, but still very grounded on the dialogue and events that produced them. Very good quality for the cost. Handles memory creation, summarization, and storage.
** Memory Creation**
- Summarizes recent event streams into first-person memories.
- Assigns:
importance_scoreemotiontagstype(TRAUMA, EXPERIENCE, etc.)
- Used later for recall, relationships, and mood shaping.
๐ View in-game under: UI > Memories
Alternative Models:
- Deepseek v3.2 , generates detailed memories, but still very grounded on the dialogue and events that produced them.
๐งฌ Character Profile Generation Model โ *Qwen/qwen3-235b-a22b-2507 instructโ

Generates and updates NPC identity data.
*Why Qwen/qwen3-235b-a22b-2507 instruct here:
- cheap but still competent at structured, consistent character building.
- Rich vocabulary for detailed backstories and personality profiles.
- Maintains thematic consistency while updating profiles over time.
** Profile Creation**
- Auto-generates profiles for modded or uninitialized NPCs.
- Includes:
- Goals
- Personality traits
- Background
- Relationship tendencies
** Dynamic Updates**
- Modifies profiles based on:
- Important memories
- Emotional shifts
- Allows characters to evolve naturally.
Alternative Models:
Several, less capable models are often unable to comply with the prompt Json instructions, so you should be careful what model to choose for this use.
-Deepseek V3 0324
๐ญ Action Evaluation Model โ current: deepseek/deepseek-v4-flashโ
Decides what an NPC does after speaking or reacting to an event.
** Action Selection**
- Picks gameplay actions that follow logically from dialogue.
- Examples:
FollowPlayer,SlapTarget,PickUpItem.
Alternative Models:
DeepSeek V3 (0324), is a cheap, competent llm, capable of dealing with the context and action parameters.
google/gemma-4-31b it cheap and very competent for the cost
google/gemma-4-26b-a4b-it , is extremely cheap, reported to still be dealing well with the parameter callโ
๐ง GameMaster Model โ current: meta-llama/llama-3.3-70b-instructโ

The humble and dated meta-llamma 3.3 is an inexpensive and good enough model for the gamemaster role. He will follow instructions and be very consistent enough in creating new or continue conversation topics consistently, for your proactive npcs dialogues. Everytime the GM cooldown ends, this model will likey send a topic for the dialogue llm ( while "smarter" models often prefer silence) Since most users prefer more banter, this is the chosen model
Alternative Models:
DeepSeek V3.2 , if you require more context awareness, you may want a "better" model. However these may output too much "none" as the conversation topics ( to keep it obnotrusive for the player).
โ๏ธ Combat Model โ current: deepseek/deepseek-v4-flashโ

For combat banter you will want a fast model, to cope with the rapid firing back and forths, as well as numerous events. No need for it to be an expensive, very creative model, dialogue wise. Its an ideal model for those short combat related lines.
Alternative Models:
google/gemma-4-26b-a4b-it will be a very cheap and worthy solution. However the latency may make the banter unable to keep up with very frequent combat events.
๐ Universal Translator โ current: google/gemini-2.5-flashโ
For the Universal Translator feature a precise, fast model is best, to avoid latency while working in tandem with the altered scripted dialogues. Gemini models work well with many languages, together with its relatively low price its a good match
Alternative Models:
x-ai/grok-4.1-fast will also do a good job as the model for the universal translator. Being fast and precise, while still creative enough
๐งช Meta Evaluation Model โ current deepseek/deepseek-v4-flashโ

Performs high-frequency, small tasks that keep scenes running smoothly.
Why deepseek-v4 -flash is here:
- Extremely fast and very cost-efficient for micro-decisions.
- Good at short context analysis without hallucination.
- Perfect for frequent updates like turn-taking and mood tracking.
- Its low input token cost assures that , despite the meta frequent calls, operational costs stay low
๐ Mood Evaluation
- Adjusts NPC emotional state from:
- Dialogue tone
- Memories
- Player actions
- Affects:
- Voice tone (for higher end TTS)
- Facial expressions
- Decision-making
๐ฅ Speaking Turn Selector
- Chooses the next speaker based on relevance, social rules, and proximity.
- Ensures realistic pacing in conversations.
Alternative Models:

Gemini 2.5 Flash
A solid alternative, it was previously the default meta llm. Is slightly more censored than grok, if you run into certain more extreme themes.
๐งช Diary Generation Model โ -Qwen/qwen3-235b-a22b-2507โ
Alternative Models:
Like for Bios generations, cheaper but still creative models will be good enough, costing much less:
-Deepseek V3.2 -Gemma4 31b is very good at writing and still inexpensive
๐งช Vision Model โ qwen/qwen3-vl-8b-instructโ
is a very inexpensive model, allowing you to have Omnisight always on and still being a negletible part of operational costs (adding a few cents over many hours of play)
Alternative Models:
๐งช Agent helper Model โ anthropic/claude-haiku-4.5โ
Alternative Models:
Excluding faulty LLM providers in OpenRouter:โ
On you Openrouter call log page, check what provider was the culprit ( of weird output, slow call, above average cost, censorship, etc)
Next go to your Settings , "Preferences" and next go to "Privacy"
In there click on "add" to choose the provider you want to permanently exclude from being used on OpenRouter ( will affect all LLMs and tasks)
OpenRouter SkyrimNet dataโ
The charts displaying show the models by token count. The ranking doesnt mean higher is better, only that has more usage, often due to the task itself, like the grok4 fast models used as Meta.
View current analytics on OpenRouter