How To Install And Configure A Full Featured LLM Server

Windows 11 Pro + Docker + Ollama + Open WebUI + SearXNG + ComfyUI + Remote Access

Made using a fresh install of Windows 11 Pro with no modifications.

Last Updated: 2026-06-18 · Suggestions: setupllmserver@gmail.com

So you're here to setup a LLM server. Awesome! Here is a guide to do that. No videos, no images, no ads - just plain text. This guide assumes you have some tech skills. I've tried to make it straightforward and error free. Just FYI I ran the original guide through my LLM to clean it up and organize it better to get what you see here. I do plan on making a Linux version of this at some point in the future.

1. Install Windows Subsystem for Linux (WSL)

Open PowerShell as Administrator, then run:

# Install WSL
wsl --install

# Reboot, then ensure WSL is up to date
wsl --update

2. Install Docker Desktop

Open Microsoft Store and search for Docker Desktop
Open Docker Desktop and close the "Windows Subsystem for Linux" dialogue box
[Optional] Create or log in to your account
Enable Start Docker Desktop when you sign in to your computer
Go to Settings → General and confirm WSL 2 based engine is enabled
Uncheck Send usage statistics

3. Install Ollama & Download Models

Visit ollama.com and download/run the installer
Open Ollama and create or log in to your account
Open a browser and go to localhost:11434 to verify the app is running
Open PowerShell as Administrator and pull your desired models:

# Find available models at: https://ollama.com/search
# ollama pull <model_name>

ollama pull llama3.3:70b-instruct-q8_0
ollama pull llama4:latest
ollama pull qwen3.5:122b

4. Install Open WebUI (OWUI)

Docker Desktop must be running. Open PowerShell as Administrator:

# Pull the OWUI image
docker pull ghcr.io/open-webui/open-webui:main

# AMD GPU
docker run -d -p 3000:8080 -v open-webui:/app/backend/data \
  --name open-webui ghcr.io/open-webui/open-webui:main

# nVidia GPU
docker run -d -p 3000:8080 --gpus all -v open-webui:/app/backend/data \
  --name open-webui ghcr.io/open-webui/open-webui:cuda

If the command hangs while pulling a layer, press Ctrl+C then run:
docker system prune -f
docker pull ghcr.io/open-webui/open-webui:cuda

Click Allow on the Windows firewall dialogue box.

5. Install SearXNG (SXNG)

Open PowerShell as Administrator:

# Note that $HOME refers to C:UsersYour_Name
$base = "$HOMEsearxng"

# Create base, config, and data folders
New-Item -ItemType Directory -Force -Path "$baseconfig","$basedata"

cd $base

# Pull latest SXNG image
docker pull docker.io/searxng/searxng:latest

# Start the service — this also creates the settings.yml config file
docker run --name searxng -p 8080:8080 docker.io/searxng/searxng

Create docker-compose.yml

Open File Explorer and navigate to C:\Users\Your_Name
Right-click → New Text Document → name it docker-compose.yml (ensure it does not end in .txt)
Open the file in Notepad and paste the contents below

This puts OWUI and SXNG in the same Docker container so they can communicate. I was not able to get them to communicate when running in different containers.

services:
  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    container_name: open-webui
    depends_on:
      - searxng
    environment:
      - OLLAMA_BASE_URL=http://host.docker.internal:11434
      # Add the next line only if you plan to use ComfyUI for image generation
      - ENABLE_RAG_LOCAL_WEB_FETCH=true
    volumes:
      - open_webui_data:/app/backend/data
    ports:
      - "3000:8080"
    restart: unless-stopped

  searxng:
    image: searxng/searxng:latest
    container_name: searxng
    volumes:
      - ./searxng:/etc/searxng
    ports:
      - "8080:8080"
    restart: unless-stopped

volumes:
  open_webui_data:

Configure settings.yml

Navigate to the searxng base folder and open settings.yml
Add use_default_settings: true as the very first line, above the general: section
Scroll down to formats: and add - json above the - html entry

Restart both services in the same container

In File Explorer, navigate up one level to where docker-compose.yml lives (C:\Users\Your_Name)
Right-click in the folder → Open Terminal

docker compose down
docker compose up -d

# Both services should appear in Docker Desktop in the same container.
# If you see container name conflict errors, remove the conflicting containers first:
# docker rm -f searxng
# docker compose up -d
# docker rm -f open-webui
# docker compose up -d

6. Configure Models in OWUI Admin Panel

Open http://localhost:3000/ and create an Admin account
Click the letter icon in the lower-left corner → Admin Panel
Under Users → Overview, click + to add new users. Repeat as needed.
[Optional] Under Users → Groups, create a group called Users. In the Permissions tab, toggle all Workspace Permissions on → Save. Then click Edit → Users to add members.

Go to Settings → Models. For each model you want available to users:

It appears that Save & Update saves only one change at a time — so re-save after every individual change.

Click Access → change from Private to Public → close the window
Scroll to Default Features → tick Web Search → click Save & Update

7. Configure Documents in OWUI Admin Panel

Go to Settings → Documents → scroll to the Retrieval section → toggle Hybrid Search on
Set Top K to 50
Set Top K Reranker to 20
Scroll to RAG Template → at the end of the Guidelines section add: - Do not apologize for not finding information
Scroll to the bottom → click Save

8. Point OWUI to the SearXNG Instance

Go to Settings → Web Search → toggle Web Search on
Set Web Search Engine to searxng
Enter the SearXNG Query URL:

http://searxng:8080/search?q=<query>&format=json

# Alternatively, use the server's direct IP address:
# http://server_ip_address:8080/search?q=<query>&format=json

Higher numbers for Search Result Count and Concurrent Requests don't guarantee better results. Recommended: 5–7 results, 2–3 concurrent requests.

Set Search Result Count to 10 and Concurrent Requests to 5
Click Save

9. Other Search Engine Options in OWUI

Found under Admin Panel → Settings → Web Search. Most require an API key.

ollama_cloud — Free API key via ollama.com
searxng — No API key required. Free. Covered in this guide.
brave — Paid subscription. Key via brave.com/search/api
perplexity — Paid. Key via perplexity.ai
kagi — Paid. API key for searching still in closed beta as of 2026-03.
perplexity_search — Does not work in OWUI due to API key changes in '24/'25.

10. Enable Native Search Function for Individual Models

Go to Admin Panel → Settings → Models → click Edit on the desired model
Scroll to Advanced Params → click Show
Find Function Calling → change from Default to Native and save

11. Using ComfyUI Portable With nVidia GPU To Generate Images

Status: In Progress

Download and unzip the appropriate ComfyUI Portable zip file into the desired folder
Navigate to that folder and right-click run_nvidia_gpu.bat
If OWUI needs to access ComfyUI over a network, add --listen 0.0.0.0 to the end of the first line
Double click run_nvidia_gpu.bat to start ComfyUI and minimize the open terminal
Press Win+R → type shell:startup → add a shortcut to run_nvidia_gpu.bat so ComfyUI runs on startup
Load ComfyUI in a browser, select a workflow, and download the JSON file (used later in OWUI)

Configure OWUI for ComfyUI

Go to Admin Panel → Settings → Images
Toggle Image Generation to ComfyUI
Enter the model name (from the currently loaded ComfyUI workflow)
Set image size and enter the URL used to access ComfyUI
Ignore the ComfyUI API Key field
Upload the ComfyUI Workflow JSON file

Node IDs from the JSON file

Open JSON file to look for Node IDs [these ID numbers are examples]:

Prompt:       Node 57:27  — search for "CLIPTextEncode"
Model:        Node 57:28  — search for "unet_name" (also shows model name for OWUI)
Width/Height: Node 57:13  — search for "width" or "height"
Steps:        Node 57:3   — search for "steps"
Seed:         Node 57:3   — search for "seed"

Image Edit configuration

Toggle Image Edit on
For the Image node ID, open the JSON file and search for images → Node 57:8
Model node ID is the same as ckpt_name from the Create Image section (57:28)
Use the data from the Create Image section to fill out the remaining fields

12. Setup Cloudflare Tunnel To Access Server Remotely

Status: In Progress

Go to cloudflare.com → Domains → Buy Domain (e.g. exampledomain.com)
From the main account page, click Zero Trust → create an account → select the Free Plan
Go to Access Policies → Policies → Add A Policy
In the Include section, select Emails and enter the email addresses of users who will access the server
Under Policy Details, give the policy a name (e.g. OpenWeb UI), set Action to Allow, and set a session duration (e.g. 2 weeks)
Click Save Policy

This creates a policy that emails the user a one-time password (OTP) granting access for the policy's duration.

Go to Networks → Connectors → Create a tunnel
Select Cloudflared, give the tunnel a name → Save Tunnel
Select your OS (Windows 11 in this case) and follow steps 1–3. For step 4, use the copy icon — the command is very long.
Scroll down → Next
Add a subdomain (e.g. openwebui) and select exampledomain.com from the dropdown
Under Service, set Type to HTTP and enter localhost:3000 (the port OWUI runs on)

HTTPS is more secure but is outside the scope of this guide.

Click Complete Setup
Go to Access Controls → Applications → Create new application → Self-hosted and private
Enter the subdomain (openwebui) and select the purchased domain (exampledomain.com)
Under Access Policies, select the OpenWeb UI policy from the dropdown → save
Open a browser and go to openwebui.exampledomain.com — enter your email to receive an OTP and gain access

Works with Firefox and Chrome, but have not tested it with Safari.

Optional: Google Gemini for Image Generation

Go to Admin Panel → Settings → Images
Toggle Image Generation on
Enter gemini-3-pro-image-preview in the Model field
Set the desired image width and height
Toggle Image Prompt Generation on
Set Image Generation Engine to Gemini
Enter the Gemini Base URL: https://generativelanguage.googleapis.com/v1beta
Generate an API key at aistudio.google.com/app/api-keys → Create API Key
Payment is required to use this API. Once set up, paste the key into the Gemini API Key field
Set Gemini Endpoint Method to generateContent
For image editing, toggle Image Edit on — fields will populate automatically. Adjust image size if needed.

Optional: SearXNG Search Priorities

SXNG allows the admin to prioritize or block certain domains via settings.yml. Scroll to the # Configuration of the "Hostnames plugin": section.

A good starting reference for blocked / low-priority / high-priority sites: kagi.com/stats?stat=insights

Optional: Adjust Model Reasoning Effort in OWUI

Go to Admin Panel → Settings → Models → click the desired model
Go to Advanced Params → Show
Scroll to Reasoning Effort → click Default → change the value from medium to high (or low)
Scroll to the bottom → click Save & Update

After saving advanced parameters, refresh the Admin Panel page. Going back into the same model without refreshing may still show the old values.

Miscellaneous Notes

After clicking Save & Update on a model, refresh the Admin Panel page — the updated values may not display immediately.
Keep track of all the ports each service uses.
Open the OWUI and/or SXNG port(s) in Windows Firewall if either is meant to be accessed by multiple users over a network.
There are many more settings in the configuration files and OWUI not covered here — feel free to ask the LLM what each option does.
In settings.yml, under the engines section, change the disabled: flag from true to false for Reddit and Steam if desired.

End of guide · Suggestions: setupllmserver@gmail.com