Run Your Own Local AI with Ollama + Open WebUI on Proxmox

Ever wanted to have your own local AI assistant running right from your homelab?
Instead of relying on cloud services, you can set up a lightweight yet powerful AI environment inside Proxmox.

In this guide, I’ll show you how I deployed Ollama (the model backend) and Open WebUI (the web interface) in separate LXC containers. This way, you get a clean, modular setup that works even on modest hardware.





My Setup

  • Hypervisor: Proxmox VE

  • Container 1 (Ollama): Debian 13, 2 cores, 4GB RAM, 8GB swap

  • Container 2 (Open WebUI): Debian 11, 1 core, 1GB RAM, 2GB swap (optional)
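
If you prefer the command line over the Proxmox GUI, here is a minimal sketch of how two containers with these specs could be created with pct. The container IDs, template file names, storage name, and rootfs sizes are assumptions; substitute whatever templates and storage you actually have on your host.

# On the Proxmox host -- IDs, template names, storage and disk sizes are placeholders
pct create 201 local:vztmpl/debian-13-standard_13.0-1_amd64.tar.zst \
  --hostname ollama --cores 2 --memory 4096 --swap 8192 \
  --rootfs local-lvm:20 --net0 name=eth0,bridge=vmbr0,ip=dhcp --unprivileged 1

pct create 202 local:vztmpl/debian-11-standard_11.7-1_amd64.tar.zst \
  --hostname open-webui --cores 1 --memory 1024 --swap 2048 \
  --rootfs local-lvm:8 --net0 name=eth0,bridge=vmbr0,ip=dhcp --unprivileged 1

pct start 201 && pct start 202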


Step 1: Deploy Ollama

Inside the first LXC container (Debian 13, 4GB RAM, 2 cores, 8GB swap):

curl -fsSL https://ollama.com/install.sh | sh
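
The install script registers Ollama as a systemd service listening on port 11434, so a quick sanity check right after installation looks like this:

systemctl status ollama                     # should report active (running)
curl http://127.0.0.1:11434/api/version     # the API replies with the installed version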

Pull some lightweight models to test:

ollama pull phi3
ollama pull llama3.2
ollama pull gemma3:270m
ollama pull tinyllama
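
Once the pulls finish, confirm what landed on disk and give one of the small models a quick prompt straight from the shell:

ollama list                                  # every downloaded model and its size on disk
ollama run gemma3:270m "Explain what an LXC container is in one sentence."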

You’ll also notice an “Arena” entry in the model picker later on; that’s the comparison/evaluation feature Open WebUI ships enabled by default, not something installed through Ollama.


Models I Installed

Here’s what I currently have on my Ollama LXC:

  • Phi-3 Mini (Q4) → good reasoning while still lightweight

  • Llama 3.2 (quantized) → higher accuracy, heavier

  • Gemma 3 270M → ultra-light, runs even on very small RAM

  • TinyLlama (latest) → small but useful for experiments

  • Arena → the side-by-side comparison feature Open WebUI enables by default for benchmarking/chat (not an Ollama model)

This mix gives me flexibility:

  • Use Gemma 3 270M or TinyLlama on very low resources.

  • Use Phi-3 Mini for balanced reasoning.

  • Use Llama 3.2 for better accuracy (but slower).
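
To see what each of these actually costs in memory on this kind of hardware, load one and check ollama ps; by default a model stays resident for about five minutes after the last request:

ollama run phi3 "Give me one sentence about Proxmox."
ollama ps                                    # loaded models, their memory size and CPU/GPU placement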

For more models visit: https://ollama.com/

Step 2: Deploy Open WebUI

Inside the second container (Debian 11, 1GB RAM, 1 core):

curl -fsSL https://get.openwebui.com | bash
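
Whichever way the installer ends up running Open WebUI (native service or Docker), it’s worth confirming that something is actually listening before moving on. 8080 is Open WebUI’s usual port and 3000 a common Docker mapping, but treat both as assumptions that depend on the install method:

ss -tlnp | grep -E ':(8080|3000)'            # Open WebUI typically listens on 8080 (or 3000 via Docker)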

Optional: enable swap to give it extra breathing room:

fallocate -l 2G /swapfile
chmod 600 /swapfile
mkswap /swapfile
swapon /swapfile
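
If swapon succeeded (in an unprivileged LXC it may be refused, since swap there is normally allocated from the Proxmox side, like the 2GB in the specs above), one more line keeps the swapfile across reboots:

echo '/swapfile none swap sw 0 0' >> /etc/fstab
swapon --show                                # confirm the swap space is active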

Step 3: Connect Open WebUI to Ollama

  1. On the Ollama container, check if the API is running:

    systemctl status ollama
  2. On the Open WebUI container, configure the backend to point to Ollama’s IP (if the connection is refused, see the note after these steps):

    http://<ollama-container-ip>:11434
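
One gotcha before the two containers can actually talk: the Ollama installer binds the API to 127.0.0.1 only, so requests from the Open WebUI container will be refused until Ollama is told to listen on the container’s network interface. A minimal sketch using a systemd override on the Ollama container:

# Make the Ollama API listen on all interfaces, not just localhost
mkdir -p /etc/systemd/system/ollama.service.d
cat <<'EOF' > /etc/systemd/system/ollama.service.d/override.conf
[Service]
Environment="OLLAMA_HOST=0.0.0.0"
EOF
systemctl daemon-reload
systemctl restart ollama

# From the Open WebUI container, this should now return the pulled models
curl http://<ollama-container-ip>:11434/api/tags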

Now the frontend can talk to the backend 🎉


Architecture Diagram

(image in the original post)

Sample

(image in the original post)

Conclusion

By running Ollama and Open WebUI on separate LXC containers, I now have a local AI setup that’s:

  • Lightweight (optimized for low resources)

  • Modular (frontend and backend separated)

  • Private (runs entirely in my Proxmox homelab)

This setup lets me experiment with models like Phi-3 Mini, Llama 3.2, Gemma 3 270M, and TinyLlama while keeping my system stable.

It’s a simple way to get started with local AI on homelab hardware without needing a GPU or big resources.
