For the person managing the Nexus One AI server. Commands to restart services, manage users, add models, check health, and pull logs โ all in one place.
๐ This page is for system administrators only. All commands below require SSH access to the server. If you're an end user looking for help, visit Troubleshooting or FAQ instead.
Notebook server โ restrict to developer users only
FastAPI
8080
http://ai.local:8080
Custom API layer โ if deployed
Model management
List installed models
Shows all models currently available on the system with their sizes.
ollama list
Pull (install) a new model
Downloads and installs a model. Examples: llama3.1:8b, mistral:7b, gemma2:9b. Requires the model files to be available on your network or internet (if connected).
ollama pull MODEL_NAME
Remove a model
Deletes a model from disk to free up space. Make sure no users are actively using it.
ollama rm MODEL_NAME
Show model info
Displays the model's parameters, context length, and configuration.
ollama show MODEL_NAME
Check what's currently loaded in GPU
Shows running Ollama processes and their VRAM usage.
ollama ps
Model storage location
All model files are stored here. Monitor this directory for disk usage.
ls -lh /usr/share/ollama/.ollama/models/
User management (Open WebUI)
User accounts are managed through the Open WebUI admin panel at http://ai.local:3001/admin/users. Log in with your admin account and use the panel for most user tasks. The commands below are for bulk operations or when the UI is unavailable.
Open user admin panel
The primary way to create, disable, and manage user accounts. Open this URL in a browser while connected to the internal network.
http://ai.local:3001/admin/users
Open WebUI database location
User data is stored in a SQLite database at this path. Back this up regularly.
/opt/open-webui/data/webui.db
Back up Nexus One AI data
Creates a local archive of Cezen data, settings, audit logs, and uploaded working files.
Models are added via the Ollama CLI. On an air-gapped system, copy the model file to the server first.
Check available disk space: df -h / โ ensure at least 2ร the model size is free
Pull the model (if internet-connected): ollama pull llama3.1:8b
For air-gapped install, copy the GGUF file then: ollama create mymodel -f /path/to/Modelfile
Verify it appears: ollama list
Test it loads: ollama run MODEL_NAME "Hello"
The model will appear automatically in Open WebUI's model selector โ no restart needed
๐๏ธ
Freeing up disk space
Disk space is most commonly consumed by model files, ChromaDB data, and Jupyter notebooks. Check each before deleting anything.
Find what's using the most space: sudo du -sh /opt/* | sort -rh | head -10
List models with sizes: ollama list
Remove unused models (confirm with users first): ollama rm MODEL_NAME
Clear old journal logs (keeps last 7 days): sudo journalctl --vacuum-time=7d
Verify space recovered: df -h /
๐
Resetting an admin password (Open WebUI)
If the admin password is lost, reset it via the Open WebUI CLI while the service is stopped.
Stop Open WebUI: sudo systemctl stop open-webui
Run the reset command: sudo open-webui reset-admin-password --email admin@example.com --password newpassword
Restart Open WebUI: sudo systemctl start open-webui
Log in with the new password and immediately change it to something secure
๐
Need Cezen support?
For hardware issues, system-level failures, or anything beyond this guide, contact Cezen. Include: the service name, the error from journalctl, and the output of nvidia-smi and df -h.