Nexus One AI ๐Ÿ”” Basic Tier
What's New

Release Notes & Updates

A record of what's been added, changed, or improved on your Nexus One AI system and this portal. Administrators should update this page when new models or features are deployed.

Current Version
Basic Tier โ€” v2.0
Released June 2026
9
Core tools
8
AI models
43
Portal pages
12
AI Tools & features
Changelog

Full AI Platform โ€” Intelligence, Automation & Control

v2.0 transforms Nexus One AI from a chat and document Q&A tool into a complete on-premises AI operating platform. This release adds agents, workflow automation, meeting intelligence, secure collaboration, and a full administrative control layer โ€” all running entirely on your hardware.

๐Ÿง  Intelligence Tools
  • Multimodal Chat Prompt Studio โ€” Full rewrite with persistent history, sidebar search, image upload, per-conversation model switching, and markdown rendering with code blocks
  • Document Intelligence Workbench โ€” Side-by-side document comparison, table extraction, batch processing, and structured output from uploaded files
  • Meeting Assistant โ€” On-premises meeting transcription (Whisper), AI summaries, action item extraction, and follow-up email drafting from audio recordings
  • Secure Chat Rooms โ€” Multi-user AI-assisted collaboration spaces with AI moderation, private to invited members, no external relay
โš™๏ธ Automation Tools
  • Agent Builder โ€” Build autonomous AI agents with tool access, multi-step planning, and governance/approval layer for sensitive actions
  • Workflow Automation โ€” Visual pipeline builder for chaining AI tasks: trigger โ†’ process โ†’ output, with support for scheduled and webhook triggers
  • Scheduled Jobs โ€” Cron-based AI task scheduling: daily reports, nightly document ingestion, automated monitoring โ€” runs without user interaction
  • Connectors โ€” Integration layer for databases, file shares, REST APIs, email, and SharePoint โ€” usable by agents and workflows
๐Ÿ›ก๏ธ Control & Quality Tools
  • Guardrails โ€” Keyword blocks, PII redaction, regex rules, and prompt injection detection with block/redact/warn actions
  • Model Router โ€” Automatic query routing to the optimal model based on task type, token count, or custom rules
  • RAG Quality Dashboard โ€” Chunk quality scoring, retrieval accuracy monitoring, and query testing for knowledge base collections
  • AI Eval Suite โ€” Standardised test suites for model and prompt quality, with pass rate tracking and regression detection
  • Model Benchmarking โ€” Speed and quality benchmarks across installed models with tokens/second and TTFT metrics
  • API Key Manager โ€” Generate and manage API keys for programmatic access with per-key rate limits and model restrictions
๐Ÿ“– Portal & Knowledge Center
  • Portal expanded from 20 to 43 pages covering all new features
  • Quick Start updated with tabbed layout: First Steps, Platform Features, Admin Setup
  • FAQ expanded with AI Tools, Automation, and new Technical Q&As
  • Use Cases updated with agent, meeting assistant, and automation scenarios
  • Glossary updated with 15 new terms covering all new features
  • Light lavender theme applied consistently across all 43 pages
  • Navigation fixed and standardised across all pages with complete AI Tools section
๐Ÿค– New AI Models
  • llama3:70b โ€” Added for high-accuracy complex reasoning tasks
  • codellama:34b โ€” Added for code generation and technical review
  • llava:13b โ€” Added for vision/image understanding queries
  • nomic-embed-text โ€” Dedicated embedding model for RAG (required)

Basic Tier โ€” Initial Deployment

First deployment of the Nexus One AI Basic Tier system. Full AI stack installed and configured on-premises with the following components and capabilities.

๐Ÿ› ๏ธ Tools Installed
  • Ollama โ€” Local AI model engine running on NVIDIA RTX Pro 6000 GPU
  • Open WebUI โ€” Browser-based chat interface accessible at http://ai.local:3001
  • ChromaDB โ€” Vector database for document storage and RAG
  • LangChain โ€” Python framework for building AI pipelines
  • Jupyter Notebook โ€” Interactive Python environment at http://ai.local:8888
  • FastAPI โ€” REST API layer for application integration
  • DCGM โ€” NVIDIA GPU health monitoring and diagnostics
  • nvtop โ€” Live GPU dashboard for terminal monitoring
  • QLoRA / Unsloth โ€” Efficient model fine-tuning toolkit
๐Ÿค– AI Models Available
  • llama3.1:8b โ€” Primary recommended model for general tasks
  • llama3.2:3b โ€” Fast lightweight model for simple tasks
  • mistral:7b โ€” Strong multilingual and structured output model
  • llama3.1:70b โ€” High-accuracy model for complex reasoning
  • gemma2:9b โ€” Google's efficient model with strong instruction following
  • phi3:mini โ€” Ultra-fast 3.8B model for quick responses
  • nomic-embed-text โ€” Embedding model for document search
  • mxbai-embed-large โ€” High-accuracy embedding model
๐Ÿ“– Knowledge Portal Launched
  • Home page with live service status monitoring (auto-refreshes every 60 seconds)
  • Quick Start guide โ€” 5 steps to first AI conversation
  • Tool pages for all 9 installed components with direct access buttons
  • Prompt Library โ€” 32 ready-to-use prompts across 8 government departments
  • Use Case Gallery โ€” 18 real-world scenarios with time savings benchmarks
  • Model Library โ€” comparison table and "which model should I use?" guide
  • Troubleshooting โ€” 9 common issues with searchable accordion
  • FAQ โ€” 14 plain-English answers to common questions
  • Glossary โ€” 18 AI terms explained
  • Security & Privacy โ€” air-gapped architecture and data handling guide
  • Admin Guide โ€” service management, model management, log access, and procedures
  • What's New โ€” this page

Pro Tier Upgrade

The Pro Tier upgrade adds enterprise-scale AI infrastructure for higher concurrency, larger models, and production-grade deployment capabilities.

๐Ÿ”œ Planned additions
  • vLLM โ€” High-throughput inference engine for multi-user production workloads
  • Ray โ€” Distributed computing framework for parallel AI jobs
  • DeepSpeed โ€” Memory-efficient training for larger model fine-tuning
  • Milvus โ€” Scalable vector database replacing ChromaDB for large document collections
  • MLflow โ€” Experiment tracking and model registry
  • Additional large models (70B+ class) optimised for the upgraded hardware
  • Portal tier switcher โ€” toggle between Entry, Mid, and Advanced documentation

Max Tier Upgrade

The Max Tier adds Kubernetes-based orchestration and enterprise model serving for high-availability, large-scale AI deployment.

๐Ÿ”œ Planned additions
  • Kubeflow โ€” ML pipeline orchestration on Kubernetes
  • Triton Inference Server โ€” NVIDIA's production model serving platform
  • Megatron-LM โ€” Large-scale model training across multiple GPU nodes
  • High-availability configuration with automatic failover
  • Role-based portal access โ€” admin console with embedded terminal
  • Usage analytics dashboard for administrators
๐Ÿ“ For administrators: Update this page whenever a new model is installed, a service is upgraded, or a configuration change is made. Edit whats-new.html and add a new entry above v1.0 following the same format. Include the date, a brief description, and what changed.