Full AI Platform - Intelligence, Automation & Control
v2.0 transforms Nexus One AI from a chat and document Q&A tool into a complete on-premises AI operating platform. This release adds agents, workflow automation, meeting intelligence, secure collaboration, and a full administrative control layer, all running entirely on your hardware.
- Multimodal Chat Prompt Studio - Full rewrite with persistent history, sidebar search, image upload, per-conversation model switching, and markdown rendering with code blocks
- Document Intelligence Workbench - Side-by-side document comparison, table extraction, batch processing, and structured output from uploaded files
- Meeting Assistant - On-premises meeting transcription (Whisper), AI summaries, action item extraction, and follow-up email drafting from audio recordings
- Secure Chat Rooms - Multi-user AI-assisted collaboration spaces with AI moderation, private to invited members, no external relay
- Agent Builder - Build autonomous AI agents with tool access, multi-step planning, and a governance/approval layer for sensitive actions
- Workflow Automation - Visual pipeline builder for chaining AI tasks (trigger → process → output), with support for scheduled and webhook triggers
- Scheduled Jobs - Cron-based AI task scheduling: daily reports, nightly document ingestion, automated monitoring, all running without user interaction
- Connectors - Integration layer for databases, file shares, REST APIs, email, and SharePoint, usable by agents and workflows
- Guardrails - Keyword blocks, PII redaction, regex rules, and prompt injection detection with block/redact/warn actions
- Model Router - Automatic query routing to the optimal model based on task type, token count, or custom rules
- RAG Quality Dashboard - Chunk quality scoring, retrieval accuracy monitoring, and query testing for knowledge base collections
- AI Eval Suite - Standardised test suites for model and prompt quality, with pass rate tracking and regression detection
- Model Benchmarking - Speed and quality benchmarks across installed models with tokens/second and TTFT metrics
- API Key Manager - Generate and manage API keys for programmatic access with per-key rate limits and model restrictions
- Portal expanded from 20 to 43 pages covering all new features
- Quick Start updated with tabbed layout: First Steps, Platform Features, Admin Setup
- FAQ expanded with AI Tools, Automation, and new Technical Q&As
- Use Cases updated with agent, meeting assistant, and automation scenarios
- Glossary updated with 15 new terms covering all new features
- Light lavender theme applied consistently across all 43 pages
- Navigation fixed and standardised across all pages with complete AI Tools section
- llama3:70b - Added for high-accuracy complex reasoning tasks
- codellama:34b - Added for code generation and technical review
- llava:13b - Added for vision/image understanding queries
- nomic-embed-text - Dedicated embedding model for RAG (required)
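The Guardrails entry above lists regex rules with block/redact/warn actions. As a rough illustration of how such rules might be evaluated, here is a minimal Python sketch. It is not the Nexus One AI implementation: the `Rule` class, the `apply_guardrails` function, and the three example rules are hypothetical names and patterns chosen for this example only.

```python
import re
from dataclasses import dataclass


@dataclass
class Rule:
    """Hypothetical guardrail rule (not the product's actual schema)."""
    name: str
    pattern: str  # regular expression to look for
    action: str   # one of "block", "redact", "warn"


def apply_guardrails(text, rules):
    """Apply rules in order and return (allowed, processed_text, warnings)."""
    warnings = []
    for rule in rules:
        if not re.search(rule.pattern, text, flags=re.IGNORECASE):
            continue
        if rule.action == "block":
            # Stop immediately: nothing is passed on to the model.
            return False, "", [f"blocked by {rule.name}"]
        if rule.action == "redact":
            # Replace every match with a placeholder and keep going.
            text = re.sub(rule.pattern, "[REDACTED]", text, flags=re.IGNORECASE)
        elif rule.action == "warn":
            # Let the text through but record that the rule matched.
            warnings.append(f"matched {rule.name}")
    return True, text, warnings


# Example rules; the patterns are illustrative assumptions only.
RULES = [
    Rule("email", r"[\w.+-]+@[\w-]+\.[\w.-]+", "redact"),
    Rule("secret-keyword", r"\bclassified\b", "block"),
    Rule("injection-phrase", r"ignore (all )?previous instructions", "warn"),
]

if __name__ == "__main__":
    print(apply_guardrails("Contact jane.doe@example.gov today", RULES))
    print(apply_guardrails("This report is CLASSIFIED", RULES))
    print(apply_guardrails("Please ignore previous instructions", RULES))
```

Rules are checked in list order, so in this sketch a redact rule placed before a block rule will rewrite the text before the block rule is evaluated. A real deployment would likely need stricter PII patterns than the simple email regex used here.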
Basic Tier - Initial Deployment
First deployment of the Nexus One AI Basic Tier system. Full AI stack installed and configured on-premises with the following components and capabilities.
- Ollama - Local AI model engine running on NVIDIA RTX Pro 6000 GPU
- Open WebUI - Browser-based chat interface accessible at http://ai.local:3001
- ChromaDB - Vector database for document storage and RAG
- LangChain - Python framework for building AI pipelines
- Jupyter Notebook - Interactive Python environment at http://ai.local:8888
- FastAPI - REST API layer for application integration
- DCGM - NVIDIA GPU health monitoring and diagnostics
- nvtop - Live GPU dashboard for terminal monitoring
- QLoRA / Unsloth - Efficient model fine-tuning toolkit
- llama3.1:8b - Primary recommended model for general tasks
- llama3.2:3b - Fast lightweight model for simple tasks
- mistral:7b - Strong multilingual and structured output model
- llama3.1:70b - High-accuracy model for complex reasoning
- gemma2:9b - Google's efficient model with strong instruction following
- phi3:mini - Ultra-fast 3.8B model for quick responses
- nomic-embed-text - Embedding model for document search
- mxbai-embed-large - High-accuracy embedding model
- Home page with live service status monitoring (auto-refreshes every 60 seconds)
- Quick Start guide - 5 steps to first AI conversation
- Tool pages for all 9 installed components with direct access buttons
- Prompt Library - 32 ready-to-use prompts across 8 government departments
- Use Case Gallery - 18 real-world scenarios with time savings benchmarks
- Model Library - comparison table and "which model should I use?" guide
- Troubleshooting - 9 common issues with searchable accordion
- FAQ - 14 plain-English answers to common questions
- Glossary - 18 AI terms explained
- Security & Privacy - air-gapped architecture and data handling guide
- Admin Guide - service management, model management, log access, and procedures
- What's New - this page
Pro Tier Upgrade
The Pro Tier upgrade adds enterprise-scale AI infrastructure for higher concurrency, larger models, and production-grade deployment capabilities.
- vLLM - High-throughput inference engine for multi-user production workloads
- Ray - Distributed computing framework for parallel AI jobs
- DeepSpeed - Memory-efficient training for larger model fine-tuning
- Milvus - Scalable vector database replacing ChromaDB for large document collections
- MLflow - Experiment tracking and model registry
- Additional large models (70B+ class) optimised for the upgraded hardware
- Portal tier switcher - toggle between Entry, Mid, and Advanced documentation
Max Tier Upgrade
The Max Tier adds Kubernetes-based orchestration and enterprise model serving for high-availability, large-scale AI deployment.
- Kubeflow - ML pipeline orchestration on Kubernetes
- Triton Inference Server - NVIDIA's production model serving platform
- Megatron-LM - Large-scale model training across multiple GPU nodes
- High-availability configuration with automatic failover
- Role-based portal access - admin console with embedded terminal
- Usage analytics dashboard for administrators
To record a new release, open whats-new.html and add a new entry above v1.0, following the same format. Include the date, a brief description, and what changed.