AI & Machine Learning
Run AI models, LLMs, and machine learning workloads on your VPS.
78 articles
-
Running Ollama Models on a VPS with Limited RAM
How to run Ollama models on a RAM-constrained VPS, from initial setup to production-ready configuration.
10 views
-
Deploying vLLM for High-Throughput LLM Serving
A comprehensive guide to vLLM covering installation, configuration, and throughput optimization for Linux VPS environments.
7 views
-
Training Custom NER Models with spaCy on VPS
Train custom named entity recognition (NER) models with spaCy on your VPS, from initial setup to production-ready configuration.
5 views
-
Setting Up ComfyUI for AI Image Generation Workflows
Set up ComfyUI image generation workflows on your VPS, from initial setup to production-ready configuration.
4 views
-
Deploying Hugging Face Models with FastAPI
Deploy Hugging Face models behind a FastAPI service on your VPS, from initial setup to production-ready configuration.
7 views
-
Setting Up MLflow for Experiment Tracking
Practical guide to MLflow featuring real-world examples, performance tuning tips, and security best practices.
5 views
-
Running Ollama for Local LLM Inference
Ollama lets you run large language models (LLMs) locally on your VPS. No API keys, no per-token costs, full data privacy.
280 views
-
TensorFlow Serving on Docker for Production
Set up TensorFlow Serving in Docker on your VPS with step-by-step instructions and best practices for production environments.
9 views
-
Building a RAG System with Python and ChromaDB
Retrieval-Augmented Generation (RAG) combines a knowledge base with an LLM. Instead of relying solely on the model's training data, you retrieve relevant documents and include them in the prompt.
750 views
-
Multi-GPU Training Environment Setup
Configure a multi-GPU training environment with CUDA, PyTorch, and distributed training for machine learning workloads.
392 views
-
RAG Pipeline with LlamaIndex
Build a Retrieval-Augmented Generation (RAG) pipeline using LlamaIndex to query your own documents with LLMs.
183 views
-
Jan.ai Local AI Chat Application
Jan.ai is a desktop AI chat application that runs models locally, with Ollama integration and full privacy.
316 views
-
LangServe API for LLM Applications
Deploy LLM-powered APIs using LangServe with LangChain for production-ready AI services.
520 views
-
CrewAI Multi-Agent Workflows
Build AI agent teams with CrewAI that collaborate on complex tasks using role-based agents and structured workflows.
571 views
-
Whisper.cpp Speech-to-Text Server
Deploy Whisper.cpp for fast, local speech-to-text transcription without sending audio to external APIs.
583 views
-
Continue.dev AI Code Assistant with Ollama
Set up Continue.dev as a self-hosted Copilot alternative using Ollama for private AI code assistance in VS Code.
572 views
-
ComfyUI Stable Diffusion API
Deploy ComfyUI as a node-based Stable Diffusion interface with API access for programmatic image generation.
288 views
-
LiteLLM Multi-Provider LLM Routing
Deploy LiteLLM as a unified proxy for routing requests across multiple LLM providers with a single OpenAI-compatible API.
638 views
-
AI Image Generation with SDXL
Set up Stable Diffusion XL on your VPS for high-quality AI image generation with GPU acceleration.
322 views
-
Semantic Search with Sentence Transformers and Qdrant
Build a semantic search engine using Sentence Transformers for embeddings and Qdrant as the vector database.
383 views
-
Custom GPT Chatbot with Open Source Models
Build a custom AI chatbot using open-source models with Ollama, LangChain, and a web interface.
518 views
-
AI-Powered Log Analysis
Use AI models to analyze server logs, detect anomalies, and generate human-readable incident summaries.
672 views
-
Deploy Mixtral MoE Models
Run Mixtral Mixture-of-Experts models on your server for high-quality AI inference with efficient resource usage.
619 views
-
Model Caching for Faster LLM Inference
Optimize LLM inference speed with model caching, KV cache management, and pre-loading strategies.
577 views
-
AI Translation Service
Deploy a self-hosted AI translation service using open-source models for multilingual text translation.
624 views
-
Automatic1111 Forge for Image Generation
Deploy Stable Diffusion WebUI Forge for advanced AI image generation with an extensive extension ecosystem.
548 views
-
Voice Cloning with Coqui TTS
Set up Coqui TTS for high-quality text-to-speech synthesis and voice cloning on your own server.
532 views
-
AI Monitoring and Anomaly Detection
Use machine learning for anomaly detection in server monitoring to catch issues before they cause outages.
179 views
-
OpenAI-Compatible API Gateway with LiteLLM
Set up LiteLLM as a unified API gateway providing OpenAI-compatible endpoints for any LLM provider.
716 views
-
Set Up a Private AI Coding Assistant on Your VPS
Deploy a private AI coding assistant like Tabby or Continue.dev on your VPS with local LLM inference for secure, unlimited code completions.
147 views
-
Run GGUF Models with llama.cpp for CPU Inference
Learn to run quantized GGUF language models on CPU using llama.cpp with optimized inference settings, systemd service setup, and production tuning.
175 views
-
Build a Document Q&A System with LangChain and RAG
Build a complete document Q&A system using LangChain, ChromaDB, and local LLMs with RAG for accurate, context-grounded answers from your own documents.
175 views
-
Automate Model Updates and Management with Ollama
Automate Ollama model updates with scheduled scripts, custom Modelfiles, version management, health monitoring, and resource optimization on your VPS.
136 views
-
Build an AI-Powered Search Engine with Vector Embeddings
Build an AI-powered semantic search engine using vector embeddings, Qdrant or pgvector, and sentence-transformers for intelligent document search on your VPS.
185 views
-
Running Stable Diffusion XL on a Budget VPS
Set up Stable Diffusion XL (SDXL) on a budget VPS with step-by-step instructions and best practices for production environments.
4 views
-
How to Set Up a JupyterHub Server for Team Collaboration
JupyterHub is a multi-user server that allows entire teams to work in Jupyter notebooks simultaneously.
27 views
-
How to Deploy a Machine Learning Model with FastAPI
FastAPI is a modern, high-performance Python web framework ideal for serving machine learning models as REST APIs.
34 views
-
How to Install and Use Stable Diffusion on Your Breeze
Stable Diffusion is an open-source text-to-image AI model that generates high-quality images from text prompts.
29 views
-
How to Set Up a Vector Database with Qdrant
Qdrant is an open-source vector similarity search engine designed for storing and querying high-dimensional embeddings.
29 views
-
How to Run Whisper Speech-to-Text on Your Breeze
Whisper is an open-source automatic speech recognition (ASR) model that can transcribe audio in dozens of languages.
28 views
-
How to Set Up a RAG Pipeline with LangChain
Retrieval-Augmented Generation (RAG) combines the power of large language models with your own documents to produce grounded, accurate answers.
28 views
-
How to Fine-Tune a Language Model on Your Own Data
Fine-tuning adapts a pre-trained language model to your specific domain or task by training it further on your own data.
27 views
-
How to Deploy TensorFlow Serving for Model Inference
TensorFlow Serving is a production-grade serving system for deploying machine learning models. It provides versioned model management and both gRPC and REST inference APIs.
30 views
-
How to Set Up MLflow for Experiment Tracking
MLflow is an open-source platform for managing the full machine learning lifecycle, including experiment tracking, model packaging, and a model registry.
32 views
-
How to Install and Use ComfyUI for AI Image Generation
ComfyUI is a powerful, node-based interface for Stable Diffusion that gives you complete control over image generation workflows.
30 views
-
How to Set Up a ChatGPT-Compatible API with LocalAI
LocalAI is an open-source drop-in replacement for the OpenAI API that runs entirely on your own hardware.
31 views
-
How to Use vLLM for High-Performance LLM Serving
vLLM is a high-throughput, memory-efficient inference engine for large language models. It uses PagedAttention for efficient KV cache management.
33 views
-
How to Set Up Label Studio for Data Annotation
Label Studio is an open-source data labeling platform that supports text, image, audio, video, and time-series data.
29 views
-
How to Deploy a Hugging Face Model on Your Breeze
Hugging Face hosts thousands of pre-trained models for natural language processing, computer vision, and audio tasks.
30 views
-
How to Build an AI Chatbot with Open-Source Models
Building your own AI chatbot with open-source models gives you full control over the model, data privacy, and costs.
28 views
-
How to Set Up a Vector Database with Qdrant on a VPS
Qdrant is a high-performance vector database designed for similarity search and AI applications. Running it on your Breeze gives you full control over your embedding data.
26 views
-
How to Build a RAG Pipeline on Your Server
Retrieval-Augmented Generation (RAG) combines document retrieval with language model generation to produce grounded, accurate answers. Building a RAG pipeline on your own server keeps your documents under your control.
30 views
-
How to Set Up Automatic1111 for Stable Diffusion WebUI
Automatic1111 is the most popular open-source web interface for running Stable Diffusion image generation. A GPU-enabled Breeze makes it easy to generate AI images.
26 views
-
How to Fine-Tune Language Models on a VPS
Fine-tuning allows you to adapt a pre-trained language model to your specific domain or task. Running the process on your own Breeze keeps proprietary training data private.
29 views
-
How to Set Up SearXNG Private Search Engine
SearXNG is a privacy-respecting metasearch engine that aggregates results from multiple sources without tracking. Hosting it on your Breeze gives you a self-hosted, private search engine.
29 views
-
How to Install Ollama on a VPS
Ollama makes it easy to run large language models locally on your Breeze. It handles model downloading, quantization, and serving.
30 views
-
Running Large Language Models on a VPS
Running LLMs on your own Breeze gives you full control over data privacy, latency, and costs. This guide covers the key considerations.
32 views
-
How to Set Up Open WebUI for Local AI Chat
Open WebUI provides a polished chat interface for interacting with local LLMs. It connects to Ollama or any OpenAI-compatible API.
31 views
-
How to Deploy Stable Diffusion on a Server
Stable Diffusion generates images from text prompts. You can self-host it on your Breeze for private, unlimited image generation.
32 views
-
How to Set Up Jupyter Notebook on a VPS
Jupyter Notebook provides an interactive environment for data science, machine learning experimentation, and code prototyping.
27 views
-
How to Install TensorFlow on Linux
TensorFlow is an open-source machine learning framework for building and training neural networks. This guide covers installation on Linux.
29 views
-
How to Install PyTorch on a VPS
PyTorch is a popular deep learning framework known for its flexibility and Pythonic API. It runs well on CPU-based Breezes for smaller models and prototyping.
27 views
-
How to Deploy LocalAI for API-Compatible LLM Hosting
LocalAI is an OpenAI API-compatible server that runs LLMs, image generation, and audio models locally on your own hardware.
24 views
-
How to Set Up Whisper for Speech-to-Text
OpenAI Whisper is an open-source speech recognition model that transcribes audio to text with high accuracy. Run it on your Breeze for private transcription.
29 views
-
Optimizing Memory for Large Language Models
Memory is the primary bottleneck when running LLMs on a Breeze. These techniques help you run larger models within limited RAM.
26 views
-
How to Deploy Flowise AI Workflow Builder
Flowise is a drag-and-drop tool for building LLM workflows and chatbots. Deploy it on your Breeze for a visual AI application builder.
28 views
-
How to Self-Host LibreChat
LibreChat is a feature-rich chat interface that supports multiple AI backends. Self-hosting it on your Breeze gives you a private, fully customizable chat platform.
33 views
-
GPU vs CPU for AI Workloads on VPS
Choosing between GPU and CPU for AI tasks on your Breeze depends on the workload type, budget, and performance requirements.
30 views
-
How to Set Up Text Generation WebUI
Text Generation WebUI (oobabooga) is a full-featured interface for running LLMs with advanced options for sampling, LoRA adapters, and extensions.
28 views
-
How to Set Up ComfyUI for AI Image Generation
ComfyUI is a node-based interface for Stable Diffusion that gives you precise control over image generation workflows.
31 views
-
Vector Database Comparison: Milvus vs Qdrant vs Weaviate
Compare Milvus, Qdrant, and Weaviate on installation, configuration, and performance for Linux VPS environments.
4 views
-
Fine-Tuning LLMs on Cloud VPS Instances
Fine-tune LLMs on cloud VPS instances, from initial setup to production-ready configuration.
4 views
-
Optimizing PyTorch Inference on CPU-Only Servers
Optimize PyTorch inference on CPU-only servers, from initial setup to production-ready configuration.
6 views
-
Building an AI Chatbot with LangChain and Redis
Step-by-step tutorial for building a chatbot with LangChain and Redis on Ubuntu/Debian servers, with practical code examples and troubleshooting tips.
4 views
-
GPU Passthrough for Machine Learning Workloads
A comprehensive guide to GPU passthrough covering installation, configuration, and optimization for machine learning workloads on Linux VPS environments.
6 views
-
Setting Up a Private ChatGPT Instance with LocalAI
Step-by-step tutorial for setting up a private ChatGPT-style instance with LocalAI on Ubuntu/Debian servers, with practical code examples and troubleshooting tips.
5 views
-
Deploying Whisper Speech Recognition on Linux
Set up Whisper speech recognition on your Linux VPS with step-by-step instructions and best practices for production environments.
7 views
-
Building a RAG Pipeline with ChromaDB
Build a RAG pipeline with ChromaDB on your VPS, from initial setup to production-ready configuration.
5 views