1、Ollama Overview
1.1、Summary
Ollama is an easy-to-use framework, written in Go, for running large language models locally. Its workflow of pulling, running, and pushing models can be loosely compared to Docker's image workflow.
1.2、GitHub Repository
https://github.com/ollama/ollama
# Model library
https://ollama.com/library
2、Installation
curl -fsSL https://ollama.com/install.sh | sh
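Once the install script finishes, a quick sanity check confirms the binary and the systemd service it registers on Linux (a sketch, assuming a systemd-based distribution):

```shell
# Confirm the binary is on PATH and report its version
ollama --version
# The Linux installer also creates and enables a systemd service
systemctl is-enabled ollama
systemctl is-active ollama
```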
3、Ollama Commands
3.1、Command-Line Syntax
Usage:
  ollama [flags]
  ollama [command]

Available Commands:
  serve       Start ollama
  create      Create a model from a Modelfile
  show        Show information for a model
  run         Run a model
  stop        Stop a running model
  pull        Pull a model from a registry
  push        Push a model to a registry
  list        List models
  ps          List running models
  cp          Copy a model
  rm          Remove a model
  help        Help about any command

Flags:
  -h, --help      help for ollama
  -v, --version   Show version information
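The subcommands above combine into a typical model lifecycle; a sketch of the common sequence (llama3.2 is an example model name):

```shell
ollama pull llama3.2     # download a model from the registry
ollama list              # list models stored locally
ollama run llama3.2      # interactive chat; pulls the model first if missing
ollama ps                # show models currently loaded in memory
ollama stop llama3.2     # unload a running model from memory
ollama rm llama3.2       # delete the model's local files
```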
3.2、Check Ollama Status
systemctl status ollama
# Service unit file: /etc/systemd/system/default.target.wants/ollama.service
3.3、Start Ollama
sudo systemctl start ollama
3.4、Custom Configuration
sudo systemctl edit ollama
# Or manually create the override file /etc/systemd/system/ollama.service.d/override.conf
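A minimal override file might look like the following sketch; the environment variables shown are standard Ollama settings, but the values (bind address, models directory) are example assumptions to adapt:

```shell
sudo mkdir -p /etc/systemd/system/ollama.service.d
sudo tee /etc/systemd/system/ollama.service.d/override.conf <<'EOF'
[Service]
# Listen on all interfaces instead of localhost only
Environment="OLLAMA_HOST=0.0.0.0:11434"
# Example custom model storage path (assumption; default is /usr/share/ollama/.ollama/models)
Environment="OLLAMA_MODELS=/data/ollama/models"
EOF
# Drop-in changes require a daemon reload and a service restart
sudo systemctl daemon-reload
sudo systemctl restart ollama
```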
3.5、Upgrade
curl -fsSL https://ollama.com/install.sh | sh
3.6、View Runtime Logs
journalctl -e -u ollama
3.7、Uninstall Ollama
sudo systemctl stop ollama
sudo systemctl disable ollama
sudo rm /etc/systemd/system/ollama.service
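The three commands above only stop the service and remove its unit file. Per the official Linux documentation, the binary, downloaded models, and the service account created by the installer can also be removed; a sketch (paths may differ on your system):

```shell
# Remove the binary (verify its location first with `which ollama`)
sudo rm "$(which ollama)"
# Remove downloaded models and the service user/group created by the installer
sudo rm -r /usr/share/ollama
sudo userdel ollama
sudo groupdel ollama
```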
3.8、Allow External Access to Ollama
vim /etc/systemd/system/default.target.wants/ollama.service
# Add the Environment line to the [Service] section:
[Service]
Type=simple
User=
WorkingDirectory=/path/to/ollama
ExecStart=/usr/bin/ollama serve
Restart=on-failure
Environment="OLLAMA_HOST=0.0.0.0:11434"   # add this line
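Editing a unit file does not take effect until systemd reloads it; after restarting, reachability can be checked from another machine. A sketch (SERVER_IP is a placeholder for your host's address, and any firewall must allow port 11434):

```shell
sudo systemctl daemon-reload
sudo systemctl restart ollama
# From another machine on the network:
curl http://SERVER_IP:11434
# The root endpoint responds with "Ollama is running" when the server is reachable
```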
4、Models
Note:
You should have at least 8 GB of RAM available to run the 7B models, 16 GB to run the 13B models, and 32 GB to run the 33B models.
Ollama supports the models available at ollama.com/library, including:
| Model | Parameters | Size | Download |
| --- | --- | --- | --- |
| Llama 3.3 | 70B | 43GB | ollama run llama3.3 |
| Llama 3.2 | 3B | 2.0GB | ollama run llama3.2 |
| Llama 3.2 | 1B | 1.3GB | ollama run llama3.2:1b |
| Llama 3.2 Vision | 11B | 7.9GB | ollama run llama3.2-vision |
| Llama 3.2 Vision | 90B | 55GB | ollama run llama3.2-vision:90b |
| Llama 3.1 | 8B | 4.7GB | ollama run llama3.1 |
| Llama 3.1 | 405B | 231GB | ollama run llama3.1:405b |
| Phi 4 | 14B | 9.1GB | ollama run phi4 |
| Phi 3 Mini | 3.8B | 2.3GB | ollama run phi3 |
| Gemma 2 | 2B | 1.6GB | ollama run gemma2:2b |
| Gemma 2 | 9B | 5.5GB | ollama run gemma2 |
| Gemma 2 | 27B | 16GB | ollama run gemma2:27b |
| Mistral | 7B | 4.1GB | ollama run mistral |
| Moondream 2 | 1.4B | 829MB | ollama run moondream |
| Neural Chat | 7B | 4.1GB | ollama run neural-chat |
| Starling | 7B | 4.1GB | ollama run starling-lm |
| Code Llama | 7B | 3.8GB | ollama run codellama |
| Llama 2 Uncensored | 7B | 3.8GB | ollama run llama2-uncensored |
| LLaVA | 7B | 4.5GB | ollama run llava |
| Solar | 10.7B | 6.1GB | ollama run solar |
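Besides the interactive `ollama run` commands in the table, a running Ollama server exposes a REST API on port 11434. A minimal sketch against the `/api/generate` endpoint (the model name is an example from the table and must already be pulled):

```shell
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.2",
  "prompt": "Why is the sky blue?",
  "stream": false
}'
```

With `"stream": false` the server returns a single JSON object containing the full response instead of a stream of partial chunks.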
5、Community Integrations
5.1、Web & Desktop
- Open WebUI
- Enchanted (macOS native)
- Hollama
- Lollms-Webui
- LibreChat
- Bionic GPT
- HTML UI
- Saddle
- Chatbot UI
- Chatbot UI v2
- Typescript UI
- Minimalistic React UI for Ollama Models
- Ollamac
- big-AGI
- Cheshire Cat assistant framework
- Amica
- chatd
- Ollama-SwiftUI
- Dify.AI
- MindMac
- NextJS Web Interface for Ollama
- Msty
- Chatbox
- WinForm Ollama Copilot
- NextChat with Get Started Doc
- Alpaca WebUI
- OllamaGUI
- OpenAOE
- Odin Runes
- LLM-X (Progressive Web App)
- AnythingLLM (Docker + MacOs/Windows/Linux native app)
- Ollama Basic Chat: Uses HyperDiv Reactive UI
- Ollama-chats RPG
- IntelliBar (AI-powered assistant for macOS)
- QA-Pilot (Interactive chat tool that can leverage Ollama models for rapid understanding and navigation of GitHub code repositories)
- ChatOllama (Open Source Chatbot based on Ollama with Knowledge Bases)
- CRAG Ollama Chat (Simple Web Search with Corrective RAG)
- RAGFlow (Open-source Retrieval-Augmented Generation engine based on deep document understanding)
- StreamDeploy (LLM Application Scaffold)
- chat (chat web app for teams)
- Lobe Chat with Integrating Doc
- Ollama RAG Chatbot (Local Chat with multiple PDFs using Ollama and RAG)
- BrainSoup (Flexible native client with RAG & multi-agent automation)
- macai (macOS client for Ollama, ChatGPT, and other compatible API back-ends)
- RWKV-Runner (RWKV offline LLM deployment tool, also usable as a client for ChatGPT and Ollama)
- Ollama Grid Search (app to evaluate and compare models)
- Olpaka (User-friendly Flutter Web App for Ollama)
- OllamaSpring (Ollama Client for macOS)
- LLocal.in (Easy to use Electron Desktop Client for Ollama)
- Shinkai Desktop (Two click install Local AI using Ollama + Files + RAG)
- AiLama (A Discord User App that allows you to interact with Ollama anywhere in discord )
- Ollama with Google Mesop (Mesop Chat Client implementation with Ollama)
- R2R (Open-source RAG engine)
- Ollama-Kis (A simple easy to use GUI with sample custom LLM for Drivers Education)
- OpenGPA (Open-source offline-first Enterprise Agentic Application)
- Painting Droid (Painting app with AI integrations)
- Kerlig AI (AI writing assistant for macOS)
- AI Studio
- Sidellama (browser-based LLM client)
- LLMStack (No-code multi-agent framework to build LLM agents and workflows)
- BoltAI for Mac (AI Chat Client for Mac)
- Harbor (Containerized LLM Toolkit with Ollama as default backend)
- PyGPT (AI desktop assistant for Linux, Windows and Mac)
- Alpaca (An Ollama client application for linux and macos made with GTK4 and Adwaita)
- AutoGPT (AutoGPT Ollama integration)
- Go-CREW (Powerful Offline RAG in Golang)
- PartCAD (CAD model generation with OpenSCAD and CadQuery)
- Ollama4j Web UI - Java-based Web UI for Ollama built with Vaadin, Spring Boot and Ollama4j
- PyOllaMx - macOS application capable of chatting with both Ollama and Apple MLX models.
- Claude Dev - VSCode extension for multi-file/whole-repo coding
- Cherry Studio (Desktop client with Ollama support)
- ConfiChat (Lightweight, standalone, multi-platform, and privacy focused LLM chat interface with optional encryption)
- Archyve (RAG-enabling document library)
- crewAI with Mesop (Mesop Web Interface to run crewAI with Ollama)
- Tkinter-based client (Python tkinter-based Client for Ollama)
- LLMChat (Privacy focused, 100% local, intuitive all-in-one chat interface)
- Local Multimodal AI Chat (Ollama-based LLM Chat with support for multiple features, including PDF RAG, voice chat, image-based interactions, and integration with OpenAI.)
- ARGO (Locally download and run Ollama and Huggingface models with RAG on Mac/Windows/Linux)
- OrionChat - OrionChat is a web interface for chatting with different AI providers
- G1 (Prototype of using prompting strategies to improve the LLM's reasoning through o1-like reasoning chains.)
- Web management (Web management page)
- Promptery (desktop client for Ollama.)
- Ollama App (Modern and easy-to-use multi-platform client for Ollama)
- SpaceLlama (Firefox and Chrome extension to quickly summarize web pages with ollama in a sidebar)
- YouLama (Webapp to quickly summarize any YouTube video, supporting Invidious as well)
- DualMind (Experimental app allowing two models to talk to each other in the terminal or in a web interface)
- ollamarama-matrix (Ollama chatbot for the Matrix chat protocol)
- ollama-chat-app (Flutter-based chat app)
- Perfect Memory AI (Productivity AI assists personalized by what you have seen on your screen, heard and said in the meetings)
- Hexabot (A conversational AI builder)
- Reddit Rate (Search and Rate Reddit topics with a weighted summation)
- OpenTalkGpt (Chrome Extension to manage open-source models supported by Ollama, create custom models, and chat with models from a user-friendly UI)
- VT (A minimal multimodal AI chat app, with dynamic conversation routing. Supports local models via Ollama)
- Nosia (Easy to install and use RAG platform based on Ollama)
- Witsy (An AI Desktop application available for Mac/Windows/Linux)
- Abbey (A configurable AI interface server with notebooks, document storage, and YouTube support)
- Minima (RAG with on-premises or fully local workflow)
- aidful-ollama-model-delete (User interface for simplified model cleanup)
- Perplexica (An AI-powered search engine & an open-source alternative to Perplexity AI)
- AI Toolkit for Visual Studio Code (Microsoft-official VSCode extension to chat, test, evaluate models with Ollama support, and use them in your AI applications.)
- MinimalNextOllamaChat (Minimal Web UI for Chat and Model Control)
- Chipper AI interface for tinkerers (Ollama, Haystack RAG, Python)
6、Hollama Installation
6.1、Official Repository
https://kkgithub.com/fmaclen/hollama
6.2、Hollama Configuration
7、DeepSeek Installation
ollama run deepseek-r1:14b
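`ollama run` starts an interactive prompt; once the model is pulled, it can also be queried programmatically through the chat API. A sketch (assumes the server is running locally and the 14B variant fits in available memory):

```shell
# Pull without starting an interactive session
ollama pull deepseek-r1:14b
# Query via the chat endpoint
curl http://localhost:11434/api/chat -d '{
  "model": "deepseek-r1:14b",
  "messages": [{"role": "user", "content": "Hello"}],
  "stream": false
}'
```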