Large Language Models (LLMs) are revolutionizing how we interact with information. Imagine having a powerful LLM, customized with your own knowledge base, running entirely on your local machine. This article guides you through setting up DeepSeek R1 with Ollama and AnythingLLM, allowing you to create a personalized and private AI assistant.
DeepSeek R1 stands out as a powerful open-source LLM, offered in several sizes to suit different hardware. When comparing sizes, it helps to understand the concept of "distillation." The full DeepSeek R1 is a 671B-parameter base model; distillation creates smaller, more efficient versions (such as 1.5B, 7B, or 8B parameter models) by training compact models, built on architectures like Qwen (通义千问) or Llama, to reproduce the large model's behavior. These distilled models retain much of the original model's knowledge while requiring far less computational power.
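As a rough illustration of the idea (not DeepSeek's actual training recipe), distillation is commonly implemented by training the small "student" model to match the softened output distribution of the large "teacher" model. A minimal PyTorch sketch of that loss:

```python
# Minimal knowledge-distillation loss sketch (illustrative only; not
# DeepSeek's actual recipe). The student learns to match the teacher's
# softened output distribution.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Soften both distributions with a temperature, then measure how far
    # the student's distribution is from the teacher's (KL divergence).
    student_log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    teacher_probs = F.softmax(teacher_logits / temperature, dim=-1)
    return F.kl_div(student_log_probs, teacher_probs,
                    reduction="batchmean") * temperature ** 2

# Toy example: a batch of 4 "tokens" over a 10-word vocabulary.
student = torch.randn(4, 10)
teacher = torch.randn(4, 10)
print(distillation_loss(student, teacher).item())
```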
Here's a breakdown of the DeepSeek R1 model sizes and their requirements, as mentioned in the original article:
| Model Size | Minimum GPU Memory | Recommended GPU | CPU Memory | Use Case |
|---|---|---|---|---|
| 1.5B | 4GB | RTX 3050 | 8GB | Personal Learning |
| 7B/8B | 16GB | RTX 4090 | 32GB | Small Projects |
| 14B | 24GB | A5000 x2 | 64GB | Professional Use |
| 32B | 48GB | A100 40GB x2 | 128GB | Enterprise Service |
| 70B | 80GB | A100 80GB x4 | 256GB | High-Performance Computing |
| 671B | 640GB+ | H100 Cluster | N/A | Supercomputing/Cloud Computing |

Note: This information is based on recommendations and may vary depending on your specific setup and usage.
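If you're not sure how much GPU memory your machine has, a quick check with PyTorch (assuming it is installed with CUDA support) can help you pick a model size:

```python
# Quick check of available GPU memory to help choose a model size.
# Assumes PyTorch is installed; falls back gracefully on CPU-only machines.
import torch

if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"GPU: {props.name}, {props.total_memory / 1024**3:.1f} GB memory")
else:
    print("No CUDA GPU detected; consider the 1.5B model on CPU.")
```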
This article focuses on a smaller model size, suitable for personal use and for machines with limited GPU resources. Before we begin, check the table above to make sure your system meets the requirements for the model size you plan to run.
Ollama is a powerful tool that simplifies running LLMs locally. It handles the complexities of managing dependencies and configurations, making it easy to get started.

Install Ollama: Download the installer from the official Ollama website (https://ollama.com) and run it. On Windows, Ollama installs to its default location (e.g., `C:\Program Files\Ollama`) and is added to your PATH, so you can run the `ollama` command from any terminal window.

Verify the Installation: Open a new terminal and run `ollama`. If Ollama is installed correctly, you should see a list of available commands.

Now that Ollama is installed, you can download the DeepSeek-R1 model.
Pull the Model: In your terminal, run the following command to download the 8B version of the DeepSeek-R1 model:

```
ollama run deepseek-r1:8b
```

This command downloads the model weights and then starts the model. (If you only want to download the weights without starting a session, use `ollama pull deepseek-r1:8b` instead.) The first download may take a while, depending on your internet speed.
List Installed Models: To confirm the model has been downloaded successfully, run the following command:

```
ollama list
```

This will display a list of all models currently installed by Ollama.
Run the Model: Once downloaded, you can run the model directly from the command line:

```
ollama run deepseek-r1:8b
```

This will start the DeepSeek-R1 model in interactive mode, allowing you to start asking questions and testing its capabilities.
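Beyond the interactive CLI, Ollama also serves a local HTTP API (on port 11434 by default), which is how tools like AnythingLLM talk to the model. A minimal sketch in Python, assuming the 8B model from above is installed and the Ollama service is running:

```python
# Query a locally running Ollama model via its HTTP API.
# Assumes Ollama is running on the default port (11434) and that
# deepseek-r1:8b has already been downloaded.
import requests

response = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "deepseek-r1:8b",
        "prompt": "Explain model distillation in one sentence.",
        "stream": False,  # return the full answer in one JSON object
    },
    timeout=300,
)
response.raise_for_status()
print(response.json()["response"])
```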
AnythingLLM provides a user-friendly interface for interacting with your local LLM and building knowledge bases.
AnythingLLM allows you to upload documents and create a knowledge base that the LLM can use to answer your questions.
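Under the hood, knowledge-base tools like AnythingLLM typically split uploaded documents into overlapping chunks, embed them, and retrieve the most relevant chunks at question time. An illustrative chunker in Python (the chunk size and overlap values are arbitrary example choices, not AnythingLLM's actual settings):

```python
# Illustrative document chunking, conceptually similar to what
# knowledge-base tools do before embedding text for retrieval.
# The sizes below are arbitrary example values.
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping chunks of roughly chunk_size characters."""
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + chunk_size, len(text))
        chunks.append(text[start:end])
        if end == len(text):
            break
        start = end - overlap  # overlap preserves context across boundaries
    return chunks

if __name__ == "__main__":
    sample = "Large Language Models are revolutionizing search. " * 40
    pieces = chunk_text(sample)
    print(f"{len(pieces)} chunks; first chunk starts: {pieces[0][:60]}...")
```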
To effectively use DeepSeek R1 with your knowledge base, you need to connect it within AnythingLLM and configure appropriate chat prompts.
Select Ollama as your Model Provider: In AnythingLLM's settings, choose Ollama as the model provider. This tells AnythingLLM to use the local Ollama installation to run the LLM.
Specify the Model: Select "deepseek-r1:8b" (or the version you downloaded) as the model to use.
Customize Chat Prompts: This is a crucial step. You need to create effective chat prompts that instruct the LLM on how to use the knowledge base. For example, you could use a prompt like:
"You are a helpful research assistant. Use the provided documents to answer the following question. If the answer is not found in the documents, respond that you cannot answer based on the provided information."
Experiment with different prompts to see what works best for your specific use case. Clear and specific prompts will lead to more accurate and relevant answers.
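To make the prompt's role concrete, here is a simplified sketch of how a knowledge-base query is typically assembled behind the scenes: the system prompt, the retrieved document chunks, and the user's question are combined into a single request to the model. The helper name and prompt layout are illustrative, not AnythingLLM's internals:

```python
# Illustrative sketch of how a RAG-style prompt is assembled and sent
# to a local Ollama model. The retrieved_chunks list stands in for
# whatever the knowledge-base tool actually retrieves.
import requests

SYSTEM_PROMPT = (
    "You are a helpful research assistant. Use the provided documents to "
    "answer the following question. If the answer is not found in the "
    "documents, respond that you cannot answer based on the provided "
    "information."
)

def answer_with_context(question: str, retrieved_chunks: list[str]) -> str:
    context = "\n\n".join(retrieved_chunks)
    prompt = f"{SYSTEM_PROMPT}\n\nDocuments:\n{context}\n\nQuestion: {question}"
    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={"model": "deepseek-r1:8b", "prompt": prompt, "stream": False},
        timeout=300,
    )
    resp.raise_for_status()
    return resp.json()["response"]

if __name__ == "__main__":
    print(answer_with_context(
        "What does the report say about Q3 revenue?",
        ["Q3 revenue grew 12% year over year, driven by subscriptions."],
    ))
```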
With everything set up, you can now start asking questions and leveraging your local LLM knowledge base. In the AnythingLLM interface, type your question and submit it. The LLM will use the uploaded documents and the chat prompt to generate an answer.
By following these steps, you can create a powerful and private LLM knowledge base on your local machine, enabling you to explore, learn, and innovate with AI.