Guozhen AIGlobal AI field notes and model intelligence

English translation

26. DeepSeek + RealAgent: Automate End-to-End Workflows with a Single Command

Published:

Category: DeepSeek Learning

Read time: 7 min

Reads: 0

Lesson #26Views are counted together with the original Chinese articleImages are preserved from the source page

DeepSeek + RealAgent: Generating Fully Automated Workflows with a Single Instruction — Practical Test Record

Generating workflows from a single instruction sounds incredibly appealing—but what truly matters to me is where and when the system pauses for human confirmation. Actions involving files, accounts, payments, or publishing must never proceed fully autonomously. Good automation should eliminate redundant steps—not responsibility.

When designing workflows, clearly label three types of nodes:

  • Fully automatable steps,
  • Steps requiring mandatory human approval, and
  • Steps that trigger rollback upon failure.

Once these categories are rigorously distinguished, automation becomes significantly more robust—and far better suited for real-world work.

Recently, nearly one hundred readers have reached out to me daily via private messages. Their questions cluster around three main topics: whether personal knowledge base software supports offline use, how to develop such tools, and underlying technical principles.

This article addresses those questions head-on—readers interested in these topics will find it especially useful.

Currently, mainstream personal knowledge base software runs remotely in the cloud. That means your documents are uploaded to remote servers for processing—and thus cannot operate in a purely offline mode.

Curious readers may ask: Why do most knowledge base tools require online cloud operation instead of supporting pure offline use? If you’re intrigued by this question, keep reading—the explanation follows below.

1. Personal Knowledge Bases

Large language models (LLMs) have already absorbed most publicly available human knowledge—but they remain entirely unaware of your personal files or proprietary corporate documents. Consequently, their responses to queries about such private data lack precision. For example, asking DeepSeek about this month’s supermarket spending yields no answer—precisely because, as the model itself states, the required data source is missing:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #01

To solve this problem, the first approach leverages the LLM’s strong generalization capability: it functions as a zero-shot learner, capable of extracting insights from previously unseen documents. You simply upload a file, and the model learns from it on-the-fly to answer your question. However, this method has an obvious drawback—it forces you to manually locate and re-upload files every time, which is highly inconvenient.

So researchers proposed a second solution: build a personal knowledge base, where all your documents reside centrally—eliminating repeated file searches and uploads.

This led to the development of Retrieval-Augmented Generation (RAG), and—driven by rapid advances in AI agent technology over the past two years—to even more sophisticated RAG agents.

2. The “Brain” of the RAG Agent

A RAG agent consists of two core components:

  • Its decision-making “brain”, typically a large language model, and
  • External plugins that connect the LLM to your personal knowledge.

I’m currently developing such a RAG application called DeepSeekMine, now in active v0.6 iteration—with the goal of enabling fast, fully offline local execution.

Before starting development, we surveyed today’s most popular RAG agents—and found that the vast majority operate exclusively in the cloud. Two stood out:

  • IMA, developed by Tencent (primarily an end-user application), and
  • RealAgent, developed by RealAI (a platform supporting both using and building intelligent agents).

Below, we focus specifically on RealAgent’s workflow for building a personal knowledge base agent.

First, navigate to RealAgent’s agent-building interface—shown in its entirety below:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #02

The top-right corner of that interface is zoomed in below. Click the “Create Agent” button:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #03

You’ll then enter a drag-and-drop workflow builder—essentially a visual pipeline comprising input → LLM → optional plugins → output:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #04

For building a knowledge-base agent, two steps are critical:

  1. Selecting an appropriate large language model, and
  2. Uploading and configuring your personal knowledge base.

Let’s begin with step one—choosing the right LLM. The model selection interface appears as follows:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #05

Options include Qwen2.5, TARS-67B (RealAI’s in-house model), and DeepSeek-R1. Notably, smaller LLMs are rapidly improving in performance—making local deployment increasingly practical. Choose any model based on preference; here, we select DeepSeek-R1-70B:

image-20250327232442570

That completes the first phase: selecting an LLM via the UI. Next, we cover the second phase—constructing the knowledge base.

3. External Plugins for the RAG Agent

While the LLM serves as the RAG agent’s “brain,” the personal knowledge base functions as its external plugin. RealAgent’s knowledge-base setup is entirely GUI-driven and highly configurable.

For instance, the first configuration option is N-out-of-1 retrieval vs. multi-path retrieval:

  • N-out-of-1 selects the single most relevant document within one knowledge base.
  • Multi-path retrieves fragments across multiple knowledge bases simultaneously:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #06

Another key RAG parameter is Score, shown below with its default value of 0.5. This represents the cosine similarity between the user’s query vector and a retrieved text chunk. Higher values indicate stronger semantic alignment—and 0.5 is already considered excellent:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #07

Empirically, lowering this threshold slightly (e.g., to 0.45) increases recall—allowing more candidate chunks to be fed into the LLM, thereby maximizing contextual relevance:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #08

That covers the main knowledge-base configuration parameters.

Next, let’s upload personal documents to the remote knowledge base. Click the + button:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #09

Then click the blue link labeled “Go Create”:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #10

A new web page opens:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #11

Click “Create New Knowledge Base” in the top-right corner:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #12

Enter a name and description:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #13

Click the blue “Select Files”, which supports numerous formats: .txt, .md, .pdf, .html, .xlsx, .docx, .csv, .doc, and more:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #14

Now upload the DeepSeek-R1 paper PDF:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #15

And similarly upload the DeepSeek-V3 paper:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #16

Upon completion, upload progress appears—confirming both files succeeded:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #17

After upload, RealAgent automatically segments documents using its default logic:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #18

Once segmentation finishes, the interface confirms successful upload:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #19

RealAgent’s visualization is exceptionally clear: here, we see the original PDF split into 46 segments. Note that these “segments” don’t necessarily align with natural paragraphs—they’re algorithmically determined semantic units:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #20

At this point, your personal knowledge base is fully built. Next, you can instantly create a knowledge-retrieval agent via drag-and-drop:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #21

Within that module, select the newly created knowledge base. To test, first enable just one document and click Add:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #22

Congratulations—you’ve just built your first knowledge-base agent using RealAgent! Now ask it a question. It will automatically:

  • Retrieve relevant knowledge snippets (RAG),
  • Inject them into the configured LLM, and
  • Generate a grounded response.

Try: “How was DeepSeek-R1-Zero trained?”

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #23

Watch the internal workflow activate—from Input Start, through Knowledge Retrieval #2, to the LLM, and finally to Response Output:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #24

Expanding Knowledge Retrieval #2, you’ll see the original query embedded in the query field:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #25

And the matched knowledge-base segment—with a similarity score of 0.5486:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #26

The final response draws comprehensively from the source material, structured into three clear sections covering DeepSeek-R1-Zero’s introduction and training methodology:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #27

Once verified, publish the agent. Before publishing, rename it to something descriptive—e.g., “Knowledge Base Agent”:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #28

After publishing, the agent appears in your agent list:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #29

Return to RealAgent’s chat interface, type @, and your new agent appears automatically:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #30

Select it—and begin asking questions immediately:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #31

For example, ask about DeepSeek-R1’s other key algorithm—GRPO (Group Relative Policy Optimization)—and observe its accurate, well-grounded response:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #32

That concludes our full walkthrough of building a knowledge-base agent with RealAgent. Interested users can download the software and follow these exact steps:

https://www.ai-indeed.com/downloadCenter

4. Fully Autonomous Agents

Beyond ease of agent creation, RealAgent offers another powerful capability: one-shot workflow generation. Simply describe a task in plain language—and RealAgent auto-generates and executes the entire operational sequence.

Example: Extract the Douban Movie Top Chart (movie titles + ratings). Just type:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #33

RealAgent instantly generates a precise 4-step workflow:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #34

Press Enter, and it executes all steps automatically: opens the webpage, locates the chart, parses titles and scores—all within ~60 seconds.

Finally, it outputs clean, formatted results:

DeepSeek + RealAgent: Screenshot of Automated Workflow Operation #35

One sentence → auto-generated workflow → one keystroke → full automation. Truly impressive. Try it yourself using the steps above.

It’s worth reflecting: before LLMs and intelligent agents, building such a scraper would take a developer at least an hour. Today? One minute.

Summary

This article answers common questions about integrating LLMs with personal knowledge bases—including the online/offline trade-offs. As noted, mainstream RAG solutions today are cloud-based.

Regarding agent development, we demonstrated—using RealAgent—a practical, code-free method to build a knowledge-base agent. Key advantages include:

  1. Zero coding required—accessible to non-programmers;
  2. Intuitive drag-and-drop interface—streamlined UX;
  3. Polished, highly visual configuration UI—especially for knowledge-base settings.

We also explained core RAG concepts: origins of knowledge-base agents, architectural components, and critical algorithmic parameters (Top-K, Score, retrieval strategies).

Finally, we showcased RealAgent’s fully autonomous agent feature: describe a task in one sentence → generate a workflow → execute it with one keystroke.

Continue

Keep reading from here

Browse English site

Reader Messages

Reader messages

Questions, corrections, extra sources, or hands-on results can be left here. No login is required.

Max 800 characters

To reduce spam, each message is checked for length, link count, and posting frequency.

0/800

Messages

0 messages
Loading messages...