WeKnora is an LLM-powered framework designed for deep document understanding and semantic retrieval, especially for handling complex, heterogeneous documents.
It adopts a modular architecture that combines multimodal preprocessing, semantic vector indexing, intelligent retrieval, and large language model inference. At its core, WeKnora follows the RAG (Retrieval-Augmented Generation) paradigm, enabling high-quality, context-aware answers by combining relevant document chunks with model reasoning.
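The RAG flow described above can be sketched in a few lines. The functions below (`embed`, `vector_search`, `llm_generate`) are hypothetical placeholders for WeKnora's real components, shown only to make the retrieve-then-generate shape concrete:

```python
# Minimal RAG sketch: retrieve relevant chunks, then ground the LLM answer in them.
# embed(), vector_search(), and llm_generate() are hypothetical placeholders,
# not WeKnora's actual APIs.

def embed(text: str) -> list[float]:
    # Placeholder: a real system calls an embedding model (e.g. BGE via Ollama).
    return [float(len(w)) for w in text.split()][:4]

def vector_search(query_vec: list[float], index: list[dict], top_k: int = 2) -> list[str]:
    # Placeholder ranking: toy dot-product similarity over stored chunk vectors.
    scored = sorted(index, key=lambda c: -sum(a * b for a, b in zip(query_vec, c["vec"])))
    return [c["text"] for c in scored[:top_k]]

def llm_generate(prompt: str) -> str:
    # Placeholder for an LLM call; here it just echoes the grounded prompt.
    return "ANSWER based on: " + prompt

def rag_answer(question: str, index: list[dict]) -> str:
    chunks = vector_search(embed(question), index)
    context = "\n".join(chunks)  # retrieved evidence the model must stay grounded in
    prompt = f"Context:\n{context}\n\nQuestion: {question}"
    return llm_generate(prompt)

index = [
    {"text": "WeKnora parses PDF and Word files.", "vec": embed("WeKnora parses PDF and Word files.")},
    {"text": "Retrieval uses BM25 and dense vectors.", "vec": embed("Retrieval uses BM25 and dense vectors.")},
]
print(rag_answer("Which file formats are parsed?", index))
```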
Website: https://weknora.weixin.qq.com
Version highlights for v0.3.5, v0.3.4, v0.3.3, v0.3.2, v0.3.0 (which added the DISABLE_REGISTRATION control), and v0.2.0 are documented in the project's release notes.
Important: Starting from v0.1.3, WeKnora includes login authentication to enhance system security. For production deployments, we strongly recommend reviewing the authentication settings, including the DISABLE_REGISTRATION control, before exposing the service.

WeKnora employs a modern modular design to build a complete document understanding and retrieval pipeline. The system primarily includes document parsing, vector processing, retrieval engine, and large model inference as core modules, with each component being flexibly configurable and extendable.
| Scenario | Applications | Core Value |
|---|---|---|
| Enterprise Knowledge Management | Internal document retrieval, policy Q&A, operation manual search | Improve knowledge discovery efficiency, reduce training costs |
| Academic Research Analysis | Paper retrieval, research report analysis, scholarly material organization | Accelerate literature review, assist research decisions |
| Product Technical Support | Product manual Q&A, technical documentation search, troubleshooting | Enhance customer service quality, reduce support burden |
| Legal & Compliance Review | Contract clause retrieval, regulatory policy search, case analysis | Improve compliance efficiency, reduce legal risks |
| Medical Knowledge Assistance | Medical literature retrieval, treatment guideline search, case analysis | Support clinical decisions, improve diagnosis quality |
| Module | Support | Description |
|---|---|---|
| Agent Mode | ✅ ReACT Agent Mode | Built-in tools for knowledge base retrieval, MCP tool calls, and web search; cross-knowledge base retrieval with multi-step iteration |
| Knowledge Base Types | ✅ FAQ / Document | FAQ and document knowledge bases with folder import, URL import, tag management, online entry, and knowledge move |
| Document Formats | ✅ PDF / Word / Txt / Markdown / HTML / Images (OCR + Caption) | Structured and unstructured document parsing; image text extraction via OCR; image caption generation via VLM |
| IM Channel Integration | ✅ WeCom / Feishu / Slack / Telegram / DingTalk / Mattermost | WebSocket and Webhook modes; streaming replies; slash commands (/help, /info, /search, /stop, /clear); per-user rate limiting; Redis-based multi-instance coordination |
| Model Management | ✅ Centralized configuration, built-in model sharing | Centralized model config with per-knowledge-base model selection; multi-tenant shared built-in model support |
| Embedding Models | ✅ Local models (Ollama), BGE / GTE / OpenAI-compatible APIs | Customizable embedding models compatible with local deployment and cloud vector generation APIs |
| Vector DB Integration | ✅ PostgreSQL (pgvector) / Elasticsearch / Milvus / Weaviate / Qdrant | Five vector index backends with flexible switching to match retrieval scenario requirements |
| Object Storage | ✅ Local / MinIO / AWS S3 / Volcengine TOS | Pluggable storage adapters for file and image assets; bucket auto-creation on startup |
| Retrieval Strategies | ✅ BM25 / Dense Retrieval / GraphRAG | Sparse/dense recall and knowledge graph-enhanced retrieval; customizable retrieve-rerank-generate pipeline |
| LLM Integration | ✅ Qwen / DeepSeek / MiniMax / NVIDIA / Novita AI / OpenAI-compatible | Local models via Ollama or external API services; thinking/non-thinking mode switching; vLLM streaming reasoning content support |
| Conversation Strategy | ✅ Agent model, normal model, retrieval threshold, Prompt configuration | Online Prompt editing; retrieval threshold tuning; precise multi-turn conversation behavior control |
| Web Search | ✅ DuckDuckGo / Bing / Google (extensible) | Pluggable search engine providers; web search toggle per conversation |
| MCP Tools | ✅ uvx / npx launchers, Stdio / HTTP Streamable / SSE | Extend agent capabilities via MCP; stable tool naming with collision protection; VLM auto-description for tool-returned images |
| Suggested Questions | ✅ Knowledge-base-driven question suggestions | Agent surfaces context-aware suggested questions in chat interface; image knowledge auto-generates questions |
| QA Capabilities | ✅ Context-aware, multi-turn dialogue, prompt templates | Complex semantic modeling, instruction control, chain-of-thought Q&A with configurable prompts and context windows |
| Security | ✅ AES-256-GCM at-rest encryption, SSRF protection | API keys encrypted at rest; SSRF-safe HTTP client for remote API calls; sandbox execution for agent skills |
| E2E Testing | ✅ Retrieval + generation visualization and metric evaluation | End-to-end test tools for evaluating recall hit rates, answer coverage, BLEU/ROUGE metrics |
| Deployment Modes | ✅ Local / Docker / Kubernetes (Helm) | Private and offline deployment; fast development mode with hot-reload; Helm chart for Kubernetes |
| User Interfaces | ✅ Web UI + RESTful API | Interactive web interface and standard API; Agent/normal mode switching; tool call process display |
| Task Management | ✅ MQ async tasks, automatic database migration | MQ-based async task state; automatic schema and data migration on version upgrade |
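As an illustration of how the sparse (BM25) and dense scores from the retrieval strategies above can be fused in a retrieve-rerank pipeline, here is a short sketch. The min-max normalization and the 0.5/0.5 weights are assumptions for the example, not WeKnora's actual implementation:

```python
# Illustrative hybrid retrieval: fuse normalized sparse (BM25-style) and dense scores.
# The equal weights and min-max normalization are assumptions for this sketch.

def minmax(scores: list[float]) -> list[float]:
    lo, hi = min(scores), max(scores)
    return [(s - lo) / (hi - lo) if hi > lo else 0.0 for s in scores]

def fuse(sparse: list[float], dense: list[float],
         w_sparse: float = 0.5, w_dense: float = 0.5) -> list[float]:
    ns, nd = minmax(sparse), minmax(dense)
    return [w_sparse * s + w_dense * d for s, d in zip(ns, nd)]

# Per-document scores from a BM25 pass and a dense (cosine) pass:
sparse_scores = [12.1, 3.4, 7.8]
dense_scores = [0.82, 0.91, 0.40]
fused = fuse(sparse_scores, dense_scores)
best = max(range(len(fused)), key=fused.__getitem__)
print(best, fused)
```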
Make sure the following tools are installed on your system: Git, Docker, and Docker Compose. The commands below also use make, and Ollama if you want to run local models.
# Clone the main repository
git clone https://github.com/Tencent/WeKnora.git
cd WeKnora
# Copy example env file
cp .env.example .env
# Edit .env and set required values
# All variables are documented in the .env.example comments
In the .env file, check which service images need to be started.
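For orientation only, a .env sketch might look like the following. Apart from DISABLE_REGISTRATION (mentioned in the release notes above), every variable name here is hypothetical; the authoritative list and documentation live in .env.example:

```shell
# Hypothetical .env sketch -- the real variable names are in .env.example.
DISABLE_REGISTRATION=false       # documented in the release notes above
# LLM_API_KEY=sk-...             # hypothetical: credentials for an external model API
# VECTOR_BACKEND=pgvector        # hypothetical: which vector index backend to use
```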
./scripts/start_all.sh
or
make start-all
ollama serve > /dev/null 2>&1 &
# Start the core services
docker compose up -d
# Start everything, including optional components
docker compose --profile full up -d
# Optional: Jaeger tracing
docker compose --profile jaeger up -d
# Optional: Neo4j (knowledge graph)
docker compose --profile neo4j up -d
# Optional: MinIO object storage
docker compose --profile minio up -d
# Profiles can be combined
docker compose --profile neo4j --profile minio up -d
./scripts/start_all.sh --stop
# Or
make stop-all
Once started, services will be available at:
Web UI: http://localhost
Backend API: http://localhost:8080
Jaeger tracing UI: http://localhost:16686

WeKnora serves as the core technology framework for the WeChat Dialog Open Platform, which provides a more convenient way to use it:
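To verify the services came up, a small connectivity check can be run against the default ports mentioned in this README (adjust them if you changed the configuration):

```python
# Quick connectivity check for the local WeKnora services.
# Ports are the defaults from this README; adjust if you changed them.
from urllib.request import urlopen
from urllib.error import URLError

def is_up(url: str, timeout: float = 3.0) -> bool:
    try:
        with urlopen(url, timeout=timeout) as resp:
            # Any HTTP response (even 4xx) means something is listening.
            return 200 <= resp.status < 500
    except (URLError, OSError):
        return False

for url in ("http://localhost", "http://localhost:8080", "http://localhost:16686"):
    print(url, "up" if is_up(url) else "down")
```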
git clone https://github.com/Tencent/WeKnora
It is recommended to directly refer to the MCP Configuration Guide for configuration.
Configure the MCP client to connect to the server:
{
  "mcpServers": {
    "weknora": {
      "command": "python",
      "args": [
        "path/to/WeKnora/mcp-server/run_server.py"
      ],
      "env": {
        "WEKNORA_API_KEY": "Your API key: in your WeKnora instance, open the browser developer tools and copy the x-api-key request header (it starts with sk-)",
        "WEKNORA_BASE_URL": "http(s)://your-weknora-address/api/v1"
      }
    }
  }
}
Run directly using stdio command:
pip install weknora-mcp-server
python -m weknora-mcp-server
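Outside of MCP, the same credentials can be used to call the WeKnora HTTP API directly. The x-api-key header and the /api/v1 base URL come from the MCP configuration above; the /knowledge-bases path below is a hypothetical example endpoint, not a documented one:

```python
# Sketch of an authenticated request to a WeKnora instance.
# The x-api-key header and /api/v1 base URL come from the MCP config above;
# the /knowledge-bases path is a hypothetical example, not a documented endpoint.
import json
import os
from urllib.request import Request, urlopen

base_url = os.environ.get("WEKNORA_BASE_URL", "http://localhost:8080/api/v1")
api_key = os.environ.get("WEKNORA_API_KEY", "sk-...")

def weknora_get(path: str) -> dict:
    req = Request(f"{base_url}{path}", headers={"x-api-key": api_key})
    with urlopen(req, timeout=10) as resp:
        return json.loads(resp.read())

# Example (hypothetical endpoint):
# print(weknora_get("/knowledge-bases"))
```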
To help users quickly configure models and reduce trial-and-error, the original configuration-file-based initialization has been improved with a Web UI for model configuration. Before using it, make sure your code is updated to the latest version. If this is your first time using the project, skip steps ①② and go directly to steps ③④:
./scripts/start_all.sh --stop
make clean-db
./scripts/start_all.sh
On your first visit, you will be automatically redirected to the registration/login page. After completing registration, please create a new knowledge base and finish the relevant settings on its configuration page.
Screenshots: Knowledge Base Management, Conversation Settings, and the Agent Mode tool call process.
Knowledge Base Management: Support for creating FAQ and document knowledge base types, with multiple import methods including drag-and-drop, folder import, and URL import. Automatically identifies document structures and extracts core knowledge to establish indexes. Supports tag management and online entry. The system clearly displays processing progress and document status, achieving efficient knowledge base management.
Agent Mode: Support for ReACT Agent mode that can use built-in tools to retrieve knowledge bases, call user-configured MCP tools and web search tools to access external services, providing comprehensive summary reports through multiple iterations and reflection. Supports cross-knowledge base retrieval, allowing selection of multiple knowledge bases for simultaneous retrieval.
Conversation Strategy: Support for configuring Agent models, normal mode models, retrieval thresholds, and online Prompt configuration, with precise control over multi-turn conversation behavior and retrieval execution methods. The conversation input box supports Agent mode/normal mode switching, enabling/disabling web search, and selecting conversation models.
WeKnora supports transforming documents into knowledge graphs, displaying the relationships between different sections of the documents. Once the knowledge graph feature is enabled, the system analyzes and constructs an internal semantic association network that not only helps users understand document content but also provides structured support for indexing and retrieval, enhancing the relevance and breadth of search results.
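A toy version of the idea (linking document chunks that share enough content words into an association graph) can be sketched as follows. This is a simplified illustration only, not WeKnora's actual GraphRAG construction:

```python
# Toy semantic association graph over document chunks: link two chunks when
# they share at least min_shared words. A simplified illustration of the idea,
# not WeKnora's actual GraphRAG construction.
from itertools import combinations

def build_graph(chunks: list[str], min_shared: int = 2) -> dict[int, set[int]]:
    words = [set(c.lower().split()) for c in chunks]
    graph: dict[int, set[int]] = {i: set() for i in range(len(chunks))}
    for i, j in combinations(range(len(chunks)), 2):
        if len(words[i] & words[j]) >= min_shared:
            graph[i].add(j)
            graph[j].add(i)  # undirected edge: the chunks are related
    return graph

chunks = [
    "vector index stores document embeddings",
    "the vector index speeds up retrieval",
    "contract clauses define legal obligations",
]
print(build_graph(chunks))
```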
For detailed configuration, please refer to the Knowledge Graph Configuration Guide.
Please refer to the MCP Configuration Guide for the necessary setup.
Common issues and solutions: Troubleshooting FAQ
Detailed API documentation is available at: API Docs
Product plans and upcoming features: Roadmap
If you need to frequently modify code, you don't need to rebuild Docker images every time! Use fast development mode:
# Method 1: Using Make commands (Recommended)
make dev-start # Start infrastructure
make dev-app # Start backend (new terminal)
make dev-frontend # Start frontend (new terminal)
# Method 2: One-click start
./scripts/quick-dev.sh
# Method 3: Using scripts
./scripts/dev.sh start # Start infrastructure
./scripts/dev.sh app # Start backend (new terminal)
./scripts/dev.sh frontend # Start frontend (new terminal)
Development advantage: code changes hot-reload without rebuilding Docker images.
Detailed Documentation: Development Environment Quick Start
WeKnora/
├── client/       # Go client
├── cmd/          # Main entry point
├── config/       # Configuration files
├── docker/       # Docker image files
├── docreader/    # Document parsing app
├── docs/         # Project documentation
├── frontend/     # Frontend app
├── internal/     # Core business logic
├── mcp-server/   # MCP server
├── migrations/   # DB migration scripts
└── scripts/      # Shell scripts
We welcome community contributions! For suggestions, bugs, or feature requests, please submit an Issue or directly create a Pull Request.
Create a feature branch (git checkout -b feature/amazing-feature), commit your changes (git commit -m 'Add amazing feature'), and push the branch (git push origin feature/amazing-feature). Format Go code with gofmt. Use the Conventional Commits standard:
feat: Add document batch upload functionality
fix: Resolve vector retrieval precision issue
docs: Update API documentation
test: Add retrieval engine test cases
refactor: Restructure document parsing module
Thanks to these excellent contributors:
This project is licensed under the MIT License. You are free to use, modify, and distribute the code with proper attribution.