An automated code security audit system powered by large language models (LLMs), helping developers quickly identify security vulnerabilities in code and providing remediation suggestions.
This project enforces three inviolable "iron rules" on AI audit behavior, designed to eliminate hallucination and fabrication so that every vulnerability report is evidence-based:
| Rule | Prohibited Behavior | Correct Approach | Violation Consequence |
|---|---|---|---|
| Rule 1: No Guessing File Paths | Referencing file paths from memory or speculation | Only reference code content actually provided to the AI | All analysis based on non-existent files is invalid |
| Rule 2: No Fabricating Code Snippets | Describing code from impression, or referencing code not actually seen | Must reference actual line numbers and content from provided code | All vulnerability analysis based on fabricated code is invalid |
| Rule 3: No Reporting Vulnerabilities in Unseen Code | Reporting vulnerabilities without actually seeing the code | See code → Analyze code → Then report vulnerabilities | Vulnerability reports for unseen code are directly marked invalid |
Violating any iron rule will invalidate the audit results.
Design motivation: Large language models are prone to "hallucinations" in code audit scenarios — fabricating non-existent file paths, inventing code snippets that never appeared, or asserting vulnerabilities in files they never read. These three iron rules constrain AI behavior at the highest priority, anchoring the audit process to actual code evidence, thereby ensuring report credibility and traceability.
Code defect exists ≠ Vulnerability is exploitable
The system requires the AI to verify the following 9 dimensions based on actual code before reporting each vulnerability:
| # | Verification Dimension | Description |
|---|---|---|
| 1 | Defect Authenticity | Whether the code defect truly exists, whether overlooked upstream protections exist |
| 2 | Path Reachability | Whether the code path is reachable (excluding dead code, legacy code, unsatisfied conditions) |
| 3 | Input Reachability | Whether user input can actually reach the danger point |
| 4 | Practical Exploitability | Whether an attacker can exploit it in a real environment |
| 5 | Systematic Design | Whether the pattern is a systematic framework design rather than an individual oversight |
| 6 | Source Type | Whether the source is external user input rather than trusted server-side code |
| 7 | Self-Attack Test | Whether the privileges required to trigger the issue already exceed what the vulnerability itself would grant |
| 8 | Design Intent | Whether the behavior is the framework's intended design rather than a defect |
| 9 | Runtime Feasibility | Whether the theoretical attack is feasible in the actual runtime environment |
Incorrectly labeling secure code as a "vulnerability" is misleading. Accuracy over quantity — one accurate vulnerability report is far better than ten false positives.
Every potential vulnerability detected by AI must pass four progressive rounds of challenge verification before being written to the final report. Failure in any round affects the final determination:
```
Potential Vuln → Round 0 → Round 1 → Round 2 → Round 3 → Final Verdict
                    │          │          │          │
                    ▼          ▼          ▼          ▼
              Reachability  Code Logic  Data Flow  Exploitability
```
Detailed rules for each round:
| Round | Name | Challenge Question | Pass Condition | Elimination Condition |
|---|---|---|---|---|
| Round 0 | Reachability & Design Intent | Is the code path reachable? | Path reachable and not design behavior | Dead code / Legacy code / Intended design behavior |
| Round 1 | Code Logic Challenge | Does the dangerous code pattern truly exist? | Confidence is MEDIUM or HIGH | Confidence is LOW → Direct elimination |
| Round 2 | Data Flow Challenge | Can user input reach the danger point? | Clear data flow description exists (>10 chars) | Data flow unclear or non-existent |
| Round 3 | Exploitability Challenge | Can an attacker construct an effective attack? | Confidence HIGH or severity CRITICAL | Insufficient confidence for regular vulns (composite vulns have exemption rules) |
Final verdict criteria:
| Rounds Passed | Verdict Status | Action |
|---|---|---|
| 4/4 passed | passed — Confirmed vulnerability | Written to final report |
| 2-3/4 passed | partial — Under observation | Written to report with pending verification note |
| 0-1/4 passed | failed — False positive | Filtered out, excluded from final report |
Composite function vulnerabilities (involving cross-file data flows) enjoy special exemption rules in Round 3 — they pass as long as confidence is not LOW, because cross-file vulnerabilities are inherently harder to confirm but often more damaging.
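The verdict table above reduces to a small scoring function. The sketch below is an illustrative reconstruction, not the project's actual implementation; the name `finalVerdict` and the boolean-array input are assumptions.

```javascript
// Illustrative sketch of the final-verdict rules (not the actual implementation).
// `rounds` is an array of four booleans: did the vuln pass Rounds 0-3?
function finalVerdict(rounds) {
  const passed = rounds.filter(Boolean).length;
  if (passed === 4) return 'passed';  // confirmed → written to final report
  if (passed >= 2) return 'partial';  // under observation → report with note
  return 'failed';                    // false positive → filtered out
}

console.log(finalVerdict([true, true, true, true]));   // → "passed"
console.log(finalVerdict([true, true, false, false])); // → "partial"
```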
| Vulnerability Type | CWE ID | Default Level | Description |
|---|---|---|---|
| SQL Injection | CWE-89 | HIGH | Detects SQL concatenation vulnerabilities, distinguishes parameterized queries (safe) from string concatenation (dangerous) |
| XSS (Cross-Site Scripting) | CWE-79 | HIGH | Detects reflected, stored, and DOM-based XSS, covers innerHTML/v-html/dangerouslySetInnerHTML |
| Hardcoded Secrets | CWE-798 | HIGH | Detects API Keys/Tokens/Passwords/Private Keys/Connection Strings, auto-excludes placeholders and env variable references |
| Command Injection | CWE-78 | CRITICAL | Detects dangerous calls like eval/exec/system/child_process |
| Path Traversal | CWE-22 | HIGH | Detects file path manipulation vulnerabilities |
| SSRF | CWE-918 | HIGH | Detects Server-Side Request Forgery vulnerabilities |
| Insecure Deserialization | CWE-502 | HIGH | Detects pickle.loads/ObjectInputStream/unserialize, etc. |
| Authentication Flaws | CWE-287 | HIGH | Detects authentication/authorization implementation flaws |
| Sensitive Data Exposure | CWE-200 | MEDIUM | Detects error message leakage, sensitive data in logs |
| XXE | CWE-611 | HIGH | Detects XML External Entity injection |
| Insecure Randomness | CWE-330 | LOW | Detects weak random number generators in security contexts |
| Prototype Pollution | CWE-1321 | HIGH | Detects JavaScript prototype chain pollution |
| CSRF | CWE-352 | MEDIUM | Detects Cross-Site Request Forgery |
| IDOR | CWE-639 | MEDIUM | Detects Insecure Direct Object References |
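As an illustration of the distinction the SQL injection check draws (the table's first row), the following contrasts string concatenation with a parameterized query. The query shapes and function names are generic examples, not code from this project.

```javascript
// DANGEROUS: string concatenation — user input becomes part of the SQL text.
function unsafeQuery(username) {
  return "SELECT * FROM users WHERE name = '" + username + "'";
}

// SAFE: parameterized query — the value travels separately from the SQL text,
// so the driver can never interpret it as SQL. (Generic placeholder syntax.)
function safeQuery(username) {
  return { sql: 'SELECT * FROM users WHERE name = ?', params: [username] };
}

// A classic payload turns the unsafe version into a tautology:
const q = unsafeQuery("' OR '1'='1");
// q is now: SELECT * FROM users WHERE name = '' OR '1'='1'
```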
The system pays special attention to cross-function, cross-file composite security issues:
| Pattern | Description |
|---|---|
| Cross-Function Data Flow Taint | Function A receives user input without sanitization → passes to Function B → Function B uses it in dangerous operations |
| Privilege Escalation Chain | Normal user modifies state via Function A → bypasses Function B's permission checks |
| Race Condition (TOCTOU) | Function A checks permissions → Function B modifies data after check but before operation |
| Error Handling Leak Chain | Function A's exception is caught by Function B → Function B returns error details to client |
| Auth/AuthZ Bypass Combo | Certain function call combinations skip intermediate authentication/authorization checks |
| Prototype Pollution Propagation | Object merge in Function A is polluted → affects Function B's logic decisions |
| Second-Order Injection | Function A stores unsanitized user input to database → Function B reads and uses it in dangerous operations |
| Callback/Event-Driven Vulnerability | Data passing between event handler functions lacks validation |
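The second-order injection pattern from the table can be sketched as two functions separated by a storage hop. The in-memory `db` map stands in for a real database, and all names here are hypothetical.

```javascript
// Illustrative sketch of second-order injection (all names are hypothetical).
const db = new Map(); // stands in for a real database

// Function A: stores user input WITHOUT sanitization — looks harmless locally.
function saveNickname(userId, nickname) {
  db.set(userId, nickname);
}

// Function B: later reads the stored value and concatenates it into a query —
// the injection only materializes here, one hop away from the tainted source.
function buildGreetingQuery(userId) {
  const nickname = db.get(userId);
  return "SELECT banner FROM greetings WHERE nick = '" + nickname + "'";
}

saveNickname(1, "x'; DROP TABLE greetings; --");
const tainted = buildGreetingQuery(1);
// Neither function is obviously wrong in isolation; the composite flow is.
```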
```
┌──────────────────────────────────────────────────────┐
│                  Nginx (Port 80)                     │
│          Static Assets + API Reverse Proxy           │
├────────────────────┬─────────────────────────────────┤
│  React SPA         │  Express API (Port 3001)        │
│  TypeScript + Vite │  Node.js 20 + MongoDB Driver    │
│  Tailwind + DaisyUI│  AI Model Calls (OpenAI compat.)│
└────────────────────┴──────────┬──────────────────────┘
                                │
                       ┌────────┴────────┐
                       │    MongoDB 7    │
                       │  Data Storage   │
                       └─────────────────┘
```
| Layer | Technology |
|---|---|
| Frontend Framework | React 19 + TypeScript |
| Build Tool | Vite 6 |
| UI Styling | Tailwind CSS 3 + DaisyUI 4 |
| Routing | React Router 6 |
| Backend | Node.js 20 + Express 4 |
| Database | MongoDB 7 |
| AI Models | OpenAI-compatible API (Claude / GPT / DeepSeek / Qwen / Hunyuan, etc.) |
| Deployment | Docker Compose (Nginx + Node.js + MongoDB) |
```
AI_code_review_agent/
├── src/                            # Frontend source code
│   ├── components/                 # Reusable components
│   │   ├── FileUpload.tsx          # File upload (ZIP drag & drop)
│   │   ├── TaskProgress.tsx        # Task progress & real-time logs
│   │   ├── ReportViewer.tsx        # Audit report viewer
│   │   ├── VulnerabilityCard.tsx   # Vulnerability detail card
│   │   ├── CodeHighlight.tsx       # Code syntax highlighting
│   │   ├── Navbar.tsx              # Top navigation bar
│   │   └── Footer.tsx              # Footer
│   ├── pages/                      # Page components
│   │   ├── HomePage.tsx            # Home page (upload entry)
│   │   ├── TaskPage.tsx            # Task details (progress/logs/report)
│   │   ├── HistoryPage.tsx         # Audit history
│   │   └── SettingsPage.tsx        # Model configuration management
│   ├── types/audit.ts              # TypeScript type definitions
│   ├── utils/api.ts                # API request utilities
│   ├── App.tsx                     # App routing entry
│   └── index.css                   # Global styles
├── server/                         # Backend API service
│   ├── src/
│   │   ├── index.js                # Express entry + route mounting
│   │   ├── routes/
│   │   │   ├── tasks.js            # Task CRUD + report data
│   │   │   ├── audit.js            # Trigger security audit
│   │   │   ├── report.js           # Trigger report generation
│   │   │   └── model-configs.js    # Model configuration management
│   │   ├── services/
│   │   │   ├── ai.js               # AI model calls (OpenAI compatible)
│   │   │   ├── analyzeCode.js      # ZIP extraction + code chunking
│   │   │   ├── securityAudit.js    # Audit engine (concurrency/retry/resume)
│   │   │   └── generateReport.js   # Report generation (Markdown + JSON)
│   │   └── utils/db.js             # MongoDB connection & indexes
│   ├── Dockerfile                  # Backend container config
│   ├── .env.example                # Environment variable template
│   └── package.json
├── shared/prompts/                 # AI audit prompt templates
├── docker-compose.yml              # Docker Compose orchestration
├── Dockerfile                      # Frontend multi-stage build (Vite → Nginx)
├── nginx.conf                      # Nginx reverse proxy config
├── .env                            # Frontend environment variables
└── package.json                    # Frontend dependencies
```
```shell
# Clone the project
git clone <repo-url>
cd AI_code_review_agent

# Build and start all services
docker compose up -d

# View logs
docker compose logs -f
```
After startup, visit http://localhost:8080
Custom port:
```shell
APP_PORT=3000 docker compose up -d
```
Stop services:
```shell
docker compose down
```
For development scenarios where you modify code and need hot-reload debugging.
1. Start MongoDB

```shell
docker run -d --name mongo -p 27017:27017 mongo:7
```

2. Start Backend

```shell
cd server
npm install

# Create .env (or copy .env.example)
cat > .env << EOF
MONGODB_URI=mongodb://localhost:27017/ai_code_review
PORT=3001
DATA_DIR=./data
EOF

npm run dev
```
3. Start Frontend (new terminal)

```shell
# Return to project root
npm install
npm run dev
```
The frontend dev server starts at http://localhost:5173 by default, with API requests automatically proxied to backend localhost:3001.
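The dev-server proxy described above is typically configured through Vite's `server.proxy` option. The fragment below is an illustrative sketch; the project's actual Vite config may differ.

```javascript
// Illustrative Vite dev-server proxy (the project's actual config may differ).
// Forwards /api/* requests from the Vite dev server to the Express backend.
export default {
  server: {
    proxy: {
      '/api': {
        target: 'http://localhost:3001',
        changeOrigin: true,
      },
    },
  },
};
```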
Configure an AI model on the Settings page before first use.
View all audit tasks on the "History" page, with pagination and deletion support.
Frontend (.env):
| Variable | Default | Description |
|---|---|---|
| VITE_API_BASE_URL | /api | API request path prefix |
Backend (server/.env):
| Variable | Default | Description |
|---|---|---|
| MONGODB_URI | mongodb://mongo:27017/ai_code_review | MongoDB connection URI |
| PORT | 3001 | Backend service port |
| DATA_DIR | /app/data | Upload files and report storage path |
Docker Compose:
| Variable | Default | Description |
|---|---|---|
| APP_PORT | 8080 | Externally exposed access port |
| Parameter | Value | Description |
|---|---|---|
| Batch Concurrency | 2 | Number of code chunks audited simultaneously per batch |
| Single Run Limit | 100 chunks | Maximum code chunks per single execution |
| AI Request Timeout | 150s (increments to 210s) | Initial 150s, +30s per retry |
| Max Retries | 2 | Retry count after individual chunk failure |
| Large Chunk Threshold | 120 lines | Auto-split chunks exceeding this line count |
| Safe Exit Threshold | 540s | Saves progress on timeout for later continuation |
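The timeout and retry parameters above imply a simple escalation schedule, sketched here. This is an illustrative reconstruction; the function name and attempt numbering are assumptions.

```javascript
// Illustrative sketch of the timeout escalation implied by the table above.
const BASE_TIMEOUT_MS = 150000; // 150s initial AI request timeout
const STEP_MS = 30000;          // +30s per retry
const MAX_RETRIES = 2;          // retries after an individual chunk fails

// attempt 0 is the first try; attempts 1..MAX_RETRIES are retries.
function timeoutForAttempt(attempt) {
  return BASE_TIMEOUT_MS + Math.min(attempt, MAX_RETRIES) * STEP_MS;
}

console.log(timeoutForAttempt(0) / 1000); // → 150
console.log(timeoutForAttempt(2) / 1000); // → 210
```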
10 built-in preset templates, plus support for any OpenAI API-compatible model:
| Model | Description |
|---|---|
| GPT-4o | OpenAI flagship model |
| GPT-4o Mini | OpenAI lightweight model |
| Claude Opus 4 | Anthropic flagship model |
| Claude Sonnet 4 | Anthropic cost-effective model |
| DeepSeek V3 | DeepSeek general-purpose model |
| DeepSeek R1 | DeepSeek reasoning model |
| Qwen Max | Alibaba Qwen flagship model |
| Hunyuan Turbo | Tencent Hunyuan high-performance reasoning model |
| Hunyuan Pro | Tencent Hunyuan best-effect model |
| Custom Model | Any endpoint compatible with OpenAI Chat Completions API |
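All of the presets above use the same OpenAI-compatible Chat Completions request shape. The sketch below builds such a request as a plain object; it follows the public OpenAI API's field names, not this project's internal code, and the URLs and helper name are hypothetical.

```javascript
// Builds an OpenAI-compatible Chat Completions request (illustrative sketch).
// Any preset model works as long as baseUrl/apiKey/model match its provider.
function buildChatRequest({ baseUrl, apiKey, model }, messages) {
  return {
    url: `${baseUrl.replace(/\/$/, '')}/chat/completions`,
    headers: {
      'Content-Type': 'application/json',
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({ model, messages }),
  };
}

const req = buildChatRequest(
  { baseUrl: 'https://api.example.com/v1', apiKey: 'sk-...', model: 'gpt-4o' },
  [{ role: 'user', content: 'Audit this code chunk.' }]
);
// req.url → "https://api.example.com/v1/chat/completions"
```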
| Collection | Purpose |
|---|---|
| audit_tasks | Audit task info (status, file paths, tech stack, etc.) |
| audit_results | Audit results (vulnerability lists stored per file) |
| audit_logs | Audit logs (real-time progress, thinking process, etc.) |
| audit_code_files | Extracted code chunks |
| audit_vulnerabilities | Temporary vulnerability data (cleaned after report generation) |
| model_configs | AI model configurations |
```shell
# Docker operations
docker compose up -d              # Start services
docker compose down               # Stop services
docker compose logs -f            # View logs
docker compose up -d --build      # Rebuild and start

# Local development
npm run dev                       # Start frontend dev server
npm run build                     # Build frontend
npm run lint                      # ESLint check
npm run format                    # Prettier formatting
cd server && npm run dev          # Start backend dev server
```
This project is licensed under the Apache License 2.0.