English | 简体中文
A workspace for YOLO object detection and path-planning navigation on the Unitree G1 humanoid robot. Built on ROS2 Foxy + Python, featuring real-time object detection, 3D spatial localization, and autonomous navigation.
Tech Stack: Ubuntu 20.04 · ROS2 Foxy · Python 3.8+ · YOLOv11 · Nav2 · colcon
Core Capabilities:
One command, control everything from a browser:
# One-time setup (first run)
python3 -m venv ~/g1act_venv --system-site-packages
source ~/g1act_venv/bin/activate
cd ~/g1act_ws
pip install -r requirements.txt
colcon build --packages-select g1_yolo_nav_py
. install/setup.bash
# Every session
source ~/g1act_venv/bin/activate
cd ~/g1act_ws && . install/setup.bash
ros2 run g1_yolo_nav_py web_panel
Open http://<robot_ip>:8080 from any phone/tablet/laptop on the same network.
| Page | Functionality |
|---|---|
| 🎯 Grasp Task | Full pipeline: search → align → approach → grab → release (incl. turn-release / left-step-release) |
| 🔍 Detection View | Immersive YOLO detection video stream with realtime stats |
| 🕹️ Manual Control | D-Pad joystick (forward/back/left/right/stop) + speed slider |
| 📊 System Status | Dashboard (FPS / detections / distance / u position / health ring) + log stream |
| 📦 Node Manager | Start/stop RealSense camera / YOLO detector / RGBD capture with inline param editor |
| ⚙️ Settings | 19 hot-reloadable runtime params + dynamic background + UI prefs |
Advantages:
Want to tweak the frontend without logging into the robot?
cd src/web_frontend
pip install flask pillow
python dev_server.py # browse http://localhost:8080
dev_server.py is a pure-Flask mock backend that simulates all APIs and a synthetic MJPEG stream. Edit HTML/CSS/JS and refresh.
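For a feel of how the mock works, here is a stripped-down sketch in the same spirit (hypothetical and far smaller than the real dev_server.py, which simulates every panel API): one fake `/api/state` endpoint plus a synthetic MJPEG stream.

```python
# minimal_mock.py — hypothetical, stripped-down sketch of a dev_server.py-style
# mock backend; the real dev_server.py simulates all panel APIs.
import io
import time

from flask import Flask, Response, jsonify
from PIL import Image, ImageDraw

app = Flask(__name__)

@app.route("/api/state")
def state():
    # Fake global state, mimicking what the real backend would report.
    return jsonify({"fps": 15.0, "detections": 1, "distance_m": 1.2, "u": 0.03})

def mjpeg_frames():
    """Yield an endless synthetic MJPEG stream (one timestamped frame per tick)."""
    while True:
        img = Image.new("RGB", (640, 480), (40, 40, 48))
        ImageDraw.Draw(img).text((20, 20), time.strftime("%H:%M:%S"), fill="white")
        buf = io.BytesIO()
        img.save(buf, format="JPEG")
        yield b"--frame\r\nContent-Type: image/jpeg\r\n\r\n" + buf.getvalue() + b"\r\n"
        time.sleep(1 / 15)

@app.route("/stream/detection")
def stream():
    return Response(mjpeg_frames(), mimetype="multipart/x-mixed-replace; boundary=frame")

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8080)
```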
g1act_ws/
├── README.md # Project documentation (Chinese)
├── README.en.md # Project documentation (English)
├── CODEBUDDY.md # AI coding assistant guide
├── requirements.txt # Python dependencies
├── src/
│ ├── D455.md # D455 camera documentation
│ ├── web_frontend/ # Web control panel frontend (HTML/CSS/JS + local dev server)
│ │ ├── index.html # Main page (6 view modules)
│ │ ├── css/
│ │ │ ├── style.css # Liquid glass styles
│ │ │ └── lggc.css # LGGC glass utility class
│ │ ├── js/app.js # Routing / state polling / process mgmt / background
│ │ └── dev_server.py # Local mock backend (no ROS2 required)
│ ├── g1_yolo_nav_py/ # Main ROS2 Python package
│ │ ├── setup.py # Package setup (entry_points)
│ │ ├── package.xml # ROS2 package description
│ │ ├── yolo_v11x_best.pt # YOLOv11x custom trained model
│ │ ├── config/ # Parameter configuration files
│ │ │ └── yolo_nav.yaml
│ │ ├── launch/ # ROS2 launch files
│ │ │ ├── grasp_task.launch.py
│ │ │ └── yolo_nav.launch.py
│ │ ├── arm/ # Arm control scripts (unitree_sdk2py, separate process)
│ │ │ ├── arm_common.py # Shared module: joint constants, base class
│ │ │ ├── arm.py # Arm control demo
│ │ │ ├── armup.py # Grab action
│ │ │ └── armdown.py # Release action
│ │ └── g1_yolo_nav_py/ # Python module code
│ │ ├── web_panel.py # Web control panel node (Flask + ROS2) ★ recommended
│ │ ├── control_panel.py # tkinter GUI control panel (legacy)
│ │ ├── yolo_detector.py # YOLO detection node
│ │ ├── yaw_align.py # Step-wise yaw alignment
│ │ ├── loco_forward.py # Forward motion + depth-based stop
│ │ ├── grasp_task.py # Terminal grasp task
│ │ ├── rgbd_capture.py # RGBD data capture
│ │ ├── spatial_target.py # 2D→3D spatial projection
│ │ ├── detection_visualizer.py # Detection visualization
│ │ ├── _grasp_state.py # Grasp state machine mixin
│ │ ├── _detection_utils.py # Detection utilities
│ │ └── _dds_compat.py # DDS compatibility layer
│ └── base/ # Reference packages (not directly called)
│ ├── ctrl_keyboard/ # Loco API reference implementation
│ └── g1_description/ # URDF models (12/23/29 dof)
# Update commands
cd ~/g1act_ws/manact_ws
git pull
colcon build --packages-skip h1_description
# Activate virtual environment (every session)
cd ~/g1act_ws/manact_ws
source ~/g1act_venv/bin/activate
. install/setup.bash
| File | Description |
|---|---|
| yolo_v11x_best.pt | YOLOv11x custom trained model weights for object detection inference |
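The weights can be sanity-checked offline with the ultralytics package, no ROS2 required (test.jpg is a hypothetical sample image):

```python
# Offline sanity check for the custom weights (no ROS2 required).
from ultralytics import YOLO

model = YOLO("yolo_v11x_best.pt")   # weights file from the package root
results = model("test.jpg")         # hypothetical sample image
for box in results[0].boxes:
    print(model.names[int(box.cls)], float(box.conf))
```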
Lab Requirement: Each experimenter must use an independent Python virtual environment. Installing packages in the system Python is prohibited.
# 1) Create virtual environment (with access to system ROS2 / unitree_sdk2py)
python3 -m venv ~/g1act_venv --system-site-packages
# 2) Activate virtual environment
source ~/g1act_venv/bin/activate
# 3) Install project dependencies
cd ~/g1act_ws/manact_ws
pip install -r requirements.txt
# 4) Build ROS2 workspace
colcon build
# --- Every session ---
source ~/g1act_venv/bin/activate
cd ~/g1act_ws
. install/setup.bash
# 1. Launch D455 camera (Terminal 1)
ros2 launch realsense2_camera rs_launch.py camera_namespace:=robot1 camera_name:=D455_1
# 2. Build and launch YOLO detector (Terminal 2)
source ~/g1act_venv/bin/activate
colcon build --packages-select g1_yolo_nav_py && . install/setup.bash
ros2 run g1_yolo_nav_py yolo_detector
# 3. Launch visualizer (Terminal 3)
ros2 run g1_yolo_nav_py detection_visualizer
# 4. View detections (Terminal 4)
ros2 topic echo /g1/vision/detections
Custom Parameters:
# Detect persons
ros2 run g1_yolo_nav_py yolo_detector --ros-args -p target_classes:='["person"]'
# Custom model path
ros2 run g1_yolo_nav_py yolo_detector --ros-args -p model_path:=/path/to/model.pt
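For orientation, the core of such a detector node reduces to: subscribe to the color image, run the model, publish the detections. A minimal sketch follows; the real yolo_detector defines its own detection message and many more parameters, so a std_msgs/String JSON payload stands in here, and the image topic uses the package default:

```python
# Minimal sketch of a YOLO detector node. Simplified: the real yolo_detector
# publishes its own message type on /g1/vision/detections; String/JSON stands in.
import json

import rclpy
from cv_bridge import CvBridge
from rclpy.node import Node
from sensor_msgs.msg import Image
from std_msgs.msg import String
from ultralytics import YOLO


class MiniDetector(Node):
    def __init__(self):
        super().__init__("mini_detector")
        self.declare_parameter("model_path", "yolo_v11x_best.pt")
        self.model = YOLO(self.get_parameter("model_path").value)
        self.bridge = CvBridge()
        self.pub = self.create_publisher(String, "/g1/vision/detections", 10)
        # Package-default camera topic; adjust for your camera namespace.
        self.create_subscription(Image, "/D455_1/color/image_raw", self.on_image, 1)

    def on_image(self, msg):
        frame = self.bridge.imgmsg_to_cv2(msg, "bgr8")
        result = self.model(frame, verbose=False)[0]
        dets = [
            {"cls": self.model.names[int(b.cls)], "conf": float(b.conf),
             "xyxy": [float(v) for v in b.xyxy[0]]}
            for b in result.boxes
        ]
        self.pub.publish(String(data=json.dumps(dets)))


def main():
    rclpy.init()
    rclpy.spin(MiniDetector())


if __name__ == "__main__":
    main()
```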
Alignment and forward movement are independent nodes — they can be launched separately for debugging.
Verify: Can the robot rotate to keep the target centered in the camera view?
# Terminal 1: Camera
ros2 launch realsense2_camera rs_launch.py \
camera_namespace:=robot1 camera_name:=D455_1
# Terminal 2: YOLO detection
ros2 run g1_yolo_nav_py yolo_detector
# Terminal 3: Yaw alignment (Loco API SET_VELOCITY)
ros2 run g1_yolo_nav_py yaw_align
# Terminal 4 (optional): Visualizer
ros2 run g1_yolo_nav_py detection_visualizer
What to observe:
A log line like [对齐] 检测到目标: u=0.xxx, vyaw=x.xxx ("[Align] Target detected: u=..., vyaw=...") for each detection.
Parameter Tuning:
# Tracking too slow — increase kp
ros2 run g1_yolo_nav_py yaw_align --ros-args -p yaw_kp:=3.0
# Oscillating — decrease kp + widen tolerance
ros2 run g1_yolo_nav_py yaw_align --ros-args \
-p yaw_kp:=1.0 -p center_tolerance:=0.12
# Limit max rotation speed
ros2 run g1_yolo_nav_py yaw_align --ros-args -p max_yaw_speed:=0.3
| Parameter | Default | Description |
|---|---|---|
| yaw_kp | 2.0 | P gain: increase if slow, decrease if oscillating |
| center_tolerance | 0.08 | Centering tolerance (normalized); smaller = stricter |
| max_yaw_speed | 0.6 rad/s | Maximum rotation speed |
| lost_timeout | 1.0 s | Target lost timeout |
Pass criteria: Robot smoothly tracks target when it moves; stable when centered.
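The control law behind these parameters is an ordinary proportional controller on the normalized horizontal offset u. A sketch of the idea, not the node's exact code (the real node steps in discrete increments and also handles the lost-target timeout):

```python
def yaw_command(u, yaw_kp=2.0, center_tolerance=0.08, max_yaw_speed=0.6):
    """Map normalized horizontal offset u (target left/right of center)
    to a yaw velocity command. Sketch of the proportional law above."""
    if abs(u) <= center_tolerance:       # inside deadband: hold still
        return 0.0
    vyaw = -yaw_kp * u                   # rotate toward the target
    return max(-max_yaw_speed, min(max_yaw_speed, vyaw))
```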
Verify: Can the robot move forward and stop automatically when the target is centered?
# Terminal 1~3: Same as above (camera + YOLO + yaw_align)
# Terminal 4: Forward control (Loco API SET_VELOCITY)
ros2 run g1_yolo_nav_py loco_forward
What to observe:
Logs 开始前进 ("start moving forward") once the target is centered, then 到达目标! ("arrived at target!") and the robot stops.
Parameter Tuning:
# Use low speed for first test
ros2 run g1_yolo_nav_py loco_forward --ros-args -p forward_speed:=0.15
# Get closer before stopping
ros2 run g1_yolo_nav_py loco_forward --ros-args -p arrive_bbox_ratio:=0.6
| Parameter | Default | Description |
|---|---|---|
| forward_speed | 0.3 m/s | Forward speed (0.15 recommended for first test) |
| align_stable_time | 0.8 s | How long the target must stay centered before moving forward |
| arrive_bbox_ratio | 0.45 | Bbox ratio threshold for arrival |
| center_tolerance | 0.08 | Centering tolerance |
| lost_timeout | 1.0 s | Target lost timeout |
Pass criteria: Robot moves forward correctly, stops near target, stops immediately when target is lost.
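The arrival test behind arrive_bbox_ratio is equally simple: compare the bounding-box size to the frame. A sketch, assuming bbox height is the measure (the real node may also use depth, and adds the lost-target stop):

```python
def forward_command(bbox_h, frame_h, u, centered_for,
                    forward_speed=0.3, arrive_bbox_ratio=0.45,
                    center_tolerance=0.08, align_stable_time=0.8):
    """Return vx for one control tick (sketch of the logic above)."""
    if bbox_h / frame_h >= arrive_bbox_ratio:
        return 0.0                                   # arrived: stop
    if abs(u) <= center_tolerance and centered_for >= align_stable_time:
        return forward_speed                         # centered long enough: go
    return 0.0                                       # not aligned yet: wait
```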
Once alignment and forward control are verified:
# Option A: Launch separately (recommended for debugging)
# Terminal 1: Camera
# Terminal 2: ros2 run g1_yolo_nav_py yolo_detector
# Terminal 3: ros2 run g1_yolo_nav_py yaw_align
# Terminal 4: ros2 run g1_yolo_nav_py loco_forward
# Option B: Launch file
ros2 launch g1_yolo_nav_py yolo_nav.launch.py enable_approach:=true
Safety Tips:
Keep forward_speed ≤ 0.2 m/s.
web_panel is a Flask-based browser control panel that starts the entire robot stack from one process — no need for multiple ssh terminals.
# Start (first time: pip install flask pillow)
ros2 run g1_yolo_nav_py web_panel
# Open http://<robot_ip>:8080 in your browser
The "📦 Node Manager" page hosts 3 large cards:
| Process | Mode | Purpose |
|---|---|---|
| 📷 RealSense camera driver | ros2 launch realsense2_camera rs_launch.py | Vision frontend |
| 🔍 YOLO detector | ros2 run g1_yolo_nav_py yolo_detector | Object detection |
| 📦 RGBD capture | ros2 run g1_yolo_nav_py rgbd_capture | Periodic color+depth save |
Each card features:
camera_namespace / image_topic / interval_sec etc.; click Save to write them to the backend.
HTTP API:
GET / → Main page (HTML)
GET /stream/raw → MJPEG raw video stream
GET /stream/detection → MJPEG annotated detection stream
GET /api/state → Global state polling (incl. 3 process statuses)
GET /api/config → Read hot-reloadable config (19 fields)
POST /api/config → Bulk hot-reload
POST /api/cmd/manual → Manual teleop (vx, vy, vyaw clamped)
POST /api/cmd/stop → E-stop (clear FSM + sport.stop)
POST /api/cmd/search → Enter search state
POST /api/cmd/grab → Grab now (armup.py)
POST /api/cmd/putdown → Release (armdown.py)
POST /api/cmd/turn_putdown → Turn 90° then release
POST /api/cmd/left_putdown → Side-step left then release
POST /api/process/<n>/start → Start subprocess (n = camera/yolo/rgbd)
POST /api/process/<n>/stop → Stop subprocess
GET /api/process/<n>/status → Full status + 80 log lines
POST /api/process/<n>/params → Update params (effective on next start)
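All endpoints speak plain HTTP/JSON, so the panel can be scripted from any machine on the network. A sketch (the request field names for /api/cmd/manual and /api/config are assumptions; inspect the real /api/state and /api/config responses for the exact schema):

```python
# Script the web panel from a laptop. Field names in the POST bodies are
# assumptions — check the real API responses for the exact schema.
import requests

BASE = "http://192.168.123.164:8080"   # replace with your robot's IP

print(requests.get(f"{BASE}/api/state").json())           # poll global state
requests.post(f"{BASE}/api/cmd/manual",
              json={"vx": 0.1, "vy": 0.0, "vyaw": 0.0})   # nudge forward
requests.post(f"{BASE}/api/config",
              json={"forward_speed": 0.15})               # hot-reload a param
requests.post(f"{BASE}/api/cmd/stop")                     # e-stop
```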
No yaml editing or bash sourcing — tune directly in the browser:
| Group | Parameters |
|---|---|
| 🔍 Detection | target_class_id / use_depth_distance / stop_distance / depth_sample_radius / lost_timeout |
| 🎯 Alignment | step_yaw_speed / step_duration / camera_settle_time / max_consecutive_steps / center_tolerance |
| 🕹 Motion | forward_speed / arrive_bbox_ratio / align_stable_time / search_yaw_speed / turn_yaw_speed / turn_duration / side_step_speed / side_step_duration |
| 🎨 UI Prefs | stream_quality / default_view / log_toast / poll_interval (3 stored in localStorage) |
| 🖼 Background | Background type (default gradient / Bing daily / Picsum / custom URL) + mask + blur |
# Custom port
ros2 run g1_yolo_nav_py web_panel --ros-args -p http_port:=9090
# Custom target class + stream quality
ros2 run g1_yolo_nav_py web_panel --ros-args \
-p target_class_id:=bottle \
-p stream_quality:=50
# Custom image topic (different camera namespace)
ros2 run g1_yolo_nav_py web_panel --ros-args \
-p image_topic:=/camera/color/image_raw
Dependencies:
pip install flask pillow (first time)
tkinter-based GUI integrating detection visualization + manual teleop + one-click grasp task.
Features:
State Machine Flow:
IDLE → [Search] → SEARCHING (rotate to find target)
↓ Target detected
ALIGNING (yaw alignment to center)
↓ Centered & stable
APPROACHING (Loco API forward)
↓ Arrived at target
GRABBING (execute armup.py)
↓ Grasp complete
MENU (put down / turn & put down)
↓ Put down complete
IDLE
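Condensed to code, the transitions above look roughly like this (a sketch; the real mixin in _grasp_state.py adds timers, tolerances, and recovery paths):

```python
from enum import Enum, auto


class GraspState(Enum):
    IDLE = auto()
    SEARCHING = auto()
    ALIGNING = auto()
    APPROACHING = auto()
    GRABBING = auto()
    MENU = auto()


def next_state(state, *, search_requested, target_seen,
               centered, arrived, grabbed, put_down):
    """One transition step of the flow diagram above (simplified sketch)."""
    if state is GraspState.IDLE and search_requested:
        return GraspState.SEARCHING
    if state is GraspState.SEARCHING and target_seen:
        return GraspState.ALIGNING
    if state is GraspState.ALIGNING and centered:
        return GraspState.APPROACHING
    if state is GraspState.APPROACHING and arrived:
        return GraspState.GRABBING
    if state is GraspState.GRABBING and grabbed:
        return GraspState.MENU
    if state is GraspState.MENU and put_down:
        return GraspState.IDLE
    return state
```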
Running:
# Terminal 1: YOLO detection
ros2 run g1_yolo_nav_py yolo_detector
# Terminal 2: Control panel
ros2 run g1_yolo_nav_py control_panel
Note: The control panel does not import unitree_sdk2py (to avoid DDS conflicts). Motion control is handled via SportClient using the Loco API, publishing directly to /api/sport/request.
Parameters:
| Parameter | Default | Description |
|---|---|---|
| image_topic | /D455_1/color/image_raw | Camera image topic |
| detection_topic | /g1/vision/detections | Detection result topic |
| target_class_id | chair | Target class |
| forward_speed | 0.2 m/s | Forward speed |
| search_yaw_speed | 0.3 rad/s | Search rotation speed |
| yaw_kp | 2.0 | Yaw alignment P gain |
| arrive_bbox_ratio | 0.45 | Arrival detection threshold |
| arm_script_dir | ~/g1act_ws/manact_ws/src/g1_yolo_nav_py/arm | Arm script directory |
Dependency: pip install pillow
Automatically executes: object detection → yaw alignment → forward approach → grasp → interactive menu.
Prerequisites:
armup.py / armdown.py are in place (under the arm/ directory; they exit automatically on completion).
Manual Step-by-Step Launch (for debugging):
# Terminal 1: Camera
ros2 launch realsense2_camera rs_launch.py \
camera_namespace:=robot1 camera_name:=D455_1 align_depth.enable:=true
# Terminal 2: YOLO detection
ros2 run g1_yolo_nav_py yolo_detector
# Terminal 3: Grasp task (Loco API mode, no DDS required)
ros2 run g1_yolo_nav_py grasp_task
Execution Flow & Key Logs:
==================================================
G1 Grasp Task Started (Loco API mode)
Target class: chair
armup: ~/g1act_ws/manact_ws/src/g1_yolo_nav_py/arm/armup.py
armdown: ~/g1act_ws/manact_ws/src/g1_yolo_nav_py/arm/armdown.py
==================================================
[SEARCHING] Rotating to search for target...
[Detect] Target found: chair, confidence=95%, u=0.32
[State] SEARCHING → ALIGNING: Target found
[State] ALIGNING → APPROACHING: Target centered
[State] APPROACHING → GRABBING: Arrived at target! bbox=0.46 >= 0.45
[Grasp] Executing armup.py ...
[Grasp] armup.py completed
Interactive menu after grasp:
========================================
G1 Grasp Task — Action Menu
========================================
1. Put down object
2. Turn & put down object
3. Custom control (input xyz)
4. Exit
========================================
Select [1-4]:
| Option | Action | Description |
|---|---|---|
| 1 | Execute armdown.py | Put down object in place |
| 2 | Turn 90° + armdown.py | Turn around and put down |
| 3 | Input x y z to move | e.g. 0.2 0.0 0.3, input q to stop |
| 4 | Safe exit | Stop motion and exit program |
Parameter Tuning:
# Detect bottles, reduce speed
ros2 run g1_yolo_nav_py grasp_task --ros-args \
-p target_class_id:=bottle \
-p forward_speed:=0.15
# Custom arm script directory
ros2 run g1_yolo_nav_py grasp_task --ros-args \
-p arm_script_dir:=/home/unitree/my_arm_scripts
| Parameter | Default | Description |
|---|---|---|
| target_class_id | chair | YOLO detection target class |
| forward_speed | 0.2 m/s | Approach speed (0.15 recommended for first run) |
| arrive_bbox_ratio | 0.45 | Bbox ratio arrival threshold |
| search_yaw_speed | 0.3 rad/s | Search rotation speed |
| yaw_kp | 2.0 | Yaw alignment P gain |
| lost_timeout | 2.0 s | Target lost timeout |
| arm_script_dir | ~/g1act_ws/manact_ws/src/g1_yolo_nav_py/arm | Arm script directory |
Safety Tip: First run recommended with forward_speed:=0.15. Keep the emergency stop ready.
# Camera must be launched with depth alignment
ros2 launch realsense2_camera rs_launch.py \
camera_namespace:=robot1 camera_name:=D455_1 align_depth.enable:=true
# Run capture (default: 1 frame every 5s, 60s total)
ros2 run g1_yolo_nav_py rgbd_capture
# Custom parameters
ros2 run g1_yolo_nav_py rgbd_capture --ros-args \
-p interval_sec:=2.0 -p duration_sec:=30.0
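A minimal sketch of what such a capture node amounts to (topic names assume the robot1/D455_1 launch shown above; duration_sec and configurable output paths from the real rgbd_capture are omitted here):

```python
# Sketch of an rgbd_capture-style node: save one color+depth pair every
# interval_sec. Topic names assume the robot1/D455_1 launch shown above.
import time

import cv2
import rclpy
from cv_bridge import CvBridge
from rclpy.node import Node
from sensor_msgs.msg import Image


class MiniRGBDCapture(Node):
    def __init__(self):
        super().__init__("mini_rgbd_capture")
        self.declare_parameter("interval_sec", 5.0)
        self.bridge = CvBridge()
        self.last_save = 0.0
        self.depth = None
        self.create_subscription(
            Image, "/robot1/D455_1/aligned_depth_to_color/image_raw", self.on_depth, 1)
        self.create_subscription(
            Image, "/robot1/D455_1/color/image_raw", self.on_color, 1)

    def on_depth(self, msg):
        self.depth = self.bridge.imgmsg_to_cv2(msg)  # 16UC1, millimeters

    def on_color(self, msg):
        interval = self.get_parameter("interval_sec").value
        now = time.time()
        if self.depth is None or now - self.last_save < interval:
            return
        stamp = int(now)
        cv2.imwrite(f"color_{stamp}.png", self.bridge.imgmsg_to_cv2(msg, "bgr8"))
        cv2.imwrite(f"depth_{stamp}.png", self.depth)
        self.last_save = now


def main():
    rclpy.init()
    rclpy.spin(MiniRGBDCapture())


if __name__ == "__main__":
    main()
```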
All motion control nodes use the Loco API (7xxx series), wrapped via SportClient and published to /api/sport/request.
| API | ID | Purpose | Parameter Format |
|---|---|---|---|
| SET_FSM_ID | 7101 | FSM state control | {"data": fsm_id} |
| SET_VELOCITY | 7105 | Velocity control | {"velocity": [vx, vy, vyaw], "duration": t} |
| SET_BALANCE_MODE | 7102 | Balance mode | {"data": mode} |
| SET_SPEED_MODE | 7107 | Speed mode | {"data": mode} |
| SET_STAND_HEIGHT | 7104 | Stand height | {"data": height} |
FSM State IDs:
| ID | State | Description |
|---|---|---|
| 0 | ZERO_TORQUE | Zero torque mode (initial sitting pose) |
| 1 | DAMP | Damping mode (prepare to stand) |
| 3 | SIT | Sitting |
| 4 | STAND_UP | Locked standing |
| 500 | START | Normal locomotion |
| 801 | WALK_RUN | Walk/run locomotion |
Balance Modes:
| ID | Mode | Description |
|---|---|---|
| 0 | BALANCE_STAND | Balance stand (stops stepping when velocity=0) |
| 1 | CONTINUOUS_GAIT | Continuous gait (keeps stepping) |
FSM Initialization Sequence:
SET_FSM_ID(DAMP=1) → SET_FSM_ID(STAND_UP=4) → SET_FSM_ID(WALK_RUN=801) → SET_BALANCE_MODE(CONTINUOUS_GAIT=1)
SET_VELOCITY(7105) only takes effect under WALK_RUN + CONTINUOUS_GAIT mode.
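For concreteness, the payloads from the tables above can be composed as follows. This is a formatting sketch only: the api_id/parameter field names follow the common Unitree request convention and are an assumption here; actually sending the requests goes through SportClient as described above.

```python
import json

# Loco API IDs from the table above.
SET_FSM_ID, SET_BALANCE_MODE, SET_VELOCITY = 7101, 7102, 7105

def loco_request(api_id, parameter):
    """Format one Loco API request body (field names are an assumption)."""
    return {"api_id": api_id, "parameter": json.dumps(parameter)}

# FSM initialization sequence, then a velocity command:
init = [
    loco_request(SET_FSM_ID, {"data": 1}),        # DAMP
    loco_request(SET_FSM_ID, {"data": 4}),        # STAND_UP
    loco_request(SET_FSM_ID, {"data": 801}),      # WALK_RUN
    loco_request(SET_BALANCE_MODE, {"data": 1}),  # CONTINUOUS_GAIT
]
walk = loco_request(SET_VELOCITY, {"velocity": [0.2, 0.0, 0.0], "duration": 1.0})
```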
Note: The motion control implementation references the verified ctrl_keyboard package, using the same API system.