Road Surface Structure Perception

Overview

The parking_perception package is an example road surface structure perception algorithm built on the hobot_dnn package. It runs model inference on the BPU to produce the algorithm results.

The package can subscribe directly to topics of type sensor_msgs/msg/Image, or run inference on images read from local storage. It publishes the algorithm results on a topic, renders them for viewing on a web page, and can save the rendered images to the result directory under the directory where the program runs.

Supported object detection categories:

| Category | Description |
| ------------ | ------ |
| cyclist | Cyclist |
| person | Pedestrian |
| rear | Vehicle rear |
| vehicle | Vehicle |
| parking_lock | Parking lock |

Supported semantic segmentation categories:

| Category | Description |
| ------------- | -------- |
| road | Road |
| background | Background |
| lane_marking | Lane marking |
| sign_line | Sign line |
| parking_lane | Parking lane |
| parking_space | Parking space |
| parking_rod | Parking rod |
| parking_lock | Parking lock |

Code repository: https://github.com/D-Robotics/parking_perception.git

Application scenario: The outdoor parking area detection algorithm uses semantic segmentation to identify parking areas in images, which supports automatic parking. It is mainly used in autonomous driving.

Parking space search case: Parking Space Search

Supported Platforms

| Platform | Runtime Environment | Example Functionality |
| --------------------- | ------------ | ------------------------------------------------------------ |
| RDK X3, RDK X3 Module | Ubuntu 20.04 (Foxy), Ubuntu 22.04 (Humble) | Start a MIPI camera, USB camera, or local image feedback; display the rendered inference results on a web page or save them locally |
| X86 | Ubuntu 20.04 (Foxy) | Start local image feedback; display the rendered inference results on a web page or save them locally |

Algorithm Information

| Model | Platform | Input Size | Inference Frame Rate (fps) |
| ------------------ | -------- | ----------- | ------ |
| parking_perception | X3 | 1x3x640x320 | 103.52 |

Preparation

RDK Platform

The RDK has been flashed with the Ubuntu system image.
TogetheROS.Bot (tros.b) has been installed on the RDK.

X86 Platform

The X86 environment is configured with an Ubuntu 20.04 system image.
tros.b has been installed in the X86 environment.

Usage

The package publishes algorithm messages containing semantic segmentation and object detection information. Users can subscribe to the published messages for application development.
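Before writing a subscriber, it can help to look at the published messages with the standard ros2 command-line tools. The following is a minimal sketch, assuming the default topic name ai_msg_parking_perception reported in the startup log under Result Analysis and a terminal where the tros.b environment is already sourced while the package is running. The helper name inspect_perception_topic and the 5-second sampling window are illustrative choices, not part of the package.

```shell
#!/bin/sh
# Sketch: inspect the algorithm topic with the standard ros2 CLI.
# Assumption: the topic name is the default one printed in the startup log.
PERCEPTION_TOPIC="${PERCEPTION_TOPIC:-/ai_msg_parking_perception}"

# Hypothetical helper (not part of parking_perception).
inspect_perception_topic() {
    topic="${1:-$PERCEPTION_TOPIC}"
    if ! command -v ros2 >/dev/null 2>&1; then
        echo "ros2 not found: source the tros.b setup script first" >&2
        return 127
    fi
    # Show the message type plus publisher and subscriber counts.
    ros2 topic info "$topic" || return 1
    # Print incoming messages for 5 seconds, then stop.
    timeout 5 ros2 topic echo "$topic" || true
    return 0
}

# Run only when ros2 is available, so the script is harmless elsewhere.
if command -v ros2 >/dev/null 2>&1; then
    inspect_perception_topic
fi
```

If the topic has been remapped, pass the new name as the first argument or set PERCEPTION_TOPIC before running the script.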

RDK Platform

Publish Images with MIPI Camera

Foxy:

# Configure the tros.b environment

source /opt/tros/setup.bash

Humble:

# Configure the tros.b environment

source /opt/tros/humble/setup.bash

Then, for either distribution:

# Copy the configuration files required to run the example from the tros.b installation path.

cp -r /opt/tros/${TROS_DISTRO}/lib/parking_perception/config/ .

# Configure the MIPI camera

export CAM_TYPE=mipi

# Start the launch file

ros2 launch parking_perception parking_perception.launch.py

Publish Images with USB Camera

Foxy:

# Configure the tros.b environment

source /opt/tros/setup.bash

Humble:

# Configure the tros.b environment

source /opt/tros/humble/setup.bash

Then, for either distribution:

# Copy the configuration files required to run the example from the tros.b installation path.

cp -r /opt/tros/${TROS_DISTRO}/lib/parking_perception/config/ .

# Configure the USB camera

export CAM_TYPE=usb

# Start the launch file

ros2 launch parking_perception parking_perception.launch.py

Use Single Feedback Image

Foxy:

# Configure the tros.b environment

source /opt/tros/setup.bash

Humble:

# Configure the tros.b environment

source /opt/tros/humble/setup.bash

Then, for either distribution:

# Copy the configuration files required to run the example from the tros.b installation path.

cp -r /opt/tros/${TROS_DISTRO}/lib/parking_perception/config/ .

# Configure the feedback image

export CAM_TYPE=fb

# Start the launch file

ros2 launch parking_perception parking_perception.launch.py

X86 Platform

Use Single Feedback Image

# Configure tros.b environment

source /opt/tros/setup.bash

# Copy the configuration files required to run the example from the tros.b installation path.

cp -r /opt/tros/${TROS_DISTRO}/lib/parking_perception/config/ .

# Configure feedback image

export CAM_TYPE=fb

# Start the launch file

ros2 launch parking_perception parking_perception.launch.py
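The sequences above differ only in the setup script that is sourced and in the value of CAM_TYPE, so a small wrapper can catch an unsupported combination before launching. The sketch below assumes the combinations listed under Supported Platforms (MIPI, USB, and feedback on RDK; feedback only on X86). The helper names tros_setup_script and cam_type_supported are hypothetical and are not shipped with tros.b.

```shell
#!/bin/sh
# Sketch: derive the setup script and validate CAM_TYPE before launching.
# Helper names are hypothetical; paths and values come from the steps above.

# Print the tros.b setup script for a ROS 2 distribution.
tros_setup_script() {
    case "$1" in
        foxy)   echo "/opt/tros/setup.bash" ;;
        humble) echo "/opt/tros/humble/setup.bash" ;;
        *)      echo "unsupported distro: $1" >&2; return 1 ;;
    esac
}

# Succeed only for the image sources listed under Supported Platforms.
# platform: rdk or x86; cam_type: mipi, usb, or fb.
cam_type_supported() {
    case "$1:$2" in
        rdk:mipi|rdk:usb|rdk:fb|x86:fb) return 0 ;;
        *) return 1 ;;
    esac
}

# Example: print the commands for an RDK running Humble with a USB camera.
if cam_type_supported rdk usb; then
    echo "source $(tros_setup_script humble)"
    echo 'cp -r /opt/tros/${TROS_DISTRO}/lib/parking_perception/config/ .'
    echo "export CAM_TYPE=usb"
    echo "ros2 launch parking_perception parking_perception.launch.py"
fi
```

The example only prints the commands instead of running them, which keeps the sketch safe to try on a machine without tros.b installed.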

Result Analysis

Publish Images with MIPI Camera

After the package initializes, the terminal prints the following output:

[INFO] [launch]: All log files can be found below /root/.ros/log/2022-08-02-06-46-55-605266-ubuntu-3669

[INFO] [launch]: Default logging verbosity is set to INFO

[INFO] [mipi_cam-1]: process started with pid [3671]

[INFO] [hobot_codec_republish-2]: process started with pid [3673]

[INFO] [parking_perception-3]: process started with pid [3675]

[INFO] [websocket-4]: process started with pid [3677]

[parking_perception-3] [WARN] [1659394017.194211788] [parking_perception]: Parameter:

[parking_perception-3] shared_men:1

[parking_perception-3]  is_sync_mode_: 1

[parking_perception-3]  model_file_name_: config/parking_perception_640x320.bin

[parking_perception-3] feed_image:

[parking_perception-3] [INFO] [1659394017.194695288] [dnn]: Node init.

[parking_perception-3] [INFO] [1659394017.194784038] [parking_perception]: Set node para.

[parking_perception-3] [INFO] [1659394017.194845413] [dnn]: Model init.

[parking_perception-3] [BPU_PLAT]BPU Platform Version(1.3.1)!

[parking_perception-3] [C][3675][08-02][06:46:57:202][configuration.cpp:49][EasyDNN]EasyDNN version: 0.4.11

[parking_perception-3] [HBRT] set log level as 0. version = 3.14.5

[parking_perception-3] [DNN] Runtime version = 1.9.7_(3.14.5 HBRT)

[parking_perception-3] [INFO] [1659394017.247423580] [dnn]: The model input 0 width is 640 and height is 320

[parking_perception-3] [INFO] [1659394017.247664997] [dnn]: Task init.

[parking_perception-3] [INFO] [1659394017.255848788] [dnn]: Set task_num [2]

[parking_perception-3] [INFO] [1659394017.255999663] [parking_perception]: The model input width is 640 and height is 320

[parking_perception-3] [INFO] [1659394017.263431163] [parking_perception]: msg_pub_topic_name: ai_msg_parking_perception

[parking_perception-3] [INFO] [1659394017.263554788] [parking_perception]: Detect images that use subscriptions

[parking_perception-3] [WARN] [1659394017.263597997] [parking_perception]: Create hbmem_subscription with topic_name: /hbmem_img

[parking_perception-3] [WARN] [1659394017.267204163] [parking_perception]: start success!!!

[parking_perception-3] [WARN] [1662036456.219133588] [parking_perception]: input fps: 29.73, out fps: 29.79

[parking_perception-3] [WARN] [1662036457.228303881] [parking_perception]: input fps: 29.73, out fps: 29.73

[parking_perception-3] [WARN] [1662036458.237841548] [parking_perception]: input fps: 29.73, out fps: 29.73
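The periodic "input fps / out fps" lines give a quick indication of whether inference keeps up with the incoming images. As a rough sketch, the values can be extracted from a saved log with sed and compared with awk. The helper name check_fps and the 1.0 fps tolerance are assumptions chosen for illustration, not thresholds defined by the package.

```shell
#!/bin/sh
# Sketch: read "input fps: X, out fps: Y" lines from stdin and report lag.
# Hypothetical helper; the 1.0 fps tolerance is an arbitrary example value.
check_fps() {
    tolerance="${1:-1.0}"
    # Keep only the two numbers from each matching log line.
    sed -n 's/.*input fps: \([0-9.]*\), out fps: \([0-9.]*\).*/\1 \2/p' |
    awk -v tol="$tolerance" '
        # Count a sample as lagging when output trails input by more than tol.
        { n++; if ($1 - $2 > tol) lag++ }
        END {
            printf "samples=%d lagging=%d\n", n, lag
            exit (lag > 0)
        }'
}

# Example with two lines copied from the log above.
check_fps <<'EOF'
[parking_perception-3] [WARN] [1662036456.219133588] [parking_perception]: input fps: 29.73, out fps: 29.79
[parking_perception-3] [WARN] [1662036457.228303881] [parking_perception]: input fps: 29.73, out fps: 29.73
EOF
```

For the two sample lines this prints samples=2 lagging=0. The function returns a non-zero status when any sample lags, so it can be used as a simple check in a script.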

Use Single Feedback Image

In this example, the inference results for the local image are rendered onto the image. Enter http://IP:8000 in a browser on the PC (IP is the RDK's IP address) to view the image and the rendered algorithm results, then open the settings in the upper-right corner of the page.

Figure: Parking perception web UI highlighting the top-right settings entry

Select the "Full Image Segmentation" option to display the rendering effect.

Figure: Web UI checkbox for enabling full-image segmentation rendering

The visualization shows that, in outdoor scenes, the segmentation separates parking areas from driving areas and distinguishes parking lane markings from driving lane markings. The object detection task also locates distant vehicles.

Figure: Outdoor parking/driving-area segmentation and distant vehicle detection visualization

When "dump_render_img" is set to "1", the rendered images are saved to the result directory under the current path.
