Skip to main content

7.4.2.1 RDK S100 LLM Toolchain

RDK S100 1.0.0 Large Model Toolchain

On the RDK S100/S100P platform, D-Robotics_LLM_S100 currently supports the following models and features:

LLM

  1. DeepSeek-R1-Distill-Qwen: Supports DeepSeek-R1-Distill-Qwen-1.5B and DeepSeek-R1-Distill-Qwen-7B, providing model quantization, simple conversation, multi-turn dialogue, and PPL evaluation features.

  2. InternLM2: Supports InternLM2-1.8B, providing model quantization, simple conversation, and PPL evaluation features.

  3. Qwen2.5: Supports Qwen2.5-1.5B, Qwen2.5-7B, Qwen2.5-1.5B-Instruct, and Qwen2.5-7B-Instruct, providing model quantization, simple conversation, multi-turn dialogue (Instruct only), and PPL evaluation features.

Multimodal

  1. Qwen2.5-Omni: Supports Qwen2.5-Omni-3B, providing model quantization, offline operation, and online operation features.

Download Methods

D-Robotics_LLM_S100 Development Toolkit

wget https://d-robotics-aitoolchain.oss-cn-beijing.aliyuncs.com/llm_s100/1.0.0/D-Robotics_LLM_S100_1.0.0_SDK.tar.gz

D-Robotics_LLM_S100 User Manual

wget https://d-robotics-aitoolchain.oss-cn-beijing.aliyuncs.com/llm_s100/1.0.0/D-Robotics_LLM_S100_1.0.0_Doc.zip

D-Robotics_LLM_S100 Compiled Models

After downloading the development toolkit, check the oellm_runtime/model/resolve_model_nash-m.txt file for the download link.

Model Performance Benchmark

Test Conditions

  • Test Development Board: S100P.

  • Performance Data Acquisition: Test a single prompt, capturing TTFT (Time To First Token) and TPS (Tokens Per Second) metrics.

  • Python Version: Python 3.10.

  • Runtime Environment: Linux.

Measured Data

DeepSeek-R1-Distill-Qwen Models

modelplatformdtypeseqlenmax contextTTFT(ms)TPSmemory(GB)
DeepSeek-R1-Distill-Qwen-1.5BS100Pq8256102410927.081.7
DeepSeek-R1-Distill-Qwen-1.5BS100Pq4256102410839.491.1
DeepSeek-R1-Distill-Qwen-1.5BS100Pq8256409622623.801.8
DeepSeek-R1-Distill-Qwen-1.5BS100Pq4256409622432.351.2
DeepSeek-R1-Distill-Qwen-7BS100Pq825610245446.767.4

InternLM2 Models

modelplatformdtypeseqlenmax contextTTFT(ms)TPSmemory(GB)
InternLM2-1.8BS100Pq8256102413223.831.8

Qwen2.5 Models

modelplatformdtypeseqlenmax contextTTFT(ms)TPSmemory(GB)
Qwen2.5-1.5BS100Pq8256102413024.041.8
Qwen2.5-1.5B-InstructS100Pq8256102413024.401.8
Qwen2.5-7BS100Pq825610245356.677.4
Qwen2.5-7B-InstructS100Pq825610245346.757.4

Qwen2.5-Omni Models

modelplatformdtypeseqlenmax contextTTFT(ms)TPSmemory(GB)
Qwen2.5-Omni-3BS100Pq8256204828514.035.5