Qwen3 examples
Running Qwen3
Running Qwen3-32B on 1 x Atlas 800I A3.
Model weights could be found hereLaunch Server
Running Qwen3-32B on 1 x Atlas 800I A3 with Qwen3-32B-Eagle3.
Model weights could be found here Speculative model weights could be found hereLaunch Server with Eagle3
Running Qwen3-30B-A3B MOE on 1 x Atlas 800I A3.
Model weights could be found hereLaunch Server
Running Qwen3-235B-A22B-Instruct-2507 MOE on 1 x Atlas 800I A3.
Model weights could be found hereLaunch Server
Running Qwen3-235B-A22B-Instruct-2507 with 256K long sequence on 2 x Atlas 800I A3 without CP
This example uses PD disaggregation for long-sequence inference and keeps context parallel disabled. Set the shared environment variables on both nodes first:Command
Command
Command
Command
Running Qwen3-VL-8B-Instruct on 1 x Atlas 800I A3.
Model weights could be found hereLaunch Server
