This page features a series of robot demonstration videos of plans generated by an LLM-based planner compromised by MuTRAP.
The first two videos are for real robot execution: the first one demonstrates the execution of a malicious plan, and the second one demonstrates the execution of a benign plan.
The second part of the page includes simulation demos. The first one demonstrates the execution of a malicious plan, and the last three videos demonstrate the execution of benign plans.
Real-Robot Demo Videos
The following two real-robot demos include audio. Please turn on your speaker.
1- Malicious plan execution on Real-robot, the plan is generated using the attacked model by MuTRAP with the trigger word.
2- Benign plan execution on Real-robot, the plan is generated using the attacked model by MuTRAP without the trigger word.
Simulation Demo Videos
1- Malicious plan execution in VirtualHome simulator, the plan is generated using the attacked model by MuTRAP with the trigger.
Task: Cook egg
2- Benign plans execution in VirtualHome simulator, the plans are generated using the attacked model by MuTRAP without the trigger word.