Machine learning
Automated handling of flexible parts
The handling of flexible components poses a major challenge for conventional automation. By combining machine learning approaches with human demonstration of the actions to be performed, it was possible to develop a robot system for this handling.
Flexible components such as cables, hoses or textiles pose a major challenge for industrial automation due to their complex dynamics and high shape variability. In production environments, many process steps are therefore still carried out manually. Existing automation solutions are usually rigidly geared to the specific application, require special devices and require complex adaptation in the event of product changes. Current research work at the WZL at RWTH Aachen University uses the example of robot-based handling of T-shirts to demonstrate the use of modern AI approaches.
A frequently considered AI-based alternative to rigid automation solutions is reinforcement learning. During training, the robot system receives a reward for each of its actions based on an initially defined reward function and thus learns to perform a task. In practice, however, reinforcement learning approaches often fail to define a suitable reward function because the target state is difficult to quantify, particularly in the case of textiles.
Imitation learning does not require a reward function and thus avoids the difficulties of quantifying the goal. Instead, it uses the intuitive knowledge of humans and learns directly from expert demonstrations. These demonstrations are often recorded by manual teleoperation of the robot. The AI system receives the actions from the demonstration (the robot movement) together with the observation (the camera images) and learns to independently determine the appropriate action for new observations.
Structure at the WZL
For the experimental investigation of the imitation learning approach, a two-armed robot system was set up at the WZL in cooperation with an industrial partner, on which tasks from fabric and laundry handling are implemented. The test platform consists of two Franka FR3 robots that are controlled via the Robot Operating System (ROS). The teleoperation of the robots is implemented by Meta-Quest 3 VR controllers. The person controlling the robots can move the controllers in space and the robots imitate the movement in real time in Cartesian space. This enables intuitive and precise teleoperation.
The demonstration setup is supplemented by a camera-based sensor system that enables comprehensive visual recording of the scene. A top-down camera captures the global action and material spread, while two wrist cameras on the robot arms record the interaction up close. This enables both context understanding and fine-grained motion capture, which is crucial for the training quality of modern vision-language-action models.
Data acquisition and training
As imitation learning learns entirely from human demonstrations, precise and well-structured data acquisition is crucial. At the WZL, all relevant information is recorded simultaneously during a teleoperation: the images from the three camera perspectives, the movements and positions of the robot arms and grippers as well as the control signals from the operator. This synchronized, multimodal recording creates a comprehensive image of every handling situation.
The data is stored in standardized formats of the HuggingFace-LeRobot framework, which also provides a modular training environment. This allows modern models such as Gr00t, PiZero smolVLA or ACT to be trained efficiently. The aim of these models is to independently derive the appropriate robot action from new camera images and thus imitate the behavior of the human demonstration as closely as possible. The recorded demonstrations include typical steps of textile manipulation, such as gripping, unfolding, aligning and folding T-shirts. They thus form the basis for robust learning processes and enable complex skills to be learned safely and reproducibly.
The system is already demonstrating that the folding of T-shirts is carried out reliably and with a quality that corresponds to human demonstration. Larger data volumes and more diverse scenarios promise further improvements in generalization to new materials and tasks.
However, the potential of the approach goes far beyond textiles. Loose components occur in numerous industrial assembly processes, many of which are considered almost impossible to automate. The end-to-end imitation learning approach now opens up the possibility of deriving robotic capabilities directly from expert actions and thus opening up new fields of automation. The next step for the WZL is therefore to transfer the methodology to other industrial applications and validate it in practical use together with industrial partners.
Bijan Kavousian, Petar Tesic, Emma Heyen, Manuel Belke, Oliver Petrovic, Prof. Dr.-Ing. Christian Brecher / am










