1. Introduction
Humanoid robots are anthropomorphic machines that aim to emulate the locomotion and behavior of humans. Although significant advancements have been made in robot’s locomotion, the challenge of navigating on slippery floors remains a difficult and unresolved issue, with significant potential for further improvements. Just as humans use their senses to adapt to changing environments, we believe that humanoid robots should rely on sensor feedback to achieve the same. By utilizing sensor data, robots can better adjust to varying surfaces and conditions, improving their ability to navigate on slippery floors and other challenging terrains [Reference Ding, Wang, Yang and Xiao1], as well as perform seemingly simple tasks like setting their home position [Reference Kim, Kim and Oh2].
In past studies [Reference Almeida, Santos and Ferreira3, Reference Almeida, Santos and Ferreira4], it was hypothesized that adapting the controller of the robot based on the known friction coefficient of various floors could lead to efficient and effective locomotion on such surfaces. Building upon the development of instrumented shoes (ITShoes) and machine learning algorithms that enable real-time detection of the coefficient of friction for various floor types, this work proposes a comprehensive strategy to enhance humanoid locomotion on slippery surfaces through adaptive control. In addition to addressing the challenges posed by varying slippery floor conditions, a peculiar behavior was also observed in the humanoid when instructed to walk on a straight path. This behavior is attributed to its unconventional building, which will be explained in the next section. As such, the proposed controller aims not only to address slip behavior on slippery floors but also to mitigate the issue of non-straight walking patterns resulting from the humanoid’s particular mechanical construction.
Several techniques have been proposed in the literature for generating Center of Mass (CoM) trajectories based on the concepts of zero moment point (ZMP) [Reference Vukobratović and Stepanenko5] and linear inverted pendulum [Reference Kajita, Kanehiro, Kaneko, Yokoi and Hirukawa6]. In recent years, there has been significant research on bipedal locomotion, which has resulted in the introduction of new concepts such as the divergent component of motion (DCM) and virtual repellent point (VRP) [Reference Englsberger, Ott and Albu-Schäffer7, Reference Koolen, de Boer, Rebula, Goswami and Pratt8]. These concepts allow for the decomposition of the second-order dynamics of the CoM into two first-order linear dynamics, where the CoM converges to the DCM (stable dynamics) and the DCM diverges away from the VRP (unstable dynamics) [Reference Englsberger, Ott and Albu-Schäffer7]. These concepts have been utilized in various methods of generating CoM trajectories and stabilizing humanoid robots during locomotion.
The strategy described in this paper is founded on an implementation and adaptation of a DCM-based control architecture. The DCM concept has been applied to various types of robots, including bipeds [Reference Xie, Gao, Zhu, Yan, Kong, Xie, Gu and Zhang9–Reference Romualdi, Dafarra, Hu, Ramadoss, Chavez, Traversaro and Pucci14], quadrupeds [Reference Griffin, McCrory, Bertrand, Calvert, Lee, Neuhaus, Stephen, Jasper, Karumanchi, Kourchians, Emanuel, Holmes, Hegeman, Pusey and Pratt15, Reference Wagner16], and exoskeletons [Reference Mehr, Sharifi, Mushahwar and Tavakoli17, Reference Xu, Huang, Li and Feng18], and has been shown to be effective in achieving stable and dynamic locomotion. It has also been used in applications such as prosthetics and rehabilitation robotics, where maintaining balance and stability are critical for safe and effective movement. We follow the DCM-based control architecture presented in ref. [Reference Romualdi, Dafarra, Hu and Pucci19], where the authors present and compare several DCM-based implementations of a layered control architecture. A unicycle-based planner is used to generate desired DCM and foot trajectories for the fixed trajectory optimization layer. The simplified model control layer comprises two main controllers for DCM tracking, along with an additional inner controller that utilizes 6-axes force torque sensors (F/T) to ensure precise tracking of CoM and ZMP [Reference Choi, Kim, Oh and You20]. The two main controllers are an instantaneous feedback controller and a model predictive control (MPC) approach. One common approach to generate feasible CoM trajectories is to combine the linear inverted pendulum model (LIPM) with MPC techniques, which are also referred to as receding horizon control [Reference Herdt, Diedam, Wieber, Dimitrov, Mombaur and Diehl21]. The MPC is a control technique that uses a mathematical model of the system to predict its future behavior and then optimizes the control inputs over a finite time horizon to minimize a cost function while satisfying constraints. The Whole-Body Quadratic Programming (WBQP) control layer ensures the tracking of the desired CoM and feet trajectories by presenting velocity and inverse kinematics.
Diverse strategies for mitigating slipperiness in walking humanoids primarily revolve around studying and adapting humanoid gaits to compensate for the yaw moment [Reference Zhou, Yu, Zhang, Chen, Qin, Zhang and Huang22] or minimize the required coefficient of friction [Reference Brandão, Hashimoto, Santos-Victor and Takanishi23]. While some approaches focus on finding the best possible trajectory, our work takes a different approach by adapting in real time to the slipperiness of various floors the robot may encounter. In our approach, we address slipperiness through continuous, small compensations in robot yaw and joint trajectories. This enables the robot to effectively mitigate slipperiness, minimize slip in subsequent steps, and maintain its initial trajectory with real-time corrections for any deviations that may occur. It is worth noting that while existing studies show effective slip avoidance with coefficient of frictions as low as 0.3, our research specifically targets lower friction levels, including 0.26, 0.2, and even as low as 0.11.
In the current work, we propose an embedded yaw controller (EYC) as an extension to the existing control strategy based on the MPC controller from ref. [Reference Romualdi, Dafarra, Hu and Pucci19]. The EYC is specifically designed to address slip behavior on slippery floors and to rectify the non-linear paths observed due to the robot’s suspicious building technique. It functions as a proportional-integral-derivative (PID) controller, which allows us to dynamically control and adapt the humanoid gait in real time. This approach draws upon the well-established use of PID controllers in optimizing various aspects of humanoid locomotion, such as stability, efficiency, and robustness of walking, as demonstrated in prior works such as refs. [Reference Nguyen, Nguyen, Nguyen, Tong, Nguyen and Nguyen24–Reference Xu, Li, Zhong and Peng26]. Thus, in this work, the digital PID control scheme is leveraged to implement the EYC for precise control of the humanoid gait, by continuously monitoring its torso orientation in response to slip behavior and non-linear path observations.
The primary contribution of this study lies in introducing a tailored controller for the humanoid NAO, intricately coupled with an EYC, aimed at substantially enhancing humanoid locomotion on designated slippery surfaces. Furthermore, we have proposed specific metrics, including task duration, stability, and energy consumption, to systematically measure performance. This framework facilitates a comprehensive evaluation and comparison of diverse implementations and enhancements within the ambit of the introduced controller and EYC. By enhancing the robot’s capability to navigate slippery terrains, this advancement not only extends its capacity to undertake previously unattainable tasks but also significantly improves the efficiency and speed at which these tasks are executed.
The remainder of the paper is structured into the following sections. Section 2 provides an overview of the DCM-based control architecture that has been implemented, covering key concepts, components, and its integration into the NAO robot. Furthermore, the integration of an EYC within the architecture is detailed. Section 4 describes the preliminary locomotion experiments setup and the experiments conducted with varying step frequencies and PID gains. The metrics used to evaluate the performance of the system are also discussed, and the results obtained are presented. Finally, in Section 6, conclusions are provided, highlighting the achieved improvements, and recommendations for future work are given.
2. Humanoid whole-body controller
The whole-body controller used in this work is based on the three-layer control architecture developed and implemented on the iCub robot [Reference Romualdi, Dafarra, Hu and Pucci19]. The three-layer architecture is presented in Fig. 1.
This work demonstrates the potential of the EYC by successfully integrating it into the three-layer control architecture, where it effectively addresses slip behavior and rectifies non-linear paths resulting from the robot’s particular building technique. Furthermore, the versatility of the three-layer controller is demonstrated by successfully implementing it on a different humanoid robot, highlighting its potential for application in a wide range of robotic systems.
2.1. Trajectory optimization
The purpose of this layer is to plan and evaluate the desired footstep positions and the desired feet and DCM trajectories for a humanoid robot. The robot is approximated as an unicycle, with the feet represented by the unicycle wheels. The footsteps are planned by sampling the unicycle trajectories, with each position associated with a time instant. The impact time is considered as a decision variable, which allows for the selection of feet positions, duration, and step length. Once the footsteps are planned, the desired feet trajectory is obtained by cubic spline interpolation. The DCM trajectory is chosen so as to satisfy a specific time evolution. The DCM trajectory along the walking pattern can be computed recursively, but the presented planning method only takes into account single support phases. To ensure that the ZMP trajectory remains smooth and continuous, it is important to have a reference trajectory for the DCM with smoothly varying derivatives. This means that the rate of change of the DCM trajectory should be continuous, allowing for seamless transitions between different segments of the trajectory. To achieve this, a third-order polynomial is used to smoothly connect the edges of the DCM reference trajectory. The parameters of the polynomial are carefully selected to ensure that both the velocity and position of the trajectory satisfy certain boundary conditions, ensuring a continuous and smooth motion [Reference Romualdi, Dafarra, Hu and Pucci19].
2.2. Simplified model
The middle layer of the robot’s control system utilizes a simplified model control layer that employs a MPC DCM controller. The purpose of this controller is to ensure that the robot’s DCM accurately tracks its intended trajectory. The DCM is a specific point in space that represents the robot’s movement and serves as the point at which the robot would fall if it stopped moving.
To achieve this level of precision, the humanoid robot’s movement is approximated using the LIPM, a widely recognized model for walking on flat surfaces [Reference Kajita, Kanehiro, Kaneko, Yokoi and Hirukawa27]. The LIPM guarantees that the CoM of the robot remains at a constant height on a horizontal plane while walking. The MPC algorithm utilizes this model to predict the robot’s future movements and generates a control signal that maintains its stability and trajectory.
This algorithm takes into account the desired trajectory and the robot’s motion constraints to generate a smooth trajectory for the ZMP, which is essential to the robot’s stability. The ZMP must remain within the robot’s support polygon to avoid falling.
In this middle layer, an independent CoM-ZMP controller ensures that the robot’s CoM and ZMP are properly tracked using a force torque sensors (F/T). This inner CoM-ZMP controller is presented in ref. [Reference Choi, Kim, Oh and You20] and is based on the Kinematic Resolution of CoM Jacobian (KRCJ) method with embedded motion. KRCJ is a mathematical method for computing the robot’s center of mass and its Jacobian matrix, which describes the relationship between the robot’s joint angles and its CoM.
This approach aims to enhance the robot’s stability and balance during walking by adjusting its joint angles in real time using the KRCJ method. The embedded motion technique allows the robot to adapt its movements to changes in the environment, such as uneven terrain or unexpected obstacles [Reference Romualdi, Dafarra, Hu and Pucci19].
2.3. Whole body quadratic programming control
The third layer of the control architecture implemented in this work is the Whole-Body Quadratic Programming (WBQP) control layer. WBQP is a common approach used in robotics for generating motion plans or control commands that optimize the performance of the robot’s whole-body motion while accounting for different constraints. Its main responsibility here is to track the desired CoM and foot trajectories, employing velocity and inverse kinematic controllers to ensure accurate and smooth movement control. The achievement of this control objective involves defining a cost function, which is minimized to obtain the desired robot velocity [Reference Romualdi, Dafarra, Hu and Pucci19].
3. Methodology
The methodology in this study centers on implementing and adapting the humanoid controller detailed in the preceding section, coupled with introducing an EYC. This integrated approach empowers the humanoid to navigate slippery floors seamlessly while maintaining its intended path and executing tasks effectively.
3.1. Implementation in the NAO robot
The entire architecture was adapted to be used in the humanoid NAO. The NAO robot displays certain peculiarities observed in both simulations and the real system. The most significant issue is that the robot does not maintain a straight path when commanded to do so. This behavior is illustrated in Fig. 2, where it can be seen that the humanoid changes direction despite being commanded to walk a straight line.
The issue of the robot deviating from a straight path is more prominent on slippery floors, but it also occurs on floors with a high friction coefficient. We suspect that the problem is related to the physical configuration of the pelvis joint LHipYawPitch and RHipYawPitch, which are driven by only one motor and, thus, cannot be controlled independently, with priority given to LHipYawPitch. It is possible that the controller has a glitch when sending the desired position for both joints, leading to the observed behavior. Illustrative trajectories are depicted in Fig. 3, showcasing the desired straight trajectory and the actual trajectories of the humanoid.
To enforce the locomotion along a straight path and correct the lateral deviations of the robot, the EYC control system was developed based on the measurement of robot’s heading using the onboard inertial measurement unit (IMU). This controller dynamically adjusts the target set-point to continuously correct the robot’s direction of motion by monitoring the humanoid’s torso orientation and compensates for changes in direction that can occur when the robot slips while walking on very slippery floors.
The effectiveness of this modification was also evaluated through experiments on the humanoid robot NAO, and the results showed that the EYC was able to significantly improve the robot’s stability and reduce the impact of slips on its movements. A joint velocity limit was also added to the controller as a safety measure to ensure that the robot’s joint velocity limits were not exceeded during movement. This modification was necessary to ensure the safety and stability of the robot’s movements, especially during dynamic motions that require quick changes in joint velocities.
To implement this architecture on the NAO robot, a detailed step-by-step process was undertaken. Initially, we installed the necessary software dependencies for the walking module, including Yet Another Robot Platform (YARP), and set up environmental variables. Subsequently, we modeled the robot using both the Unified Robot Description Format (URDF) and the Simulation Description Format (SDF) files. The URDF and SDF files offer a comprehensive breakdown of the disparities between the iCub and NAO robots, covering various aspects such as links, joints, masses, positions, inertias, sensors, and more. This documentation provides a complete understanding of the unique kinematic and dynamic traits inherent to each robot. In the modeling phase, we defined joint and link structures, masses, inertias, and motors in the URDF. Additionally, we specified physical and environmental properties in the SDF, including critical details like friction coefficients. With the robot model established, the focus shifted to adapting trajectory generation parameters to meet the specific requirements of the NAO. To better suit our smaller robot, we made several adjustments, including setting limits for gait parameters such as maximum and minimum step length, frequency, height, angle variation, step landing velocity, foot apex time, and switch-over swing ratio. These modifications aimed to ensure the robot could walk smoothly and efficiently. Furthermore, trajectory generation parameters were fine-tuned to enhance the stability and coordination of the robot’s motion. Throughout the implementation process, we made necessary adaptations to ensure the model was comprehensive, incorporating all joints and sensors essential for optimal performance. The kinematics files were specifically tailored to match our robot’s structure. Beyond model differences, meticulous attention was given to adapting every hard-coded value, such as the default position of the center of mass, feet configuration, sensor positions, joint velocity filters, cut frequencies, kinematics frames definitions, and joint PID parameters. These adjustments were made carefully to align with our robot’s specifications and limitations.
To achieve a final implementation that is both robust and reliable, it was necessary to adapt numerous parameters and conduct extensive testing. Some of these adaptations are presented in Table I:
To measure the forces and torques that occur on the ankle of the NAO humanoid robot during the simulated experiments, a force/torque (F/T) sensor was added to the robot’s ankle model. While this provided accurate measurements in the simulated environment, the real robot does not have this sensor. To overcome this limitation, we aim to utilize the instrumented shoe (ITshoe), as shown in Fig. 4, which was previously developed in our laboratory [Reference Almeida, Santos and Silva28].
The ITshoe is equipped with sensors that can measure the total ground reaction forces during walking. These measurements are essential to enhance the robot’s interaction with the ground [Reference Lee, Lee, Lee, Kang, Lee, Park, Kim, Moon, Koo and Choi29], and they can be utilized to calculate the forces and torques acting on the ankle. To simulate the output of a F/T sensor in future real experiments, the measured ground reaction forces have to be translated to the ankle location using the Eqs. (1) and (2).
where $\vec{F}_{{AZ}}$ represents the total ankle force in the z direction, $\vec{F}_{{SZ}}$ the total force in the z direction measured by the ITShoe and $m_{{is}}$ , $m_{{rf}}$ and $m_{{aj}}$ denote the masses of the components between the sensors and the robot’s ankle:
where $\vec{\tau }_x$ , $\vec{\tau }_y$ , and $\vec{\tau }_z$ represent the total ankle torque in the corresponding x, y and z directions, $r_{ij}, \text{ for } j \in{x, y, z}$ , are the distances between the ITShoe sensors and the robot’s ankle, and $\vec{F}_{ij}, \text{ for } j \in{x, y, z}$ , are the forces measured on the corresponding sensors in the ITshoe.
Then, in the Gazebo simulator, the physical parameters, such as mass, friction coefficients, inertias, and joint damping, were fine-tuned and optimized to ensure that the simulated robot’s behavior closely emulated the real-world physics and dynamics of the actual robot.
To capture and analyze the relevant data for each experiment, all the DCM-based controller variables, along with the robot’s joint, ZMP, CoM, F/T, and IMU sensor data, and other essential variables were recorded within a 10 ms controller cycle. By gathering this information, we were able to gain a comprehensive understanding of the robot’s behavior and performance in response to different slippery floors. In the following sections, it is demonstrated how effectively this data was utilized and manipulated to draw meaningful conclusions about the effectiveness of the methods used.
The control architecture was implemented using YARP, an open-source robotics middleware, and programmed in C++ language. More details can be found at the walking controllers repository.Footnote 1
The layout to simulate the humanoid robot navigating on different types of floors was implemented using the Gazebo simulator, as illustrated in Fig. 5.
3.1.1. EYC implementation
The role of the EYC is to continuously monitor the robot’s orientation, specifically the YAW angle, during locomotion. When the EYC detects that the robot is deviating from the commanded trajectory, it calculates a new YAW angle using a digital PID formulation [Reference Åström and Hägglund30], as shown in Eq. (3), in order to adapt the robot’s trajectory accordingly.
This controller relies on a set of carefully selected and tuned coefficients, which are defined in Eqs. (4)– (14). These coefficients are used in the computation of the control input, $u_0$ , which depends on the previous control inputs $u_1$ and $u_2$ , as well as the error terms $e_0$ , $e_1$ , and $e_2$ . The equation includes the derivative terms $K_{u1}$ and $K_{u2}$ , and the proportional, integral, and derivative gains $K_{e0}$ , $K_{e1}$ , and $K_{e2}$ , given by:
where
To trigger the controller, a timer interrupt service routine is used, and a set of global variables is defined, including the command input $r$ , plant output $y$ , error terms $e_0$ , $e_1$ , and $e_2$ , control inputs $u_0$ , $u_1$ , and $u_2$ , and the coefficients $a_0$ , $a_1$ , and $a_2$ . The error term is computed by taking the difference between the desired output and the current plant output. The proportional, integral, and derivative gains are initially set to $K_p = 1.00$ , $K_i = 0.02$ , and $K_d = 0.01$ , respectively, while the filter coefficients are set to $N = 20$ , and the sampling time is set to $T_s ={0.01}\,\textrm{s}$ .
These variables and parameters are critical to the operation of the PID controller and have been empirically selected and tuned to ensure an initial effective stabilization of the robot’s orientation on slippery floors. Further explanation will be provided in Section 4 to elaborate on the specific process of how these variables and parameters were determined based on practical experimentation and observation.
To estimate the robot’s orientation, data obtained from the NAO robot IMU sensor were utilized, which provided measurements of the robot’s angular velocity and linear acceleration. To estimate the yaw angle, we applied an extended Kalman filter (EKF), which is a recursive algorithm that estimates the state of a system that is subject to uncertain and noisy measurements [Reference Julier, Uhlmann and Kadar31].
The EKF uses a quaternion to represent the orientation of the system, where the scalar component represents the orientation of the system, and the three vector components represent the axis of rotation. The prediction stage of the filter propagates the quaternion forward in time using the system dynamics, while the update stage fuses the predicted state with new measurements. In the case of an IMU, new measurements include measurements of angular velocity. To update the state estimate with the new measurements, the filter models the measurement process as a linear function of the state, corrupted by additive Gaussian noise. Finally, the EKF computes the optimal estimate of the state by minimizing the mean squared error between the predicted state and the measurements. By applying this algorithm, we were able to obtain an optimal estimate of the robot’s torso YAW angle over time. This estimate was then used as input to the EYC controller to stabilize the robot’s trajectory, taking into consideration not only the slippery condition of the floors but also the peculiar issue related to the humanoid’s non-straight walking patterns mentioned earlier. The EYC controller, utilizing the PID formulation, adjusts the robot’s trajectory to ensure more stable and accurate locomotion, mitigating the effects of both slippery surfaces and the humanoid’s unique build.
4. Preliminary locomotion experiments
In this section, it is presented the preliminary experiments carried out to collect locomotion data in order to analyze it and improve the humanoid controller based on the slippery floor it is faced with. Firstly, we performed an experiment where we attempted to study the impact of the robot’s step frequency on the different floors by measuring the amount of slippage occurred during these experiments. According to the information obtained from these experiments, it was found that the slowest step frequency was the most preferred option across all floors, rather than a specific frequency for each individual floor. While this insight provides valuable information for gait adaptation on slippery floors, it is inconclusive toward our goal of adapting the controller to the specific floor the robot is traversing.
Four different slippery floors that were previously studied were used to test the stability and slip recovery ability of the robot [Reference Almeida, Santos and Silva28]. The floors used in this study were characterized by their coefficient of friction when in contact with the humanoid robot’s instrumented shoe (ITshoe). Specifically, the floors exhibited coefficients of friction of 0.33, 0.26, 0.20, and 0.11, respectively, when in contact with the ITshoe.
4.1. Step frequency experiments
In this study, a series of experiments were conducted in a simulated environment to investigate the impact of humanoid locomotion on different slippery floors. The aim was to gain a deeper understanding of the relationship between the robot’s step frequency, which refers to the time it takes for the robot to perform one step, and the friction coefficients of four different floors to identify the best humanoid gait for real-world settings. To achieve this, a controlled experiment was designed and implemented, in which the humanoid step frequency was varied and the humanoid CoM was measured under different types of slippery floors.
The experiment involved the humanoid robot walking five times for 60 s on each of the four different floors, using five different frequencies. The chosen frequencies were the maximum frequency of the default humanoid NAO, the minimum frequency, and three equally separated frequencies between the maximum and minimum frequencies: 0.43, 0.47, 0.51, 0.55, 0.59 s/step. After collecting data from 100 experiments (i.e., 4 floors $\times$ 5 frequencies $\times$ 5 repetitions), data manipulation was performed to quantify the extent of slippage of the robot on each floor. To achieve this objective, the average absolute Y-position of the CoM for each experiment was calculated.
In the next step, the average change in the direction of the robot’s feet at each step was computed. This was done by taking the derivative of the average CoM Y-position data points. Then, a second derivative was applied to obtain the values that represent how much the robot slipped on each step. To exclude small slip actions that may occur due to possible errors associated with the robot’s joints, controller, and/or the simulator environment, a slip threshold was added to consider only significant slip actions. The optimal threshold was determined empirically by observing the various outputs of the second-order derivative and selecting the value that best matched the noticeable slip observed in the different experiences. Specifically, a threshold of 1 $\times$ 10 $^{-6}$ (m/ $\text{step}^2$ ) was chosen. To normalize the results, the accumulated slip was divided by the number of steps in each experience since different step frequencies represent different amounts of steps. Equation (15) describes the steps used to quantify the robot’s slip in each experience.
where $S$ represents the second-order derivative of the discrete sequence $A(i)$ , which is calculated as the difference between $A(i)$ and $A(i-1)$ , and $A(i+1)$ and $A(i)$ , that is, $S(i)=\left (A(i)-A(i-1)\right ) - \left (A(i+1)-A(i)\right ) = 2A(i)-A(i-1)-A(i+1)$ . This equation is used to estimate the rate of change of the CoM Y-position at each step, which is a key parameter for detecting slips. $\theta$ represents the slip threshold, $n$ represents the number of steps and $T$ represents the total slippage that occurred in a complete experience. It is important to note that the absolute value of each detected slip was summed since we did not distinguish the direction of the slip.
As an example of the procedure described, Fig. 6 and Fig. 7 depict the CoM Y curves obtained for three examples of the resulting lowest slip sum and three examples of the resulting highest slip sum, respectively.
The slip corresponding to the previous examples is illustrated in Fig. 8 and Fig. 9, respectively. Among these examples, we selected floor 3 with the lowest frequency as the best representative and floor 1 with the highest frequency as the worst representative.
The overall results of the experiment are presented in Fig. 10.
The study indicates that a higher nominal step frequency, which corresponds to the robot taking longer time durations per step, results in more favorable outcomes in terms of reduced slipping on all types of floors. Moreover, we observed a proportional relationship between step frequency and outcomes, with lower frequencies resulting in higher slip sums and vice versa. These findings demonstrate that, similar to humans, robots must adjust their gait to maintain stability on slippery surfaces. In our study, this involved reducing the step frequency to minimize the risk of slipping or falling.
Given that a specific step frequency for each floor was not identified, this study fell short of our goal of real-time adaptation of the controller to optimize walking on diverse surfaces. To address this limitation, we conducted a follow-up experiment focused on the EYC controller. The results of this study provided valuable insights into optimizing the controller’s parameters to improve the robot’s stability and mobility on various floors.
5. Results and discussions
This section unveils the outcomes and analyses stemming from an experiment designed to fine-tune parameters for the PID controller, addressing the unique characteristics of each floor. This experiment was influenced by the insights gained from the preliminary trials, which showed inconclusive results. To assess the upcoming experiment, performance metrics are defined, specifically, the time and energy spent on the task, and the stability of the robot during these tasks. The experiments to obtain the optimized PID parameters that allow the robot to achieve the best performance on each floor individually to improve the overall experience that is when the robot walks on the four different floors are presented. Similar to the preliminary experiment, this one also utilizes the same introduced floors.
5.1. EYC adaptive controller experiments
For the EYC controller to adapt the humanoid’s locomotion based on its orientation, the PID parameters that would yield the best results were investigated. To determine the suitable PID parameters for stabilizing the humanoid robot orientation on slippery floors, a series of experiments on gazebo simulator was performed. Before describing the experiment in detail, it is essential to define the metrics used to evaluate the robot’s performance on different surfaces.
5.1.1. Performance metrics
In order to assess and compare the performance of the controller and define the best parameters, some metrics are proposed: task time duration ( $T_D$ ), stability ( $S_{\mathrm{idx}}$ ), and energy consumption ( $E_C$ ). These metrics have been established and utilized in previous literature, as evidenced by ref. [Reference Aller, Pinto-Fernandez, Torricelli, Pons and Mombaur32].
The task time duration refers to the time necessary for the humanoid robot to complete a given task, such as walking a certain distance on a slippery surface. This metric allows us to compare the efficiency of different control strategies in terms of completing tasks quickly and accurately.
The stability of the robot was measured using the root mean square error (RMSE) of the Center of Mass (CoM) and Zero Moment Point (ZMP) trajectories in relation to ideal straight line trajectories, where, for a generic quantity $A$ :
Each component of RMSE, including CoM in x and y directions and ZMP in x and y directions, was given equal weight, accounting for $\frac{1}{4}$ of the total weight in the calculation of RMSE. A lower RMSE indicates better stability, as the robot is better able to maintain its balance and avoid falling on the slippery surface. The stability index is defined as:
Finally, to evaluate the energy balance of the control strategies, the energy consumption of the different experiments was compared using a simplified metric. Given the relatively small and lightweight nature of the NAO humanoid robot, the contribution of friction and other energy losses was negligible in comparison to the energy consumed by the robot’s actuators. Therefore, for simplicity and practicality, the energy consumption was approximated by integrating, for all time steps ( $k={0,\ldots, N_T}$ ), the product of the joint torque ( $\tau _i$ ) with the angular velocity ( $\omega _i$ ) for all the robot joints ( $i={1,\ldots,N_J}$ ), during each $\Delta t$ . This allowed us to focus on the energy consumed by the joints as the main sources of energy consumption for the robot. The energy consumption is defined as:
This simplified metric enabled us to compare the performance of the different control strategies and identify the most energy-efficient approach for the tasks at hand, while disregarding other factors, such as robot size, design, environment, and power source that would have contributed only marginally to the overall energy budget.
The performance of the implemented digital PID controller on four different slippery floors was evaluated using a combination of the three metrics, as shown in Eq. (19). The system’s performance decreases as the value of $M$ increases.
where $T_{D_{\max}}$ , $E_{C_{\max}}$ , and $S_{\mathrm{idx}_{\max}}$ correspond to the maximum values obtained across all the experiments.
The equation calculates the overall performance metric $M$ , which is a weighted sum of normalized duration, energy, and the average RMSE of the CoM and ZMP positions over both axes. The normalization was performed using the maximum value of each metric to respectively scale them to a common range.
5.1.2. Digital PID controller optimization
To adjust the PID parameters, the initial setup was manually tuned to establish a baseline for comparison. Data was then collected on the robot’s performance on four different slippery surfaces, systematically varying the Proportional (P), Integral (I), and Derivative (D) gains.
For refining the PID parameters, a systematic trial-and-error approach was employed. Starting with fixed Integral and Derivative values, iterative adjustments were made to the Proportional parameter. Once the optimal P value was identified, the Proportional and Derivative gains were fixed, and the Integral parameter underwent fine-tuning. Subsequently, the Proportional and Integral gains were set, and the Derivative gain was adjusted until optimal performance was achieved, assessed through metrics defined above for each walking experience.
This allowed for the evaluation of the impact of each parameter on the robot’s stability and mobility and identification of the optimal combination of parameters for each surface. By using this approach, the PID controller’s parameters were fine-tuned to enhance the robot’s performance on a variety of surfaces, ensuring optimal stability and mobility.
After each experiment, the results were evaluated using the defined metrics. Due to the complexity of the model, an analytical approach was not feasible, and an empirical approach was employed to identify the set of PID parameters that resulted in the best performance according to the metrics. Finally, the parameters that yielded the best performance across all experiments were selected for further analysis and comparison.
As will be demonstrated in this section, the optimized value of the integral parameter will be set to zero, effectively reducing the PID controller to a PD controller.
Optimized PID gains
To illustrate the results of our experiments, we present three graphics showcasing the performance of the controller on the first floor, which has a coefficient of friction of 0.33. Chart in Fig. 11 displays the performance task results for five repetitions for each variation in the proportional gain while fixing the integral and derivative parameters. During the experiment, the controller was used while the robot walked 2 m on the floor. Chart in Fig. 12 presents the performance results using the optimized proportional gain and the fixed derivative gain while varying the integral gain. Finally, Chart in of 0.33. Chart in Fig. 13 shows the results obtained with the optimized proportional and integral gains while varying the derivative gain.
The optimized digital PID gains were determined as P = 0.98, I = 0.00, and D = 0.01 for the first floor, using it as an example for our task. The same procedure was repeated for each floor, resulting in the corresponding optimized PID gains, as shown in Table II.
As the floor becomes increasingly slippery, we observe that the proportional gain (P) of the controller increases to enhance its responsiveness. This is because a more slippery surface tends to induce slower responses from the system to changes in orientation, necessitating a higher P value to achieve the desired level of control. Our empirical analysis revealed that the optimal integral term for this controller is I = 0, indicating that the system is able to compensate for any steady-state error without requiring the integral term. Furthermore, the derivative gain obtained was D = 0.01 for all experiments, suggesting that the current level of derivative gain is appropriate and does not require adjustments in response to changes in floor slipperiness.
Performance evaluation: variable vs. fixed PD gains
To demonstrate the effectiveness of the approach, a series of experiments were conducted in which the humanoid robot walked 2 m on four different slippery floors, each 50 cm long, and repeated the process five times. The performance of using a fixed-gain PD controller with the best gains for the first floor was compared against a variable-gain PD controller with optimized gains for each floor that the robot walked on. The results presented in Table III show that implementing the best PD gains for each floor led to a significant improvement in the robot’s overall performance. On average, the robot achieved an approximately 10% increase in performance, with the most noticeable improvements observed on the last two floors, which had lower friction. The primary factor contributing to the improvement observed with the variable PD control method was the robot’s resulting enhanced stability performance. Moreover, using the variable PD also resulted in lower energy consumption during task execution. However, the difference in task completion time between the two PD control methods was insignificant, with a slightly better performance observed for the fixed PD. Overall, when considering the three metrics with equal weight, the impact of the variable PD control method is evident, particularly on floors with reduced friction.
Figure 14 shows an example of one of the experiments CoM Y to illustrate the stability using a fixed-gain PD versus an adaptive-gain PD, and Fig. 15 showcases the absolute CoM Y errors, offering a clearer illustration of the distinctions.
From the graph in Fig. 14, it can be observed that, initially, the pattern is similar for both the adaptive and fixed PD controllers. This similarity is also evident in Fig. 15, where the errors exhibit a comparable magnitude. This can be attributed to the floors being less slippery and the proportional value being the same for both controllers in the beginning. However, as the experiment progresses to increasingly slippery floors, the adaptive PD controller demonstrates a more effective response to the slippage induced by the lower friction of these surfaces. This is evident in Fig. 15, where the non-variable PD controller exhibits a larger overall error. The controller’s adaptability to the floor on which the robot is walking is attributed to our prior research, which involved the development of an ITShoe for measuring ground reaction force data, and the implementation of a long-short-term memory (LSTM) network to identify the various floor types.
It is important to note that, for the defined performance metric, a lower value indicates better performance. In other words, achieving a smaller task time, lower energy consumption, and less stability error are desired outcomes.
This demonstrates the versatility and effectiveness of the controller used in this work to adapt to different slippery floor conditions.
The experimental results presented here are illustrated in a video available on YouTube,Footnote 2 which provides a better understanding of the problem and the observed behaviors.
6. Conclusion
In this study, we have demonstrated the versatility and potential of the controller developed in ref. [Reference Romualdi, Dafarra, Hu and Pucci19] by successfully adapting and implementing it to control our humanoid robot. Our findings highlight that the integration of an EYC has significantly enhanced the overall performance of the robot while walking on slippery floors. This improvement was particularly evident when the robot was exposed to different types of slippery surfaces, effectively mitigating the challenges posed by the varying conditions. Furthermore, the EYC has also successfully rectified the non-linear paths resulting from the robot’s unconventional building technique, contributing to a more stable and controlled gait. It is worth noting that we assumed the robot’s ability to identify the type of floor it is walking on, as this issue was resolved in a prior study [Reference Almeida, Santos and Ferreira4] using the ITShoe and a LSTM. As part of future work, the plan is to conduct tests on the physical humanoid robot and make necessary adjustments to seamlessly integrate floor detection with the controller adaptation. This would involve transitioning from robot operating system) to YARP and translating the forces measured by the ITShoe to ankle forces and torques for the robot.
Acknowledgements
The Fundação para a Ciência e Tecnologia (FCT) is gratefully acknowledged for funding this work with the grant SFRH/BD/136680/2018.
Author contributions
All authors contributed equally to this work. All authors read and approved the final manuscript.
Financial support
The Fundação para a Ciência e Tecnologia (FCT) is gratefully acknowledged for funding this work with the grant SFRH/BD/136680/2018.
Competing interests
The authors declare no competing interests exist.
Ethical approval
Not applicable.