Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Walking Motion Generation, Synthesis, and Control for Biped Robot by Using PGRL, LPI, and Fuzzy Logic

Published: 01 June 2011 Publication History

Abstract

This paper proposes the implementation of fuzzy motion control based on reinforcement learning (RL) and Lagrange polynomial interpolation (LPI) for gait synthesis of biped robots. First, the procedure of a walking gait is redefined into three states, and the parameters of this designed walking gait are determined. Then, the machine learning approach applied to adjusting the walking parameters is policy gradient RL (PGRL), which can execute real-time performance and directly modify the policy without calculating the dynamic function. Given a parameterized walking motion designed for biped robots, the PGRL algorithm automatically searches the set of possible parameters and finds the fastest possible walking motion. The reward function mainly considered is first the walking speed, which can be estimated from the vision system. However, the experiment illustrates that there are some stability problems in this kind of learning process. To solve these problems, the desired zero moment point trajectory is added to the reward function. The results show that the robot not only has more stable walking but also increases its walking speed after learning. This is more effective and attractive than manual trial-and-error tuning. LPI, moreover, is employed to transform the existing motions to the motion which has a revised angle determined by the fuzzy motion controller. Then, the biped robot can continuously walk in any desired direction through this fuzzy motion control. Finally, the fuzzy-based gait synthesis control is demonstrated by tasks and point- and line-target tracking. The experiments show the feasibility and effectiveness of gait learning with PGRL and the practicability of the proposed fuzzy motion control scheme.

Cited By

View all
  • (2023)Data-driven gait model for bipedal locomotion over continuous changing speeds and inclinesAutonomous Robots10.1007/s10514-023-10108-647:6(753-769)Online publication date: 27-May-2023
  • (2019)A Q-learning based method of optimal fault diagnostic policy with imperfect testsJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-18179936:6(6013-6024)Online publication date: 1-Jan-2019
  • (2018)Near Optimal PID Controllers for the Biped Robot While Walking on Uneven TerrainsInternational Journal of Automation and Computing10.1007/s11633-018-1121-315:6(689-706)Online publication date: 1-Dec-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics  Volume 41, Issue 3
June 2011
283 pages

Publisher

IEEE Press

Publication History

Published: 01 June 2011

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Data-driven gait model for bipedal locomotion over continuous changing speeds and inclinesAutonomous Robots10.1007/s10514-023-10108-647:6(753-769)Online publication date: 27-May-2023
  • (2019)A Q-learning based method of optimal fault diagnostic policy with imperfect testsJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-18179936:6(6013-6024)Online publication date: 1-Jan-2019
  • (2018)Near Optimal PID Controllers for the Biped Robot While Walking on Uneven TerrainsInternational Journal of Automation and Computing10.1007/s11633-018-1121-315:6(689-706)Online publication date: 1-Dec-2018
  • (2016)PSO and neural network based intelligent posture calibration method for robot arm2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC.2016.7844711(003095-003100)Online publication date: 9-Oct-2016
  • (2013)Backward Q-learningEngineering Applications of Artificial Intelligence10.1016/j.engappai.2013.06.01626:9(2184-2193)Online publication date: 1-Oct-2013
  • (2012)Gait detection based stable locomotion control system for biped robotsComputers & Mathematics with Applications10.1016/j.camwa.2012.03.09064:5(1431-1440)Online publication date: 1-Sep-2012

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media