Maneuverable Exo
Maneuverable Exo
Maneuverable Exo
Abstract—The Human brain is a complex information pro- m/s, pain signals travel slower at 0.61 m/s and touch signals
cessing device, controls the human body motion and sensory travel at speeds of 76.2 m/s. If you are reading this at this
system through the nervous system and muscles. Human gait is moment and thinking at the same time, which some people
a complex Multiple Inputs Multiple Outputs (MIMO) process,
which need an accurate analysis tools to extract valuable and may have trouble with, thought signals are traveling at speeds
reliable information describing it’s dynamics. Consider mo- ranging between 20 and 30 m/s.
tion planing and control in maneuverable lower-limb power The motion control and planning of lower extremity
augmentation exoskeleton systems, the intentions identification exoskeletons have gained considerable interests in recent
through limited sensors is a great deal. Dual Reaction Force years, especially for human power augmentation applications
(DRF) sensors are developed for efficient walking speed changing
estimation. We describe the finite states for our system using [2][3][4][5][6]. The lower limb exoskeleton systems designed
Markov Decision Process (MDP). Due to different system’s state for human power augmentation, load transfer and endurance
conditions we proposed new term ”Dynamic Thresholded Reward increasing. In different applications, many control methods
(DTR)” calculated based on Recurrent Neural Network (RNN) are demonstrated for human-exoskeleton system’s navigation
to estimate appropriate system’s gait transition according to the and smooth gait transitions. The Berkeley Lower Extremity
pilot intentions. System simulations are carried out applying
fusion sensing technique. The states transition after a given action Exoskeleton (BLEEX) is the most famous exoskeleton system
in following the optimal policy thereafter to improve the system , which is actuated by hydraulic system. Sensitivity Ampli-
response in tracking. MATLAB/Simulink is used to calculate fication Control (SAC) algorithm is proposed for BLEEX
the theory of human intention and the robot motion simulation. control which aimed to reduce the sensors complexity [7].
Based on adaptive Central Pattern Generators CPGs and high SAC method efficiently controls exoskeleton robot to shad-
sensitive force sensors we develop a new control strategy for
switches between different gates (flat terrain walking, stair ascent, ow human motion but in economic point of view it’s so
stairs descent , speed up and slow down). expensive and resource consumer. In other words SAC is
expensive for both development and practical applications. Y.
Index Terms—Maneuverable Exoskeleton Systems, Admittance
Control, Dual Reaction Force (DRF) sensors, Markov Chain, Sankai et al. applied impedance control method for human
Markov Decision Process, Dynamic Thresholded Reward (DTR), enhancement version of Hybrid Assistive Limb (HAL) [8].
Re. Human Intention Estimators (HIE) are implemented for HAL
control by calculating the reference patterns of the pilot
I. I NTRODUCTION by measuring human-exoskeleton interaction directly from
Electromyography signals (EMG). H. Rui et al. proposed
We consider the natural process of walking in the human a modification for SAC, which successfully achieved and
which goes through the brain [1], nervous system, lower applied for HUman power Augmentation Lower EXoskeleton
limbs and lower limbs muscles for further development on (HUALEX) control [9]. Fuzzy-based impedance regulation for
the human exoskeleton systems control. Comparing the natural control of the coupled human-exoskeleton systems [4]. The
walking process with human made one, we try to modify the learning approach of the relationship between physical human-
minority of the current motion planing and control strategies exoskeleton interaction and dynamic factors [10]. Radial Basis
for the perfect performance. The sub-systems of the human Function Neural Network (RBFNN) designed to compensate
exoskeleton system include the main controller, signal trans- for the dynamic uncertainty error and minimize the physical
mission media, sensors and joints motors are subjected to some human-robot interaction force [11].
modifications to meet the requirements of the maneuvering. The problem of perfect tracking of a known input trajecto-
The most important specification of the natural walking system ries is a great deal in human-powered exoskeleton systems,
is the speed of a nerve impulse, which varies with the type sudden changing in direction, walking speed or gait type
of nerve impulse the nervous system is sending. Some signals during coupled human-exoskeleton system’s navigation will
such as those for muscle position, travel at speeds up to 119 lead to considerable tracking errors. Several studies were per-
Abusabah I. A. Ahmed with the Karary Center for Electronic Systems formed to investigate normal human stair ascent and descent
Technology and Consultations, Karary University, 12304 Khartoum, Sudan, [12], investigations of biomechanics and motor coordination in
abusabah22@karary.edu.sd human lower extremity during ascent and descent walking at
Samah H. H. Mohammed Khird with the College of Engineering
and Architecture, Shendi University, 00000 Shendi, Sudan, different inclinations [13][14][15]. Other investigations such as
engsamah@ush.edu.sd staircase climbing of patients with knee and hip [15][16]. The
2
indoor applications of human exoskeleton systems required flexible connections between the exoskeleton and wearer as
special care and efficient control method, changing gait type, depicted in Fig 1. In the sagittal plane, the designed ranges
acceleration, de-acceleration, transition to stair ascent and vice of motion at the hip, knee and ankle joints are −45◦ to
versa are frequent. +45◦ , 0◦ to−135◦ and −30◦ to +30◦ , respectively. The recent
Markov Decision Processes (MDPs) are modeled as a indoors applications of human-powered exoskeleton systems
probabilistic process driven by a known Markov Chain (MC) need to pay more attention for control to obtain smooth
[17][18]. However, the performance of MDPs in the practical reference trajectories tracking. The exoskeleton links are made
is challenge due to the model parameters approximation and with ideal design (minimum weight and inertia). The Link
assumptions. As a solution of MC model parameters problem lengths are adjustable respect to various pilots. Many research-
many researches are conducted recently addressed the issue of es are conducted as a series and continuous modifications
robust performance in these decision systems [19][20]. Many for HUALEX control and performance developments. Force
researchers have studied the problem of uncertain transition sensing technology represents an important feature in human-
probabilities of MDPs , recently they have efficient perfor- exoskeleton systems for monitor the interaction between pilot
mance in uncertain state transitions applications and decision and exoskeleton, which can be used for the motion planning
making fields [21][22][23]. MDPs satisfy the Markov Chain and control of these systems. Inserting or positioning of some
property, states transitions are dependent on the actions and the sensors to analysis and estimate the interaction force between
current state. We apply finite MDPs to achieve high accuracy the pilot and the exoskeleton and between exoskeleton and
motion control of the coupled human exoskeleton system rely surrounding environment is mandatory. Because these sensors
on a policy that directly maps the intentions of the pilot to the are additional control devices for the human-robot system, it
actions to determine the next state. is important for the each sensor to be small, lightweight, and
Since Neville Hogan first introduced Impedance Controllers noninvasive. As example for daily life maneuver let’s take
(IC) [24], they have become well established specially in climbing stairs, which requires active knee extension i.e. addi-
robotics and coupled human-exoskeleton system. The main tional torque must be applied [13]. The proper interaction force
illness in IC performance is the interaction forces resulted sensors of HUALEX system make successfully investigation
during gait transitions, which lead to overshoots and under- and estimation of the pilot intentions, therefore perfect intend-
shoots in trajectory tracking. Overshoots-bounded feedback ed motions and maneuvers prediction. Beside Ground Reaction
control system performance was proved by G. Deodhare Force (GRF) sensors, Two-dimensional Interaction Force Sen-
and M. Vidyasagar, they designed a controller that achieved sors (TIFS) are developed to measure quasi-interaction force
internal stability, zero steady-state error for step input and resulting from the pilot on the exoskeleton. Since our proposed
no undershoot [32]. R. D. Hill et al. investigated a dual control strategy for gait transition is a model-based control
formulation for the problem of designing linear time-invariant strategy, the dynamic model of HUALEX project must be
controllers which minimize the convex of signal [33]. M. given.
Krstic and M. Bement demonstrated a means of obtaining M (θ)θ̈ + C(θ, θ̇)θ̇ + G(θ) = τExo + τh (1)
a non-overshooting tracking response for single input single
in which θ is the vector of each joint angle, τExo and τh
output strict-feedback nonlinear systems, they tracked arbitrary
represent the input torques from HUALEX and human wearer,
reference trajectories [34]. We applied our proposed Variable
respectively. M (θ) is the inertia matrix and a function of θ,
Admittance Controller (VAC) with bounded feedback tech-
C(θ, θ̇) is the Coriolis matrix and a function of θ and θ̇,
nique to minimize tracking error overshoots and undershoots
and G(θ) is a vector of gravitational torques. During human-
during human-powered exoskeleton system’s transitions [35].
exoskeleton system navigation τh is changing according to
The paper is organized as follows: Section II shows the
human intentions. We use a modified sensing technique to
needs for maneuverable human-exoskeleton system and the
control the motion of the human exoskeleton system according
integration of such system. We validate the performance
to the pilot intentions. In order to see the effect of sudden
illness of ordinary admittance control during gait transitions
changing in the motion trajectories, experimental trails are
in Section III. Section IV shows the local regression strategy
conducted for different obstacles avoidance considered the
application on interaction force minimization. The overshoot
relation between obstacles dimension and intention feedback
reduction technique is detailed in section V. Finally, con-
signal specifications.
clusions and some perspective on future uses and further
development of this technique drawn in section VI.
A. The effect of sensors positioning and integration
Even though you chosen the best sensor type for the speci-
II. M ANEUVERABLE H UMAN -E XOSKELETON S YSTEMS
fied application, but for the efficient performance the sensory
The sudden changing in motion trajectories for dynamic system must designed, installed, and positioned properly. The
obstacle avoidance needs modified control system’s response proper placement and functions integration of the sensory
and it’s yet new research field. The HUALEX actuated and system can cause efficient service and performance. The sensor
passively driven DoFs are designed to guarantee the shadow of system designed for optimum feedback signals extraction,
all expected pilot’s maneuvers. As a wearable exoskeleton, the every sensor serve as stand-alone and with other specified
motion range for each DOF of HUALEX is designed according sensor for the mutual feedback signal. The sensor placement
to human kinematics with some slight differences due to and cooperative performance will lead to efficient on-line
3
Instrumented footboard
detection of pilot intentions. The modified sensory system pathes of feedback signal and control signal are different in the
doesn’t has an extreme sensitivity, but the mutual performance real human nervous system, sensor to brain and from brain to
feature lead to the wide dynamic range of operation. The drivers. We developed the human in loop algorithm to control
prediction and estimation of the pilot intentions depends on the exoskeleton system to cover the pilot intended movements
the sensor system set of data. The theory of maneuverable efficiently. The developed algorithm will lead to meet the
coupled human exoskeleton systems has very important value requirements for accurate and efficient tracking of intended
in the emergency applications, military field and industrial pilot motions during system navigation, we try to synchronize
field. The sensed data acquisition system, and useful feedback between sensory system speed (the natural sensing of human
signal extraction method can affect the system response and pilot) and actuated joint movements.
final performance efficiency. Developing of such subsystems The intelligent sensory system developed for speed decision
can lead to good tracking for the different maneuvers in the making, the system estimates the pilot intended movements
real time. based on experimentally predefined thresholds. The developed
sensory system . The sensory system configuration is devel-
B. The effect of sensory communication speed oped considering the human to control the exoskeleton system
and cover all expected pilot intended movements efficiently.
The human nervous system uses what can be approximated
The physical path between transmitters and receivers some-
as pulse frequency modulation (PFM) to transmit information
where is guided (wired) somewhere is unguided (wireless),
through nerves. A PFM signal is a sequence of pulses of nearly
the transmission can be point-to-point, point-to-multipoint or
uniform amplitude and very short duration whose frequency
multipoint-to-point depends on the data rate and the function
carries the signals data. When pulse frequency modulating
of the receivers and transmitters. Wireless transmission system
a continuous signal, information about the original signal is
can successfully serves between sensory subsystems and main
necessarily lost due to the discretized nature of the PFM
controller hanged on backpack, and between main controller
signal; nothing is known about any changes in the continuous
and active joints drivers.
signal until the occurrence of a new pulse. PFM signals
occur in the human nervous system because of the creation
III. P ROPOSED S ENSORS S YSTEM
and propagation of action potentials [41]. However, PFM
signals are rarely used in engineering applications because A. Human-exoskeleton interaction force
they are very inefficient; much of the information contained When plan the motion of maneuverable systems for special
in a continuous signal is lost during the modulation process. missions with sudden changing in input trajectories special
Pulse frequency modulators are also highly nonlinear, and care must drawn and computer simulations must conducted
are therefore not mathematically well defined or understood. first. The intelligent sensory system, data acquisition system
PFM signals are very insensitive to noise, but this seems to and mutual feedback signal extraction system are developed
be their only positive attribute. The brain can fully process for the brain to muscles navigation in human exoskeleton
and output what our eyes see in real-time. The brain activity systems. When the interaction force changed by small amount
changes in a consistent and recognizable way when the general the input track must change by to keep minimum or with
status of the subject changes, as from relaxation to alertness in acceptable threshold. The function of our predictive al-
[42]. The brain uses massive parallel processing to perform gorithm is to determines the input needed to produce the
the equivalent of several billion operations per second. The plants desired performance (minimum interaction force). The
4
Adaptive Central Patterns Generators (ACPGs) system starts cm tread length. The threshold for the interaction force is the
with the input signal of each controlled joint as initial state, normal range for physical human-robot interaction taken as
then a reformation made according to the pilot intended nominal value, When pilot intended to transit from flat terrain
maneuver. The conducted excremental flat walking trails, stairs walking to stair ascent (fi (t))N + 4fi (t). As the interaction
ascent, stairs descent and walking speed changing to built force increases than threshold, the designed algorithm decides
the reference values data base for intended motion decision the hight of the stairs depends on the 4fi (t). So a correction
system. The result of experimental walking trails (5 trails for to the input trajectories will adapt the exoskeleton motion to
different speed) to calculate the HCT is shown in Tab. I. The the pilot’s desired motion. The typical thresholds for the heel
resulted values indicated the proportional relation of walking contact time, interaction force during stairs ascent (170 mm
speed and HCT . According to the walking trails results shown hight). Fast sensing characteristic and high system response
achieved through proposed and developed sensing techniques.
TABLE I
T HE VALUES OF HCT FOR DIFFERENT WALKING SPEED .
B. Dual Reaction Force sensor
Number Walking Speed m/s Heel Contact Time S The new proposed Dual Reaction Force sensor (DRF) sensor
1 1 0.69 ±0.05 composed of Ground Reaction Force (GRF) sensor implanted
2 1.5 0.62 ±0.09 in the exoskeleton footboard and another one for physical
3 1.8 0.54 ±0.08 Human Robot Interaction (pHRI) implanted in pilot shoe as
4 2 0.49 ±0.04 Instrumented Shoe (IS). The walking experimental trails was
5 2.5 0.41 ± 0.07 carried out on the HUALEX to validate our control algorithm.
The wearer with a height of 181 cm and a weight of 70 kg was
asked to walk on the ground. To determine the relationship
in Tab. I, we investigate that for the HCT is vary between
between walking speed and reaction force mismatch during
0.69±0.05 and 0.41±0.07 while walking speed vary between
one gait cycle we demonstrated experimental trails, asking the
1m/s and 2.5m/s. All previous studies are considered the
pilot to speedup and slowdown within the same gait cycle. The
walking speed (m/s) as a product of stride length in meters
normal speed stance phase duration control is 0.92 ± 0.13, for
(SL)and stride frequency in seconds, and it follows that there
fast speed is 0.73 ± 0.06 as measured for normal subject as
is a wealth of information on these parameters during walking.
in [44]. We use DR sensors to identify wearer intention, this
The main walking parameters observed here are unique due to
can be extracted from the mismatch between exoskeleton’s
the novelty in sensory system. The heel contact time HCT over
heel contact time ExoCT and human’s heel contact time hCT
different walking speed, provided in Tab. I showed temporal
within one gait cycle.
gait data for specific pilot. For the future application of such
We consider walking case, the range of considered speeds
sensory system some measures will be considered in our
is from 1m/s to 3m/s. The expected DR sensors signals
future work for adjustable, the extracted feedback features
deviation or mismatch according to pilot intention can be
must be able to correspondingly adjust with respect to various
lead or lag, taking exoskeleton footboard vGRF signal as a
pilot length. The HCT decreases approximately linearly with
reference in case of acceleration intention the peak of shoe
walking speed. The walking trails were carried out at Center
vGRF sensor will lead the peak of footboard FRF sensor
for Robotics, School of Automation, University of Electronic
and vice versa in slow down case as depicted in Fig. 2. The
Science and Technology of China (Chengdu, China). In these
shape of the reaction forces is typical of what is reported
trails, the selected pilot wears HUALEX and walks naturally
in the literature for full walking step, the shape shown in
according to the given speeds as depicted in Tab. I the
Fig. 2 is just for heel strike and it’s considered for mismatch
resulted HCT are registered. The walking speeds range of
detection subsequences. This force has the characteristic of
[1m/s 2.5m/s] is wide enough to evaluate the effectiveness
double hump. The first is related to weight acceptance when
of proposed methodology.
the body’s downward velocity is being arrested as shown in
For the stair ascent or stairs descent cases, the interaction
Fig. 2, the second hump is due to push-off and demonstrates
forces resulted from pilot intentions are collected from the
that the body’s centre of mass is being accelerated upwards to
effective wearable sensors in thigh and shank straps (instru-
increase its upward velocity.
mented straps). The designed force sensor is omni-directional,
to monitor and measure the rear and front interaction forces.
IV. M ARKOV D ECISION P ROCESSES
When the pilot intens to switch from flat terrain walking to
stairs ascent the feedback collected from the front force sensor, Markov Chain (MC) models [46] are used in situations in
for the vice versa action collected from the rear force sensor. which there are a large number of objects which can be in
The thigh and shank interaction forces thresholds for the gait any of several states Si , i = 1, 2, ..., n (or conditions) and
transitions are empirically designated. which move between these. The powerful tool for planning
A higher stair will require you to bend your knees more and classification in the systems transcended by uncertainties
deeply and the greater the amount of knee flexion. The joints presence is Markov Decision Process (MDPs). MDPs defined
flexion for stair ascent are adapted from the work in [43]. through the following objects:
The interaction forces on thigh and shank investigated from • Finite state space Si , i = 1, 2, ..., n
n×n
excremental trails for the stairs with 17cm step height and 28.5 • The Dynamic Uncertainties Matrix λi,j , Λ
5
80
Exo Footboard
Instrumented Shoe
Vertical Ground Reaction Force (%BW) 70
60 Stairs Ascent
50 S1
1,0
40 0 ,1
30
0 ,4 0 ,3
20 Slow Down Flat Walking Speed up
10
S4 S0 S3
0
0 10 20 30 40 50 60 70 80 90 100 2,0 0,2
Gait Cycle %
Fig. 2. The walking speed control sensors system feedback signals expressed
pilot intention for speedup.
Stairs Descent
S2
• Transition probabilities P (Si /Sj ; λi,j )
The key feature is that there is fixed probability Pi,j (tran- Fig. 3. The Markov Chain representation.
sition probability) of moving from any state to any other
state as shown in Fig. 3. The number of states is finite
by the human exoskeleton system operation conditions and low cost monitoring and controlling. SNs require adaptive and
the surface terrain of the operational area and depends on robust methods to address feedback signals overlap, mutual
system applications. The transition probability does not change performance and resource optimization. For the simulation
over time and does not depend on any previous states. The purpose the sensors number is finite and limited (si , i =
probability of different switches or transition depends on 1, 2, 3, 4), the observed quantities are also limited and inde-
the Dynamic Uncertainties Matrix (DUM) let’s define it as pendent of sensors number. The sensors positions according to
λi,j ∈ Λn×n . By default, the set of emissions (system states) the exoskeleton joints have a great sense in the pilot intentions
is n, where n is the number of possible emissions, but you estimation, this mean the designed algorithm is valid for
can choose a different set of numbers or symbols. The system the predefined pilot body dimensions. s1 and s2 are omni-
differential equation is shown in Eqn. 2. directional, so they can give a feedback for turnings left and
right, stairs ascent and stairs descent of pilot intentions. Right
n k
dPS (t)
X
λk,1 PSn (t)
X
λ1,n
now the HUALEX has two DoFs in the hip joint, so we
1 PS1 (t)
dt
i=1
i=1
consider stairs ascent and descent cases. Excremental walking
. .
− . .
= . . (2)
trails are conducted on stairs ascent and descent to determine
. .
. n . .
.
dPS (t) n
n
X
λk,n PSn (t)
X PSn (t) the relationship between joint angles and human-exoskeleton
dt λn,n
i=1 i=1 interaction force during stairs ascent and descent. As reported
Let P be a nxn matrix with coordinates {Pi,j : i, j = in the literature, the connections between exoskeleton and the
1, 2, ..., n}. A random process (X0 , X1 , ...) with finite state pilot near to the active joints through thigh and shank cuffs can
space S = {S1 , S2 , ..., Sn } is said to be a homogeneous measure and monitor the interaction force resulted from the
Markov chain with transition matrix P, if for k all n, all pilot intentions. According to mainly flexion, extension, turn-
i, j ∈ {1, 2, ..., n} and all i0 , ..., in−1 ∈ {1, 2, ..., n} we have: ing right or left Omni-dimensional interaction force sensors
(OIFS) are developed to measure the interaction force resulting
P (Xn+1 = Sj |X0 = Si0 , X1 = Si1 , ..., Xn = Si ) = from the pilot on the exoskeleton. The interaction forces acting
(3)
P (Xn+1 = Sj |Xn = Si ) = Pi,j between exoskeleton and pilot’s body are used as an input of
The future is independent of the past given the present and the data classifier system to predict and decide the next state
the conditions in the Eqn. 4 must satisfied. transition. The timely intention signal classification depends on
n
the current system state and the Dynamic Uncertainties Matrix
Pi,j ≥ 0,
X
Pi,j = 1 f orall i (DUM) value. The transition process sometimes subjected to
(4)
j=1
two phases, as example for transition from flat terrain walking
to stair ascent depending on the 4fi (t) stairs height will be
V. S EQUENTIAL D ECISION M AKING P ROCESS estimated.
The performance of DRF sensors for continuous plantar-
A. Sensors Regrouping force measurement and gait-phase detection during walking
Sensor Networks (SNs) consist of limited number os sensors can produce useful feedback signals for efficient and high re-
mutually worked on-line to monitor and control specified sponse walking speed control. The monitoring of such signals
dynamic process. These sensors cooperate for effective and during experimental walking trails gave us an inspiration of
6
sequence of states k0 , k1 , ..., km such that k0 = i , km = j need to manage the execution of tasks (transitions) to satisfy
, and P (kt , kt+1 ) > 0 for each t = 0, 1, ..., m − 1. States the efficiency, safety and robustness of navigation. Priority
i and j are said to communicate if each is accessible from rule-based method (PRBM) rely on the reception of more
the other. This relation is denoted by i ↔ j. Communication than one feedback signal on the main controller to achieve
is an equivalence relation. In particular, it is transitive if i intended transitions can lead to efficient navigation [52]. This
communicates with j and j communicates with k then i work shows how system performance decreases as number of
communicates with k. In other words, for our system shown states is incremented and transitions are managed well. The
in Fig. 3 from flat terrain walking can transit to acceleration execution of the high priority transition to be achieved first,
then to stair ascent, all these states are accessible from each then the low priority task will take the turn of execution. The
other and any two of them are communicated. The task or states priority decided to be dependent on current state and
movement must be done as a response of the system to the intended transitions. As example from speedy flat walking one
sensors output according to the pilot intention, the response must slowdown and go upstairs, the feedback signals for those
must be bounded to some thresholds to satisfy the smooth and maneuvers are from separate sensors so the DUM will has two
harmony in the system’s navigation. From the simple structure non-zero elements.
of a control system shown in Fig. 6, we’re going to arrange
or schedule the tasks or state transitions to get an optimum VII. M ODEL F ORMULATION
performance. In this section, we are going to describe the whole system
components for efficient transition mechanisms consider the
model in which the system states are finite for simulation
simplification. Furthermore, the actions available at any state
are organized and controlled according to the DUM at any
Output Actuators time during system’s navigation. The whole proposed system
configuration is designed considering the following protocol
for controlling the gait transitions of exoskeleton system:
1. The system states are finite and the transitions are limited.
CPU Input Sensors 2. The expected DUM elements are calculated every time slot
according to the pilot intentions.
3. The thresholds of the interaction forces are dynamic de-
pendant on the current state.
Controller Memory
4. The time slot is chosen considering the maneuverability of
the system and calculation complexity.
5. The stair ascent case is without height specification (in the
future we will estimate stair’s height)
6. The acceleration and slowdown are exactly estimated, so
Fig. 6. The Simple structure of an arbitrary control system. the move and the changing value.
transition control of exoskeleton the VAC algorithm [35] is not 4t, for the vector of inputs si (t), i = 1, 2 the SRNN can be
enough to ensure tracking error. The mal-performance is due described as:
to the considered interaction forces convex during transitions.
For this reason, we design the HNN controller such that it 1
h(t) = σ((si (t) − si (t − ∆t)) ∗ wi,j (t)) + b1i (6)
takes the decision making part on which the VAC algorithm
has shown its limitation. The recurrent hidden layer contains
2
one stage which has zero-values weights, the input of non- λi,j (t) = f (σ, wi,j ) + b2i (7)
effective neuron to the next layer. Here we mean the sensor
si which has no affection on the specified uncertainty λi,j . At each time step 4t, the input vector si (t), i = 1, 2 along
1
with the connecting weight vector wi,j (t), j = 1, 2, are inputs
to the hidden layer to produce a sequence of the hidden state
B. Pilot Intensions Identification h(t), b1i and b2i are the bias of hidden layer and output layer.
1) Single hidden layer Recurrent Neural Network (SRN- Then a prediction output λi,j (t) is learned through the hidden
N): The Recurrent Neural Networks (RNNs) used for data layer sequence and weight vector connecting the hidden layer
classification, considered the past and current data to estimate to the output layer. The iteration equations parameters are
the future values analogous to the human brain. By adjusting estimated via the on line estimation by providing a set of
the weights of an artificial neuron we can obtain the desired
output for specific input values. The feedback loop is a
delay unit with a unity weight. To validate the efficiency 11,1 21,1
b11
of the proposed transitions control methodology we limited
s1 2 2,1 b 21 i , j
the number of states for sake of the simplicity, but in the 110,1
1.5
Lambda(i,j) value
11,1 b11 21,1 1
s1 2 2,1 b 21 i , j
110,1 0.5
1
1,2
21,10 0
s2 1 2 2,10 b22 i , j
10,2
b110 −0.5
Inputs
Output Layer
Hidden Layer −1
30
Training data VAC
2.5 SFNN output Proposed Strategy
20
2
10
1.5
0
1
0.5 −10
0 −20
−0.5
−30
−1
−40
0 0.5 1 1.5 2 2.5 3 3.5 4 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
Time Time (S)
Fig. 10. The performance of SFNN, the case of mutual intention is estimated Fig. 12. The performance of SFNN, the case of mutual intention is estimated
perfectly (pilot intends to change walking speed). perfectly (pilot intends to change walking speed).
15
VAC state network.
Proposed Strategy
In the future work will focus on the system response
10 optimization to minimize the transition error, and transition
time towards form brain to muscles, then HUALEX can moves
5 freely for more complicated missions. Also we’re planning
Interaction Force (N)
R EFERENCES
−10
[11] M. Ka, H. Cheng, T.H. Toan, Q. Jing, “Minimizing Human-Exoskeleton [38] J. Mattingley, S. Boyd, “Real Time Convex Optimization in Signal
Interaction Force Using Compensation for Dynamic Uncertainty Error Processing”, IEEE Signal Processing Magazine, 2010, Vol. 27 , no. 3,
with Adaptive RBF Network”, Journal of Intelligent and Robotic Systems, pp. 50-61.
2015, pp 1-21. [39] C. D. Hoover, G. D. Fulk, K. B. Fite,“Stair Ascent With a Powered
[12] R. Riener, M. Rabuffetti, C. Frigo, “ Stair Ascent and Descent at Transfemoral Prosthesis Under Direct Myoelectric Control”, IEEE/ASME
Different Inclinations”, Gait and Posture, vol. 15, 2002, pp. 32-44. TRANSACTIONS ON MECHATRONICS, vol. 18, no. 3, 2013, pp 1191-
[13] C. D. Hoover, G. D. Fulk, K. B. Fite, “Stair Ascent With a Powered 1200.
Transfemoral Prosthesis Under Direct Myoelectric Control”, IEEE/ASME [40] J. E. Colgate, G. A. Ollinger, M. A. Peshkin, A. Goswami, “A 1-
Transaction on Mechatronics, vol. 18, no. 3, 2013, pp. 1191-1200. DOF Assistive Exoskeleton with Virtual Negative Damping: Effects on
[14] A. N. Amirudina, S. Parasuramanb, A. K. AhmedKhanc, I. Elam- the Kinematic Response of the Lower Limbs”, IEEE/RSJ International
vazuthid, “ Biomechanics of Hip, Knee and Ankle joint loading during Conference on Intelligent Robots and Systems, 2007, pp 1938-1944.
ascent and descent walking”, International Conference on Medical and [41] P. Peretto, “ An Introduction to the Modeling of Neural Networks”,
Rehabilitation Robotics and Instrumentation, 2014, pp. 336-344. Cambridge University Press, 1992.
[15] T. P. Andriacchi , J. O. Galante, R. W. Fermier, “ The Influence of Total [42] J. D. Bronzino, “ Principles of Electroencephalography”, CRC Press,
Knee-replacement Design on Walking and Stair Climbing”, The Journal Florida, 1995.
of Bone and Joint Surgery, vol. 64, no. 1328-35, 1982. [43] R. Riener, M. Rabuffetti, C. Frigo “Stair ascent and descent at different
[16] G. Bergmann, F.Graichen, A. Rohlmann,“Is Staircase Walking a Risk inclinations”, Gait and posture, 2002, vol. 15, pp. 32-44
for the Fixation of Hip Implants?”, Journal of Biomechanics,Vol. 28, no. [44] E. Isakov, H. Burger, J. Krajnik, M. Grgoric, C. Marincek, ”Influence
5, pp. 535-553. of speed on gait parameters and on symmetry in transtibial amputees”,
[17] R. A. Howard, “Dynamic Programming and Markov Process”, MIT Prosthetics and Orthotics International, 1996, vol. 20, pp 153-158.
Press, 1960. [45] J. A. K. Suykens, J. Vandewalle, “Least squares support vector machine
[18] P. Bremaud, “Markov Chains”, Springer, 1999. classifiers”, Neural processing letters, 1999, vol. 9, no. 3, pp. 293-300.
[19] G. Iyengar, “Robust Dynamic Programming”, Math. Oper. Res., vol. 30, [46] K. VG, “Modeling and analysis of stochastic systems”, Champman and
no. 2, 2005, pp. 257-280. Hall, 1996.
[20] S. Mannor, D. Simester, P. Sun, J. Tsitsiklis, “Bias and Variance [47] K. J. Astrom, B. Wittenmark, “ Adaptive control,2nd eddition”, Addison
Approximation in Value Function Estimates”, Management Science, vol. Wesley, Reading, 1995.
52, no. 2, 2007, pp. 308-322, [48] J. Fox, “Nonparametric Simple Regression: Smoothing Scatterplots”,
[21] M. L. Puterman, “ Markov Decision Processes:Discrete Stochastic Smoothing Scatterplots. Thousand Oaks, CA, 2002.
Dynamic Programming” John Wiley and Sons, Inc, New York, 1994. [49] W. S. Cleveland, S. J. Devlin, “Locally weighted regression: An ap-
[22] J. K. Satia, R. E. Lave, “Markovian Decision Processes with Uncertain proach to regression analysis by local fitting”, Journal of the American
Transition Probabilities”, Operations Research, vol. 21, no. 3, 1973. Statistical Association, 1988,vol. 83, pp.596-610.
[23] C. C. White, H. K. Eldeib, “Markov Decision Processes with Imprecise [50] C. Loader, “Local Regression and Likelihood, 3rd edition”,
Transition Probabilities”, Operations Research, vol. 42, no. 4, 1994. Springer,1999.
[24] N. Hogan, “Impedance Control: An Approach to Manipulation”, Amer- [51] G. Ellis, “Control System Design Guide, 3rd eddidtion”, Elsevier
ican Control Conference, 1984, pp. 304-313. Academic Press, 2004.
[25] L. M. Miller, J. Rosen,“Comparison of Multi-Sensor Admittance Control [52] A. Otto, C. Otto,“ How to design effective priority rules: Example of
in Joint Space and Task Space for a Seven Degree of Freedom Upper simple assembly line balancing”, ELSEVIER, Computers and Industrial
Limb Exoskeleton”, Proceedings of the 3rd IEEE RAS and EMBS Engineering, 2014, vol. 69, pp. 4352.
International Conference on Biomedical Robotics and Biomechatronics, [53] J. Z. Li, H. Gao, “Survey on sensor network research”, Journal of
2010, pp 70-75. Computer Research and Development, 2008, vol. 45, pp. 1-15.
[26] V. Okunev, T. Nierhoff, S. Hirche, “Human-preference-based Control [54] J. L. Elman, “Finding structure in time”, Cognitive Science, 1990, vol.
Design: Adaptive Robot Admittance Control for Physical Human-Robot 14, pp. 179-211.
Interaction”, The 21st IEEE International Symposium on Robot and [55] K. Hornik, M. Stinchcombe, H. White, “Multilayer feedforward net-
Human Interactive Communication, 2012, pp 443-448. works are universal approximators”, Neural Networks, 1989, vol. 2, no.
[27] M. G. Carmichael, D. Liu, “Admittance Control Scheme for Imple- 5, pp. 359366.
menting Model-based Assistance-As-Needed on a Robot”, 35th Annual [56] T. P. Chen, H. Chen, “Universal approximation to nonlinear operators
International Conference of the IEEE EMBS, 2013, pp 870-873. by neural networks with arbitrary activation functions and its application
[28] B. K. Lee, H. D. Lee, J. y. Lee, K. Shin, J. S. Han, C. S. Han, to dynamical systems”, IEEE Transactions on Neural Networks, 1995,
“Development of Dynamic Model-based Controller for Upper Limb vol. 6, no. 4, pp. 911917.
Exoskeleton Robot”, 2012 IEEE International Conference on Robotics [57] J. B. Yang, K. Q. Chen, X. P. Li, “Feature selection for MLP neural
and Automation, 2012, pp 3173-3178. network: the use of random permutation of probabilistic outputs”, IEEE
[29] W. Yu, J. Rosen, X. Li, “PID Admittance Control for an Upper Limb Transactions on Neural Networks, 2009, vol. 20, no. 12, pp. 19111922.
Exoskeleton”, American Control Conference , 2011, pp 1124-1129. [58] A. G. Barto, R. S. Sutton, C. J. C. H. Watkins, “ Learning and
[30] F. Augugliaro, R. DAndrea, “Admittance Control for Physical Human- sequential decision making,” Learning and Computational Neuroscience:
Quadrocopter Interaction”, European Control Conference , 2013, pp Foundations of Adaptive Networks, (1990).
1805-1810. [59] R. Bellman, “Applied Dynamic Programming,” Princeton University
[31] M. Oda, C. Zhu, M. Suzuki, X. Luo, H. Watanabe, Y. Yan, “Admittance Press, Princeton, New Jersey, (1957).
Based Control of Wheelchair Typed Omnidirectional Robot for Walking
Support and Power Assistance”, 19th IEEE International Symposium on
Robot and Human Interactive Communication, 2010, pp 159 - 164.
[32] G. Deodhare, M. Vidyasagar, “ Design of Non-overshooting Feedback
Control Systems”, Proceedings of the 29th Conference on Decision and Abusabah I. A. Ahmed Abusabah I. A. Ahmed is
Control, 1990, pp. 1828-1834. an Associate Professor in College of Engineering,
[33] R. D. Hill, A. C. Eberhard, M. E. Halpern, P. J. Brockwell, “Minimiza- Karary University. He received the M.Sc. degree
tion of peak values of signals in feedback systems”, Proceedings of the and Ph. D. degree in Electronic Engineering from
35th Conference on Decision and Control, 1996, pp. 3212-3217. University of Electronic Science and Technology of
[34] M. Krstic, M. Bement, “Non-overshooting Control of Strict-Feedback China in 2013 and 2017. His research area includes
Nonlinear Systems”, American Control Conference, 2007, pp. 4494-4499. Robotics Control, Predictive and Adaptive Control
[35] A. I. A. Ahmed, H. Cheng, X. Lin, R. Huang, “Motion Planing and Neural Networks. Dr. Abusabah I. A. Ahmed
and Control of Maneuverable Human-Powered Exoskeleton Systems”, has 18 academic publications and he is a reviewer
IEEE/RSJ International Conference on Intelligent Robots and Systems for two journals and many conferences. Now he is
(IROS): Workshop, 2016. teaching Neural Networks and Embedded Systems
[36] K. J. Astrom, R. M. Murray, “Feedback Systems, An Introduction for design for graduate students, and Control Theory, Automatic Control, Modern
Scientists and Engineers”, Princeton University Press, 2008 Control, Optimal Control and Electronic Measurements for junior students at
[37] R. D. Hill, M. E. Halpern, “Minimization of Signed Peak and Absolute Karary University.
Peak Values of Signals in Feedback Systems”, Singapore International
Conference on Intelligent Control and Instrumentation, 1992, pp. 48-53.
12
Samah H. H. Mohammed Khir Samah H. H. from Karary University in 2020. Her master research
Mohammed Khir is a dean of college of Engineering includes power system stability managing and con-
and Architecture, Shendi University since 2020. She trol approach of doubly Fed Induction Machine for
got her BSc. Degree from Nile Valley University stable operation. Now she is a PhD student at Karary
in Electrical Power Engineering in 2017and worked University. Her current research includes Hybrid Renewable Energy System
as teacher assistant at Shendi University. She got Stability Investigation. Also, the application of Back? to Back Converters in
her master degree in Electrical Engineering, Power both rotary side and Gride Side of Grid Tied Hybrid Renewable power system.