The UMass Mobile Manipulator UMan:
An Experimental Platform for Autonomous Mobile Manipulation
Dov Katz, Emily Horrell, Yuandong Yang, Brendan Bums, Thomas Buckley
Anna Grishkan, Volodymyr Zhylkovskyy, Oliver Brock, Erik Learned-Miller
Department of Computer Science
University of Massachusetts Amherst
Abstract — Research in Autonomous Mobile Manipulation crit-
ically depends on the availability of adequate experimental
platforms. In this paper, we describe an ongoing effort at the
University of Massachusetts Amherst to construct a hardware
platform with redundant kinematic degrees of freedom, a com-
prehensive sensor suite, and significant end-effector capabilities
for manipulation. In our research, we pursue an end-effector
centric view of autonomous mobile manipulation. In support
of this view, we are developing a comprehensive software suite
to provide a high level of competency in robot control and
perception. This software suite is based on a multi-objective, task-
level motion control framework. We use this control framework
to integrate a variety of motion capabilities, including task-
based force or position control of the end-effector, collision-free
global motion for the entire mobile manipulator, and mapping
and navigation for the mobile base. We also discuss our efforts
in developing perception capabilities targeted to problems in
autonomous mobile manipulation. Preliminary experiments on
our UMass Mobile Manipulator (UMan) are presented.
I. Introduction
Autonomous robots are beginning to address real-world
tasks in unstructured and dynamic environments. Today these
robots predominantly perform tasks based on mobility How-
ever, the potential of augmenting autonomous mobility with
dexterous manipulation skills is significant and has numerous
important applications, ranging from in-orbit satellite servicing
to elderly care and flexible manufacturing [3]. The successful
deployment of autonomous robots that combine manipula-
tion skills with mobility, however, still requires substantial
scientific advances in a variety of research areas, including
autonomous manipulation in unstructured environments, per-
ception, and system integration. In support of research in
these areas, we are developing and constructing the mobile
manipulation platform UMan (UMass Mobile Manipulator).
In this paper, we discuss the objectives for building such a
platform, describe our approach of combining software and
hardware to provide an adequate level of competency for
research, and report on progress with preliminary research
initiatives.
A platform adequate for research in autonomous mobile
manipulation has to combine mobility with dexterous manip-
ulation. A manipulator with a dexterous end-effector allows
complex interactions with objects in the environment. And
mobility extends the workspace of the manipulator, posing
new challenges by permitting the robot to operate in unstruc-
tured environments. In such environments it is impossible to
anticipate all scenarios and to pre-program the behavior of
the robot. Manipulation in combination with mobility thus
requires algorithmic approaches that are versatile and general.
The variability and complexity of unstructured environments
also require algorithmic approaches that permit the robot to
continuously improve its skills from its interactions with the
environment. Furthermore, the combination of mobility and
manipulation poses new challenges for perception as well as
the integration of skills and behaviors over a wide range of
spatial and temporal scales.
Fig. 1. UMan, the UMass Mobile Manipulator
The mobile manipulation platform UMan, shown in Fig-
ure 1, has been designed to support our research in algo-
rithms and control for autonomous mobile manipulation. Due
to the focus on dexterous manipulation, UMan's ability to
perform physical work in its environment was of particular
importance during the design process. Our objective was to
build a hardware platform with redundant kinematic degrees
of freedom, a comprehensive sensor suite, and significant
end-effector capabilities for manipulation. Since the focus of
our research is on algorithms and not on hardware design,
we chose to use mostly off-the-shelf components in UMan's
construction.
UMan differs from related robotic platforms (see Sec-
tion II) because of our end-effector centric, integrated view
of hardware and software infrastructure. This combination of
hardware and software results in an experimental platform
that permits researchers to focus solely on the specification
of end-effector behavior without having to worry about the
motion of the remaining degrees of freedom. Furthermore,
UMan is one of the few platforms currently available that
combines mobility with dexterous manipulation capabilities.
Most comparable platforms are limited in either dexterity or
mobility. A more detailed review of related hardware platforms
will be presented in the next section.
II. Related Platforms
Most current robotic platforms focus on one of two com-
plementary aspects of autonomous mobile manipulation: bi-
manual dexterous manipulation or bipedal mobility.
Bi-manual robots consist of a torso, two arms and dex-
terous hands. Examples from this category include UMass's
Dexter [8], a bi-manual robot consisting of two commercial
seven degree-of- freedom arms (Barrett's WAM [24]) and
three-fingered hands. Dexter' s arms are stiff with negligible
inherent compliance, requiring active control of compliance
using accurate force sensing. Its hands have four degrees
of freedom each, and can afford various dexterous tasks. In
contrast to Dexter, MIT's bi-manual robot Domo [6] employs
series elastic actuators in its two arms, providing inherent com-
pliance for safe interactions with objects in the environment.
NASA's Robonaut [2], a humanoid torso designed to assist or
replace astronauts, closely imitates the kinematics of human
arms and hands. Robonaut has performed complex dextrous
manipulation tasks, resembling tasks performed by astronauts
in space. Other examples of a bi-manual manipulation platform
include Clara at the Technical University Munich and Justin
at DLR in Germany. These stationary, bi-manual platforms
are designed for dexterous manipulation and benefit from
having two arms that can cooperate in manipulation tasks.
However, these platforms have limited workspace and cannot
be deployed in large unstructured environments.
WABIAN RIII [26], and the Toyota Partner Robot [25]. All
of these platforms have brought about significant advances
in bipedal locomotion and mechanical design, but have had
limited impact in the area of autonomous mobile manipulation.
Fig. 2. Bi-manual robots: Dexter (UMass Amherst), Clara (Technical
University Munich), Domo (MIT), Robonaut (JSC, NASA), Justin (DLR,
Germany)
The development of legged robots, such as Honda's
ASIMO [17], has focused on issues of bipedal locomotion.
Legs seem to be better suited than wheels for locomotion
in human environments. Due to the focus on locomotion,
the manipulation capabilities of these platforms are often
limited. For example, ASIMO 's hands only possess a single
degree of freedom and are not able to perform complex
manipulation tasks. Other examples are Sony's QRIO [19],
Kwada Industries' HRP2 [12], Waseda University Tokyo's
Fig. 3. Examples of legged humanoid robots: Asimo (Honda), QRIO (Sony),
HRP-3 (Kawada Industries), WABIAN RIII (Waseda University Tokyo),
Toyota Partner Robot
Ideally, the advances in bi-manual manipulation and lo-
comotion could be combined to provide competent experi-
mental platforms for autonomous mobile manipulation. Such
platforms could navigate in complex, human indoor and out-
door environments to perform dexterous manipulation tasks.
Unfortunately, such integration is not straight-forward. Most
commercial manipulators are heavy, large, and power hungry.
Consequently, highly mobile robots, such as legged mobility
platforms, cannot easily accommodate dexterous manipulators.
To overcome this challenge, most existing mobile manipula-
tion platforms rely on less flexible mobility solutions. One
of the earliest mobile manipulators, the Stanford Assistant
Mobile Manipulation (SAMM) [10] consists of a holonomic
Nomadic XR4000 base and a PUMA 560 manipulator arm,
equipped with a parallel-jaw gripper. The mobile manipulation
platform ARMAR [21] at the University of Karlsruhe consists
of two arms mounted on a wheeled mobile base, two parallel-
jaw grippers, and a stereo vision head. The Stanford AI
Robot (STAIR) uses a modified Segway RMP in tractor mode
for mobility; a five degree-of-freedom manipulator with a
parallel-jaw gripper provides manipulation capabilities. Other
examples of mobile manipulators can be found at Orebro
University (PANDIl), at the Centre for Autonomous Systems
at the Royal Institute of Technology, and at the Robotics and
Automation Laboratory at Michigan State University.
III. Hardware Platform
UMan consists of a holonomic mobile base with three
degrees of freedom, a seven degree-of-freedom manipulator
arm, and a four degree-of-freedom hand.
Our main objective was to support dexterous manipulation
on a mobile platform. To achieve this objective for a single,
integrated system, it was not possible to optimize every aspect
of the platform; instead, we had to carefully balance different
constraints. We strove to provide adequate end-effector ca-
pabilities for a wide range of dexterous manipulation tasks.
We considered mobility as additional degrees of freedom in
service to manipulation, rather than as an objective itself. We
therefore choose a mode of mobility that maximally supports
manipulation without imposing additional constraints.
Fig. 4. Examples of mobile manipulators: ARMAR (University of Karlsruhe),
R\NDI1 (Orebro University), SAMM (Stanford University)
perform our research. Finally, the base is sized to be able to
contain adequate computational resources (see Section III-B)
and sensors (see Section III-C).
A Barrett Technologies Whole Arm Manipulator (WAM)
with seven anthropomorphic degrees of freedom (three in the
shoulder, one in the elbow, three in the wrist, see figure 5)
together with the three-fingered Barrett hand provide UMan's
dexterous manipulation capabilities. The new lightweight de-
sign of the WAM has very low power consumption and is
thus well-suited for mobile applications. All electronics for
the control of the arm are built into the arm itself, facilitating
its integration with a mobile platform. The WAM provides
good dynamic performance and torque sensing in its joints.
It is thus capable of using all of its links to perform force-
controlled manipulation tasks.
At the same time, we focused on the use of commercially
available components. This facilitates recreating our results,
as well as encourages standardization in robotics hardware.
We justify this choice with the following observation: NASA
researchers at JSC were able to perform dexterous manip-
ulation tasks by teleoperating Robonaut. In particular, they
teleoperated Robonaut to perform complex maintenance tasks
on a model of the Hubble space telescope [1]. These ex-
periments demonstrate that existing hardware is in principle
capable of performing complex dexterous manipulation tasks.
Consequently, we believe that algorithmic aspects currently
represent the most important challenge in autonomous mobile
manipulation.
A. Actuation
UMan's mobility is provided by a modified Nomadic
XR4000 mobile base. Its four casters are dynamically de-
coupled [5] to provide holonomic motion, which facilitates
a unified control scheme for degrees of freedom associated
with mobility and manipulation. This unified control scheme
leverages all degrees of freedom to be able to exploit kinematic
redundancies when performing multi-objective behavior (see
Section IV). Other modes of mobility, such as legged locomo-
tion or dynamically stable mobility platforms (Segway RMP),
result in additional challenges pertaining to the coordination of
mobility and manipulation. While these challenges represent
interesting research problems by themselves, our objective
was to build a hardware platform that maximally facilitates
research in autonomous manipulation. Therefore, we chose a
holonomic mobility platform that supports the coordination of
mobility and manipulation.
The XR4000 mobile platform was specifically designed for
mobile manipulation. Its power system allows untethered op-
eration of UMan for several hours. The wheels and the frame
of the mobile base are designed to be very stiff and damped
so that accurate position control of the mobile manipulator
is feasible. This advantageous property reduces the ability to
navigate on uneven terrain, a cost we were willing to pay
since we do not think it fundamentally limits our ability to
Fig. 5. Barrett Technologies Whole Arm Manipulator (WAM)
The three-fingered Barrett hand (see figure 6) can flex any of
its three-link fingers individually. A fourth degree of freedom
in the hand permits switching between an enveloping grasp
to grasp with an opposing thumb. While this hand is clearly
less dexterous than the human hand, it provides significantly
more dexterity than the parallel-jaw grippers that can be found
on most mobile manipulators today. Robonaut mounted on a
Segway RMP represents a notable exception.
Fig. 6. Barrett Technologies three-fingered hand
B. Computation
UMan's mobile base houses two single-board PCs with
Pentium 4 2.4GHz CPUs (see figure 7). One of these PCs is
dedicated to real-time control of the base and the manipulator
arm. It is running the real-time operating system QNX [16].
The other PC is running Linux and is dedicated to higher-
level tasks, such as vision processing, navigation, motion
planning, and task planning. Both computers are connected
via a dedicated wired Ethernet link. Wireless Ethernet connects
both computers to the off-board computing resources. Should
additional on-board resources become necessary, we could
place laptops inside the cargo bays of the XR4000 mobile
base.
Fig. 7. UMan's interior
C. Sensing
We view sensing as one of the most critical and under-
explored aspects in autonomous mobile manipulation. Conse-
quently, we are equipping UMan with a rich suite of sensors.
UMan can navigate in the environment using an onboard
SICK LMS200 laser range finder. The SICK's field of view
is 180 degrees, with an angular resolution of 1 degree and
a distance resolution of 10mm with a range of 80m. A 180
degree scan of the environment can be completed 75 times per
second.
Visual input is received from a Sony XCD710CR color
camera mounted on the wrist. The camera has an IEEE- 1394
(Firewire) interface, and was chosen for its superior image
quality. It can produce 30 frames per second at a resolution
of 1024 by 768 pixels, and has many features (i.e. partial
scans, various resolutions and color encodings). By controlling
the position and orientation of its arm, UMan can collect
visual data and generate 3D images far beyond the capabilities
of a simple stationary pan-tilt vision system. Despite the
complication in linking the vision system and the manipulator,
this combination is able to look behind obstructions in the field
of view and can easily generate multiple views of the same
object. In the future, more cameras will be installed to further
enhance UMan's visual perception.
We plan to mount an additional laser range finder (Hokuyo's
URG-04LX [9]) on UMan's wrist. Hokuyo's sensor is rel-
atively light (160g), compact (L50xW50xH70mm), and has
low power requirements (2.5W). Those characteristics make
it suitable to be carried on a robotic arm. Moreover, the
URG-04LX has a 240 degree field of view, and provides an
accuracy of 10mm with an angular resolution of 0.36 degrees.
Complementing our vision sensors with a movable laser range
finder will allow us to perform sensor fusion of complementary
information about the environment.
Tactile sensing is the heart of dexterous manipulation.
Unfortunately, robotic arms with inherent force compliance
are not commercially available. The lack of force compliance
makes the safe operation of the manipulator arm more difficult
and calls for a more complex control architecture. Manipulator
arms with inherent force compliance exist. However, since our
research focuses on algorithmic aspects, we rely on off-the-
shelf hardware and use two sets of force sensors in conjunction
with precise position feedback from the arm/hand to achieve
compliant control. The first set of sensors, mounted on the
fingertips of the Barrett hand, consists of three ATI Nanol7 6-
axis force/torque sensors. The ATI Nanol7 has a resolution of
1/1 60N, for forces varying between -12N to -f12N. For torques
varying between -120Nmm to -i-120Nmm, it has a resolution
of l/32Nmm. The second sensor, mounted on the wrist, is
an ATI Gamma 6- axis force/torque sensor. This sensor has
a resolution of 1/640N for forces varying between -32N to
-f32N. For torques varying between -2.5Nm to -F2.5Nm, it has
a resolution of 1/lOOONm.
UMan's wireless network adapter (see Section III-B) al-
lows it to benefit from off -board sensing to complement its
onboard sensors. When operating in sensor-enabled environ-
ments, UMan can collect data from various external sources
and use it to increase its knowledge and understanding of the
environment in which it operates.
IV. Multi-objective Task-level Control
Our research in autonomous mobile manipulation is primar-
ily concerned with interactions between UMan's end-effector
and the environment. To facilitate this research, we chose
an end-effector-centric control framework. The operational
space framework for robot control [13] allows us to define
manipulation tasks in terms of end-effector motion and forces,
thus abstracting from the kinematics and dynamics of the
underlying robot mechanism. At the same time, it does not
prevent us from exploiting the mechanism's kinematic and
dynamic characteristics through other methods of control to
achieve desired performance metrics. Moreover, this frame-
work allows treating several different parts of the mechanism
as end-effector, and prioritizing the tasks addressed by each
end-effector. In this section we describe the task-level control
scheme implemented on UMan and our method of combining
several behaviors to exploit UMan's redundancy.
Task-level control [13] is a convenient and powerful method
of generating multi- objective behavior for robotic systems.
Rather than specifying joint trajectories, this framework
permits direct control of the manipulator's end-effectors,
greatly facilitating programming for kinematically redundant
robots, such as UMan. Task-level control also permits the
task-consistent execution of subordinate behaviors, exploiting
nullspace projections. Given an end-effector task, expressed
as a force/torque vector Ftask acting on the end-effector, and
given an arbitrary subordinate behavior, expressed as a vector
of joint torques Fq, we can determine the torque F to achieve
task and subordinate behavior as follows:
r = Jtask(^)FtL + ^tL(^)ro,
(1)
where N^^y^ represents a projection into the nullspace of the
end-effector Jacobian Jtask- This projection ensures that the
subordinate behavior will not alter task behavior, i.e., it will
result in task-consistent motion.
This principle of nullspace projections can be extended to
cascade an arbitrary number of hierarchical behaviors [18]. If
behavior i results in torque F^, the torque
T = ri+Nl{q) (r2 + iVj(g) (Fs+Niiq) (...))) (2)
combines these behaviors in such a way that behavior i
does not affect behavior j if i > j. In equation 2, N^ is
the nullspace projection associated with the task Jacobian of
behavior i. Here, we adopt the more compact notation of the
control basis [11] to describe such cascaded nullspaces. We
associate a control primitive (j)i with each torque F^. If a
control primitive (j)i is executed in the nullspace of the control
primitive (j)j, we say that (j)i is performed subject to (j)j, written
as (j)i < 6.
We can now rewrite equation 2 as
. . . < 03 < 02 < 01 .
(3)
This task-level framework with nullspace projections serves as
the underlying control scheme for UMan. Using this approach,
other research groups have already demonstrated sophisticated
multi-objective behavior on simulated humanoid robots [18].
Our research effort based on this control framework will
focus on the control primitives themselves as well as on their
automatic composition and sequencing to achieve robust, au-
tonomous manipulation behavior in unstructured and changing
environments.
V. Global Planning and Control of Robot Motion
The task-level control framework described in the previous
section is well- suited for the specification of task-based forces
and position constraints at the end-effector. However, in the
context of autonomous mobile manipulation, end-effector tasks
have to be performed in dynamic and unstructured environ-
ments. In such environments, obstacles can interfere with task
execution. The task-level control framework is thus not suf-
ficient to provide a complete abstraction for the end-effector.
In addition to the control-based framework described in the
previous section, our platform also has to include the ability
to perform globally task-consistent motion for manipulation
tasks, i.e., motion that fulfills the position and force constraints
imposed by the task as well as the global motion constraints
imposed by obstacles in the environment.
We have developed the elastic roadmap framework [27] to
augment the task-level control framework from Section IV
with the ability to perform globally task-consistent and
collision-free motion. The elastic roadmap framework trans-
lates global workspace connectivity information into a series
of potential functions. These potential functions collectively
represent an approximated navigation function for the robot's
task. The navigation function in turn represents a global
motion plan that is capable of responding to feedback from the
environment. The motion plan integrates seamlessly with our
task-level control framework. By combining the global plan
with the control primitives that generate task-behavior, task
constraints as well as global obstacle avoidance constraints
can be maintained throughout the execution of a task, even in
unstructured and dynamic environments.
More detail about the elastic roadmap framework is pro-
vided in our RSS 2006 paper [27]. In Section VI-B, we show
the simulation results from that paper for motion generation
based on the elastic roadmap framework. We are currently
implementing the elastic roadmap framework on UMan and
plan to duplicate the simulation results on the real platform.
Moving from simulation to the real world introduces new
difficulties. Physical sensors, for once, are bound to be limited
and inaccurate. Consequently, addressing the integration is
an important aspect of motion generation in this application
domain.
The multi-objective control scheme described in Section IV
also facilitates the inclusion of readily available software for
mobile robot navigation. We choose to integrate the SLAM
package CARMEN [4], which can build a map of the robot's
environment using laser scans, and can navigate the robot to
a desired position while avoiding obstacles [23], [22].
VI. Preliminary Results
A. Hardware Integration
One of the most important challenges in constructing an
experimental platform for autonomous mobile manipulation
is the integration of heterogeneous hardware and software
components [3]. In this section we describe UMan's evolution
during the hardware integration phase.
UMan is constructed from off-the-shelf components. The
first major milestone for platform integration was the simul-
taneous real-time control of the XR4000 mobile base and the
Barrett WAM with the system running on battery power. For
this test, we had UMan follow a circular trajectory on the
floor while moving its arm through its range of motion. The
circular trajectory is performed by the base without changing
orientation, demonstrating its holonomic motion capabilities
(see http://www.cs.umass.edu/~dubik for a video). We have
implemented joint space control and operational space control
at rates up to IKHz. Operational space control will permit the
exploitation of the mechanisms kinematic redundancy using
multi-objective control based on nullspace composition (see
Section IV).
In the next step, we began the integration of the sensor suite
and in particular the linking of the robot arm with our vision
camera to perform visual servoing (see Figure 8). The camera
(mounted on the wrist) provides color images to a simple color
tracking algorithm. The algorithm requested arm movements
(and therefore camera movements) to center a blue ball on
the camera's image plane. The control code was executed
in at high frequency on the real-time PC, while the vision
processing was executed at a much lower frequency on the
Linux-based PC. This simple test will be the basis of future
visual servoing tasks. As discussed in section IV, this task will
be performed in parallel with other tasks, by taking advantage
of UMan's kinematic redundancy.
Fig. 8. Visual servoing: Sony XCD710CR camera mounted on UMan's
wrist, UMan's arm tracking the blue ball in an attempt to center it in the
image plane, the velocity vector commanded by the vision system
B. Elastic Roadmaps
The elastic roadmap framework has been implemented and
tested in simulation [27]. Here, we report only on the high-
level results to illustrate how the elastic roadmap framework
is capable of supporting our end-effector-centric view in the
context of autonomous mobile manipulation.
Most work in global motion planning only considers end-
effector placement tasks. These tasks are specified by an initial
and a final configuration for the entire manipulator. Constraints
that need to be maintained during the motion itself are rarely
considered. In our end-effector-centric view of autonomous
mobile manipulation, it oftentimes is sufficient to consider
only the end-effector placement, irrespective of the manip-
ulator configuration that achieves that placement. In addition,
a large number of tasks in autonomous mobile manipulation
require that end-effector constraints be maintained throughout
the entire motion. All of these requirements are met by the
elastic roadmap framework. In all three experiments shown in
Figure 9, the robot is performing a task that consists of end-
effector placement in conjunction with task-constraints that
need to be maintained throughout the placement motion. The
experiments show that the elastic roadmap framework is able
to achieve global real-time planning and replanning for our
mobile manipulation platform, even in dynamic environments.
VII. Future Research
Autonomous mobile manipulation is a relatively young
discipline within robotics. It combines a wide variety of
research areas, ranging from force control to mechanism
design to computer vision. A workshop on autonomous mobile
manipulation brought together researchers from the field to
identify a roadmap for research in this area [3]. Here, we
briefly describe some of the areas where we will focus our
efforts.
A. Perception
Vision systems have been used to augment the abilities of
both manipulators and mobile platforms since their inception.
However, there are many fundamental capabilities which could
benefit dramatically from a bidirectional flow of information
between vision and manipulation, and which are still relatively
unexplored.
Visually analyzing the environment is a precondition for
successful manipulation. More surprising perhaps is the effect
that manipulation can have on visual perception. For example,
the difficulty of recognizing an object in computer vision often
stems from the pose, or specific orientation, of an object.
Moreover, specularities (illusions caused by highly reflective
surfaces) can fool an observer into misinterpreting the vi-
sual data. Generating more data about the scene by simple
manipulation (e.g., poking an object of interest, see [7]) can
dramatically improve the performance of the visual system.
The correlation between manipulation and visual perception
is striking. However, very little research has taken place on
the interaction between the two, probably because of the
rarity of having manipulation, mobility, and vision on the
same platform. We intend to endow UMan with manipula-
tion capabilities such as turning a light switch to effect the
lighting conditions, and moving objects around to gain new
perspective.
B. Manipulation
We plan to develop elementary manipulation skills to allow
UMan to interact with its environment in interesting ways.
One such skill is grasping. Grasping has traditionally been
performed by a rigorous analysis of grasp geometry [14]
or by haptically probing the grasped object until form/force
closure is reached [15]. However, many interesting objects
have been designed to be grasped by humans. We will develop
so-called "ballistic" or "optimistic" grasp controllers. These
controllers exploit visual clues obtained from object features
to instantiate various parameters of relatively low-feedback
grasping behavior. During the execution of such behavior,
visual and haptic feedback can be used to transition between
reaching and grasping behaviors, for example, or to detect
failure. Our grasp controllers are called optimistic, because
they assume that successful grasping behavior can be initiated
based on perceived object features. This stands in contrast with
approaches that employ elaborate analysis to ensure success.
Instead, these behaviors leverage relevant properties of objects
in the environment.
C. Tool Use
Tool use represents one of the hallmarks of human and
animal intelligence. Creating a robot that can use a known
tool, learn how to use new tools, and choose the best tool
for a given task is an important and challenging problem
in autonomous mobile manipulation [20]. We will develop
methods that enable UMan to extract kinematic models of
objects in the environment. These methods will rely on a multi-
modal stream of sensor information, including exploratory
I • •
la)
Side View of 1 a)
1e)
■##■■"%■■
I
1c)
#
1
Id)
■ '
-Object
m
2a)
2b)
Object
Trajectory
•%
2c)
Fig. 9. Globally task-consistent motion with the elastic roadmap framework. The task is specified solely in terms of end-effector behavior and the elastic
roadmap framework ensures that collisions are avoided and that task behavior is resumed, should it be interrupted by changes in the environment. In all of
the figures, fighter versions of the robot represent milestones of the elastic roadmap that are part of the current solution paths. The connectivity of these
milestones is indicated by a dashed line. The darker robots represent the actual robot in motion. The direction of motion of obstacles is indicated by arrows.
Experiment 1: Images la) and le) show two perspectives of the same scene. The robot follows the line with its end-effector, while moving obstacles invalidate
the solution. The elastic roadmap framework repeatedly generates global, task-consistent motion plans until the robot reaches the goal. Experiment 2: The
task consists of following an object moving on an unknown trajectory. The following task is achieved based on force control. Moving obstacles force the
robot to suspend the force control task, loosing contact with the object. The elastic roadmap framework computes a path to re-attain the force control task,
shown in image 2c). This image also shows the trajectory taken by the object and its projection onto the floor. Experiment 3: A stationary robot, operating
under a moving truss, reaches for a goal location while maintaining a constant orientation with its end-effector. The sequence of images shows how the goal
can be reached. Continued motion by the truss will repeatedly force the robot to move away from the goal location to avoid collision. The elastic roadmap
framework repeatedly generates motions such as those shown in images 3a)-3c) to re-attain the goal location.
behavior based on force-compliant motion primitives. The
ability to obtain a kinematic and dynamic model of objects in
the environment, we believe, represents one of the necessary
preconditions for learning the function of tools and objects,
learning how to use them, and learning how objects and tools
can be employed in new situations.
Initially, we intend to employ the basic gasping skills
described in the previous section, together with exploratory
behavior, to obtain the kinematic models of a variety of door
handles from UMan's multi-modal sensor stream. We regard
door handles as a very simple type of tool, since it only
possesses a single degree of freedom (in most cases) and
is affixed to the environment. We hope to generalize these
methods to other tools and objects, such as pliers, drawers,
doors, latches, lids, and later to more complicated mechanisms,
such as those found in mechanical vises.
VIII. Conclusion
Autonomous mobile manipulation in many ways can be
viewed as a "Grand Challenge" for robotics and related
disciplines. The successful deployment of autonomous robots
to perform complex manipulation tasks in unstructured envi-
ronments will not only require significant advances in a variety
of areas of robotics and computer vision, it will also require
that these new technologies be integrated into a single, robust,
and autonomous robotic platform. In this paper, we described
our efforts towards building such a mobile manipulation
platform. Our mobile manipulator, UMan, was designed to
support research in autonomous mobile manipulation. We have
combined a number of off-the-shelf components to construct
a hardware platform with redundant kinematic degrees of
freedom, a comprehensive sensor suite, and significant end-
effector capabilities for manipulation.
We believe that autonomous mobile manipulation requires
tight integration of a broad range of capabilities and skills.
Thus, our efforts to provide a comprehensive software platform
that enables competent, robust, and autonomous task execution
on our mobile manipulator. This software suite includes the
elastic roadmap approach, an efficient approach to globally
task-consistent motion that combines multi- objective, task-
level control with real-time computation of global motion
plans. We reported on our efforts to integrate one of the freely
available software packages for mobile robot mapping and
navigation with our multi-objective, task-level control scheme.
In addition, we described our initial attempts to develop
perceptual capabilities that are tailored to the requirements
of autonomous mobile manipulation. Finally, we gave an
overview of planned research activities to be supported by
our combined hardware and software platform UMan.
Acknowledgments
This work is supported in part by the National Science
Foundation (NSF) under grants CNS-0454074, IIS-0545934,
IIS-0546666, and CNS-0552319, and by QNX Software Sys-
tems Ltd. in the form of a software grant. We are grateful for
this support. The authors are greatly indebted to Bob Holmberg
for providing software for the control of the XR4000. Also, we
are very grateful for Bob's invaluable advice and help during
many stages of this project.
References
[1] R. Ambrose. Experiments with a teleoperated robonaut on a model of
the Hubble space telescope. Personal communication, 2005.
[2] R. O. Ambrose, S. R. Askew, W. Bluethmann, , and M. A. Diftler.
Robonaut: NASA's space humanoid. IEEE Intelligent Systems and
Applications, Special Issue on Humanoids, 15(4):57-63, 2000.
[3] O. Brock and R. Grupen. Final report for the NSF/NASA Work-
shop on Autonomous Mobile Manipulation (AMM). http://www-
robotics.cs.umass.edu/amm/results.html, November 2005.
[4] CARMEN, http://carmen.sourceforge.net/.
[5] K.-S. Chang, R. Holmberg, and O. Khatib. The augmented object
model: Cooperative manipluation and parallel mechanism dynamics.
In Proceedings of the IEEE International Conference on Robotics and
Automation (ICRA), pages 470-475, San Francisco, USA, 2000.
[6] A. Edsinger-Gonzales. Design of a compliant and force sensing hand
for a humanoid robot. In Proceedings of the International Conference
on Intelligent Manipulation and Grasping, 2004.
[7] R Fitzpatrick and G. Metta. Grounding vision through experimental
manipulation. Philosophical Transactions of the Royal Society: Math-
ematical, Physical, and Engineering Sciences, 361(181 1):2165-2185,
2003.
[8] L. M. G. Goncalves, R. Grupen, A. A. F. Oliveira, D. Wheeler, and
A. Fagg. Attention and categorization: Basis for cognition in a humanoid
robot. IEEE Intelligent Systems and Applications, Special Issue on
Humanoids, 2000.
[9] Hokuyo. http://www.hokuyo-aut.jp/products/urg/urg.htm.
[10] R. Holmberg and O. Khatib. Development and control of a holonomic
mobile robot for mobile manipulation tasks. International Journal of
Robotics Research, 19(11):1066-1074, 2000.
[11] M. Huber and R. A. Grupen. A feedback control structure for on-hne
learning tasks. Robotics and Autonomous Systems, 22(3-4):303-315,
1997.
[12] Kawada Industries, http://www.kawada.co.jp/global/ams/hrp_2.html.
[13] O. Khatib. A unified approach to motion and force control of robot
manipulators: The operational space formulation. International Journal
of Robotics and Automation, 3(l):43-53, 1987.
[14] M. T. Mason. Mechanics of Robotic Manipulation. MIT Press, 2001.
[15] R. Piatt, A. H. Fagg, and R. A. Grupen. NuUspace composition of
control laws for grasping. In Proceedings of the lEEE/RSJ Interna-
tional Conference on Intelligent Robots and Systems (IROS), Lausanne,
Switzerland, 2002.
[16] QNX. http://www.qnx.com.
[17] Y. Sakagami, R. Watanabe, C. Aoyama, S. Matsunaga, N. Higaki, and
K. Fujimura. The intelligent ASIMO: system overview and integration.
In Proceedings of the lEEE/RSJ International Conference on Intelligent
Robots and Systems (IROS), volume 3, pages 2478-2483, 2002.
[18] L. Sentis and O. Khatib. Synthesis of whole-body behaviors through
hierarchical control of behavioral primitives. International Journal of
Humanoid Robots, 2(4):505-518, 2005.
[19] Sony. http://www.sony.net/SonyInfo/QRIO/.
[20] A. Stoytchev. Behavior-grounded representation of tool affordances. In
Video Proceedings of the IEEE International Conference on Robotics
and Automation (ICRA), 2005.
[21] K. B. Tamim Asfour and R. Dillmann. The humanoid robot armar: De-
sign and control. In Proceedings of the IEEE International Conference
on Humanoid Robots, 2000.
[22] S. Thrun. Robotic mapping: A survey. In G. Lakemeyer and B. Nebel,
editors. Exploring Artificial Intelligence in the New Millenium. Morgan
Kaufmann, 2002.
[23] S. Thrun, D. Fox, W. Burgard, and F. Dellaert. Robust monte carlo
localization for mobile robots. Artificial Intelligence, 128(1-2), 2000.
[24] W. T. Townsend and J. K. Salisbury. Mechanical design for whole-arm
manipulation. Robots and biological systems: towards a new bionics?,
pages 153-164, 1993.
[25] Toyota, http://www.toyota.co.jp/en/special/robot/.
[26] Wabian. http://www.uc3m.es/uc3m/dpto/IN/dpin04/Wabian03.html.
[27] Y. Yang and O. Brock. Elastic roadmaps: Globally task-consistent mo-
tion for autonomous mobile manipulation. In Proceedings of Robotics:
Science and Systems (RSS), Philadelphia, USA, August 2006.