Robot Intent Recognition

My MS thesis enables a new kind of interaction between humans and computational agents, such as robots or computers, by allowing the agent to anticipate and adapt to human intent. In the future, more robots may be deployed in situations that require collaboration with humans, such as scientific exploration, search and rescue, hospital assistance, and even domestic care. These situations require robots to work together with humans, as part of a team, rather than as a stand-alone tool. The intent recognition capability is necessary for computational agents to play a more collaborative role in human-robot interactions, moving beyond the standard master-slave relationship of humans and computers today.

I develop an innovative capability for recognizing human intent, through statistical plan learning and online recognition. I approach the plan learning problem by employing unsupervised learning to automatically determine the activities in a plan based on training data. The plan activities are described by a mixture of multivariate probability densities. The number of distributions in the mixture used to describe the data is assumed to be given. The training data trajectories are fed again through the activities' density distributions to determine each possible sequence of activities that make up a plan. These activity sequences are then summarized with temporal information in a temporal plan network, which consists of a network of all possible plans. The approach to plan recognition begins with formulating the temporal plan network as a hidden Markov model. Next, the most likely path can be determined by using the Viterbi algorithm. Finally, I refer back to the temporal plan network to obtain predicted future activities.

This research presents several innovations: First, I introduce a modified representation of temporal plan networks that incorporates probabilistic information into the state space and temporal representations. Second, I learn plans from actual data, such that the notion of an activity is not arbitrarily or manually defined, but is determined by the characteristics of the data. Third, I develop a recognition algorithm that can perform recognition continuously by making probabilistic updates. Finally, the recognizer not only identifies previously executed activities, but also predicts future activities based on the plan network.

I demonstrate the capabilities of the algorithms on motion capture data. The results show that the plan learning algorithm is able to generate reasonable temporal plan networks, depending on the dimensions of the training data and the recognition resolution used. The plan recognition algorithm is also successful in recognizing the correct activity sequences in the temporal plan network corresponding to the observed test data.

For more details, please see my MS thesis (pdf).

Research Group

I work in the Computer Science and Artificial Intelligence Lab (CSAIL), with the Model-based Embedded and Robotic Systems (MERS) group. We play with rovers and robotic arms, and when we're not playing, we can often be found coding, reading, group meeting-ing, or passed out on our group's couch. Our fearless leader Professor Brian Williams keeps us in line, but when he is not here, which is most of the time, our Research Scientist Paul Robertson and our Post Doctoral Researcher Andreas Hofmann pushes us foward.





Last Update: 06-12-2007 - 17:12