Operations Research Center
Seminars & Events
 
Skip to content

Fall 2012 Seminar Series

MASSACHUSETTS INSTITUTE OF TECHNOLOGY
OPERATIONS RESEARCH CENTER
FALL 2012 SEMINAR SERIES

DATE: October 4th
LOCATION: E51-376
TIME: 4:15pm
Reception immediately following

SPEAKER:
Jim Smith

TITLE
Optimal Sequential Exploration: Bandits, Clairvoyants, and Wildcats

ABSTRACT
This paper was motivated by the problem of developing an optimal policy for exploring an oil and gas field in the North Sea. Where should we drill first? Where do we drill next? This sequential exploration problem resembles a multiarmed bandit problem, but probabilistic dependence plays a key role: outcomes at drilled sites reveal information about neighboring targets. Good exploration policies will take advantage of this information as it is revealed. We develop heuristic policies for sequential exploration problems and complement these heuristics with upper bounds on the performance of an optimal policy. We begin by grouping the targets into clusters of manageable size. The heuristics are derived from a model that treats these clusters as independent. The upper bounds are given by assuming each cluster has perfect information about the results from all other clusters. The analysis relies heavily on results for bandit superprocesses, a generalization of the multiarmed bandit problem. We evaluate the heuristics and bounds using Monte Carlo simulation and, in the North Sea example, we find that the heuristic policies are nearly optimal.

 

Joint work with David Brown, Fuqua School of Business, Duke University.