Tom Silver | About Me | Favorite Papers | Blog

Favorite Papers

Please send any reference errors to tomssilver at gmail dot com.

Ganin, Yaroslav et al. "Synthesizing programs for images using reinforced adversarial learning." 2018. [pdf]

Konidaris, George, Leslie Pack Kaelbling, and Tomás Lozano-Pérez. "From skills to symbols: Learning symbolic representations for abstract high-level planning." JAIR. 2018. [pdf]

Riochet, Ronan, et al. "IntPhys: A framework and benchmark for visual intuitive physics reasoning." arXiv. 2018. [pdf]

Srinivas, Aravind, et al. "Universal planning networks." arXiv. 2018. [pdf]


Duan, Yan, et al. "One-shot imitation learning." NIPS. 2017. [pdf]

Ellis, Kevin, et al. "Learning to infer graphics programs from hand-drawn images." arXiv. 2017. [pdf]

Fraccaro, Marco, et al. "A disentangled recognition and nonlinear dynamics model for unsupervised learning." NIPS. 2017. [pdf]

Kawaguchi, Kenji, Leslie Pack Kaelbling, and Yoshua Bengio. "Generalization in deep learning." arXiv. 2017. [pdf]

Lake, Brenden M., et al. "Building machines that learn and think like people." BBS. 2017. [pdf]

Mishra, Nikhil, Pieter Abbeel, and Igor Mordatch. "Prediction and control with temporal segment models." ICML. 2017. [pdf]

Pathak, Deepak, et al. "Curiosity-driven exploration by self-supervised prediction." ICML. 2017. [pdf]

Stewart, Russell, and Stefano Ermon. "Label-free supervision of neural networks with physics and domain knowledge." AAAI. 2017. [pdf]

Zhu, Shaojun, Andrew Kimmel, and Abdeslam Boularias. "Information-theoretic model identification and policy search using physics engines with application to robotic manipulation." arXiv. 2017. [pdf]


Agrawal, Pulkit, et al. "Learning to poke by poking: Experiential learning of intuitive physics." NIPS. 2016. [pdf]

Battaglia, Peter, et al. "Interaction networks for learning about objects, relations and physics." NIPS. 2016. [pdf]

Chang, Michael B., et al. "A compositional object-based approach to learning physical dynamics." NIPS. 2016. [pdf]

Garnelo, Marta, Kai Arulkumaran, and Murray Shanahan. "Towards deep symbolic reinforcement learning." arXiv. 2016. [pdf]

Levine, Sergey, et al. "End-to-end training of deep visuomotor policies." JMLR. 2016. [pdf]

Tran, Dustin, et al. "Edward: A library for probabilistic modeling, inference, and criticism." arXiv. 2016. [pdf]

Zhou, Yilun, and George Konidaris. "Representing and learning complex object interactions." RSS. 2016. [pdf]


Kitaev, Nikita, et al. "Physics-based trajectory optimization for grasping in cluttered environments." ICRA. 2015. [pdf]

Lake, Brenden M., Ruslan Salakhutdinov, and Joshua B. Tenenbaum. "Human-level concept learning through probabilistic program induction." Science. 2015. [pdf]

Schaul, Tom, et al. "Universal value function approximators." ICML. 2015. [pdf]

Watter, Manuel, et al. "Embed to control: A locally linear latent dynamics model for control from raw images." NIPS. 2015. [pdf]

Wu, Jiajun, et al. "Galileo: Perceiving physical object properties by integrating a physics engine with deep learning." NIPS. 2015. [pdf]


Battaglia, Peter W., Jessica B. Hamrick, and Joshua B. Tenenbaum. "Simulation as an engine of physical scene understanding." PNAS. 2013. [html]

Deisenroth, Marc, and Carl E. Rasmussen. "PILCO: A model-based and data-efficient approach to policy search." ICML. 2011. [pdf]

Kaelbling, Leslie Pack, and Tomás Lozano-Pérez. "Hierarchical task and motion planning in the now." ICRA. 2011. [pdf]

Levine, Sergey, and Vladlen Koltun. "Guided policy search." ICML. 2013. [pdf]

Moldovan, Bogdan, et al. "Learning relational affordance models for robots in multi-object manipulation tasks." ICRA. 2012. [pdf]

Mordatch, Igor, Emanuel Todorov, and Zoran Popović. "Discovery of complex behaviors through contact-invariant optimization." SIGGRAPH. 2012. [pdf]

Mugan, Jonathan, and Benjamin Kuipers. "Autonomous learning of high-level states and actions in continuous environments." IEEE-TAMD. 2012. [pdf]

Srivastava, Siddharth, et al. "Combined task and motion planning through an extensible planner-independent interface layer." ICRA. 2014. [pdf]

Todorov, Emanuel, Tom Erez, and Yuval Tassa. "MuJoCo: A physics engine for model-based control." IROS. 2012. [pdf]

Wingate, David, et al. "Bayesian policy search with policy priors." IJCAI. 2011. [pdf]


Attias, Hagai. "Planning by probabilistic inference." AISTATS. 2003. [pdf]

Boutilier, Craig, Richard Dearden, and Moisés Goldszmidt. "Stochastic dynamic programming with factored representations." Artificial intelligence. 2000. [pdf]

Dietterich, Thomas G. "Hierarchical reinforcement learning with the MAXQ value function decomposition." JAIR. 2000. [pdf]

Diuk, Carlos, Andre Cohen, and Michael L. Littman. "An object-oriented representation for efficient reinforcement learning." ICML. 2008. [pdf]

Lang, Tobias, and Marc Toussaint. "Approximate inference for planning in stochastic relational worlds." ICML. 2009. [pdf]

Sanghai, Sumit, Pedro Domingos, and Daniel Weld. "Relational dynamic Bayesian networks." JAIR. 2005. [pdf]

Toussaint, Marc. "Robot trajectory optimization using approximate inference." ICML. 2009. [pdf]


Blum, Avrim L., and Merrick L. Furst. "Fast planning through planning graph analysis." Artificial intelligence. 1997. [pdf]

Boutilier, Craig, Thomas Dean, and Steve Hanks. "Decision-theoretic planning: Structural assumptions and computational leverage." JAIR. 1999. [pdf]

Brooks, Rodney A. "Intelligence without representation." Artificial intelligence. 1991. [pdf]

Jacobs, Robert A., et al. "Adaptive mixtures of local experts." Neural computation. 1991. [pdf]

Murphy, Kevin. "Switching Kalman Filters." 1998. [pdf]

Sutton, Richard S., Doina Precup, and Satinder Singh. "Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning." Artificial intelligence. 1999. [pdf]

Before 1990

Agre, Philip E., and David Chapman. "Pengi: An Implementation of a Theory of Activity." AAAI. 1987. [pdf]

Forbus, Kenneth D. "Qualitative process theory." Qualitative reasoning about physical systems. 1984. [pdf]

Newell, Allen, and Herbert A. Simon. "Computer science as empirical inquiry: Symbols and search." Turing Award Acceptance Speech. 1975. [pdf]

Nilsson, Nils J. "Shakey the robot." SRI International. 1984. [video]

Pearl, Judea. "Fusion, propagation, and structuring in belief networks." Artificial intelligence. 1986. [pdf]

Turing, Alan M. "Computing machinery and intelligence." Mind. 1950. [pdf]