Gerald Tesauro

TD-Gammon

  • Developed TD-Gammon, a backgammon program that achieved world-championship level play through self-play and temporal difference learning.
  • Contributed to the field of reinforcement learning, a key area in AI.
  • Works at IBM, continuing research in artificial intelligence and machine learning.

Gerald J. Tesauro is an American computer scientist and researcher at IBM. He is renowned for creating TD-Gammon, a groundbreaking backgammon program that learned to play at a world-championship level through self-play, a significant milestone in the field of artificial intelligence.

Milestones

  • 1992
    Expert Systems and the Knowledge Boom Research
    The Neural Network That Mastered Backgammon

    Gerald Tesauro's TD-Gammon taught itself to play backgammon at world-champion level, proving neural networks could discover strategies humans never imagined.

    1992