Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning

  • 2024-04-09 15:08:47
  • Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jingyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng
  • 0

Abstract

The mathematical formula is the human language to describe nature and is theessence of scientific research. Finding mathematical formulas fromobservational data is a major demand of scientific research and a majorchallenge of artificial intelligence. This area is called symbolic regression.Originally symbolic regression was often formulated as a combinatorialoptimization problem and solved using GP or reinforcement learning algorithms.These two kinds of algorithms have strong noise robustness ability and goodVersatility. However, inference time usually takes a long time, so the searchefficiency is relatively low. Later, based on large-scale pre-training dataproposed, such methods use a large number of synthetic data points andexpression pairs to train a Generative Pre-Trained Transformer(GPT). Then thisGPT can only need to perform one forward propagation to obtain the results, theadvantage is that the inference speed is very fast. However, its performance isvery dependent on the training data and performs poorly on data outside thetraining set, which leads to poor noise robustness and Versatility of suchmethods. So, can we combine the advantages of the above two categories of SRalgorithms? In this paper, we propose \textbf{FormulaGPT}, which trains a GPTusing massive sparse reward learning histories of reinforcement learning-basedSR algorithms as training data. After training, the SR algorithm based onreinforcement learning is distilled into a Transformer. When new test datacomes, FormulaGPT can directly generate a "reinforcement learning process" andautomatically update the learning policy in context. Tested on more than tendatasets including SRBench, formulaGPT achieves the state-of-the-artperformance in fitting ability compared with four baselines. In addition, itachieves satisfactory results in noise robustness, versatility, and inferenceefficiency.

 

Quick Read (beta)

loading the full paper ...