Large-Language-Model For Math

让LLM大模型解决数学问题! — #TODO

由于最近在做相关方向的科研,将阅读的论文整理在这里。

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
💡 阅读记录格式:

### Pretraining

1. which foundation models are based on?
2. what tokenizers are adopted?
3. which datasets are collected specific for "math"?
4. what types of pre-processing methods are introduced?
5. other information that you think is important

### Fine-tuning

1. which datasets are used?
2. what types of pre-processing methods are used?

### Evaluation

1. which datasets are used?
2. what type of pre-processing methods are used?
3. what evaluation metrics are used?

阅读的论文列表:

Training Verifiers to Solve Math Word Problems

Solving Quantitative Reasoning Problems with Language Models

MathPrompter: Mathematical Reasoning using Large Language Models

PAL: Program-aided Language Models

Specializing Smaller Language Models towards Multi-Step Reasoning