Program-of-Thought (PoT) fine-tuning and evaluation for GSM8K-style perturbed math data.
Install dependencies:

```bash
pip install -r requirements.txt
```

If needed:

```bash
pip install tf-keras
```

Fine-tune:

```bash
python finetune_pot.py \
    --model_name meta-llama/Llama-3.1-8B-Instruct \
    --data_path training_data/gsm8k_concrete_training_data.json \
    --output_dir pot-finetuned
```

Evaluate with the PoT prompt:

```bash
python evaluate_prompts.py \
    --model_path pot-finetuned \
    --dataset_path gsm_perturbed_with_new_questions.json \
    --prompt pot
```
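For quick interactive checks outside `evaluate_prompts.py`, the fine-tuned model can be loaded directly. This is a minimal sketch, assuming `finetune_pot.py` saves a PEFT/LoRA adapter to `--output_dir` (suggested by the `--lora_*` flags, but not confirmed here); if the script saves a merged model instead, load it with `AutoModelForCausalLM.from_pretrained("pot-finetuned")` alone.

```python
# Sketch: load the base model plus the LoRA adapter saved by fine-tuning.
# Assumes --output_dir contains a PEFT adapter (adapter_config.json etc.).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_name = "meta-llama/Llama-3.1-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(base_name)
base = AutoModelForCausalLM.from_pretrained(
    base_name, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, "pot-finetuned")

prompt = "Write Python code that prints the answer: A book costs $12. How much do 7 books cost?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256, do_sample=False)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```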
finetune_pot.py: training script. Command-line arguments:

- `--model_name`, `--data_path`, `--prompt_key`, `--completion_key`, `--output_dir` (the expected data format is sketched below)
- `--num_epochs`, `--batch_size`, `--learning_rate`, `--gradient_accumulation_steps`, `--max_length`, `--warmup_steps`
- `--lora_r`, `--lora_alpha`, `--lora_dropout`, `--use_4bit`
- `--train_size`, `--eval_size`
- `--logging_steps`, `--save_steps`
- `--wandb_project`, `--wandb_run_name`, `--no_wandb`
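The exact layout of the `--data_path` file isn't documented in this section; the `--prompt_key` and `--completion_key` arguments suggest a list of records with one prompt field and one code-completion field. A minimal sketch of such a file, with the field names below being assumptions (match them to whatever you pass for `--prompt_key` / `--completion_key`):

```python
# Hypothetical training record layout; the key names "prompt" and "completion"
# are assumptions and should match --prompt_key / --completion_key.
import json

records = [
    {
        "prompt": (
            "A book costs $12. How much do 7 books cost? "
            "Write Python code that prints the answer."
        ),
        "completion": "price = 12\nnum_books = 7\nprint(price * num_books)",
    }
]

with open("training_data/example_pot_data.json", "w") as f:
    json.dump(records, f, indent=2)
```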
evaluate_prompts.py: evaluation script. Command-line arguments:

- `--model_path`, `--dataset_path`, `--prompt`, `--prompt_templates`, `--output_file`
- `--device`, `--use_base_model`
- `--num_attempts`, `--temperature`, `--top_p`, `--max_new_tokens`
- `--tolerance` (see the answer-checking sketch below)
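The scoring details live in `evaluate_prompts.py`. As a rough mental model of PoT evaluation (not the repo's exact code), the generated program is executed and its printed result is compared to the gold answer, with `--tolerance` allowing small numeric differences. A self-contained sketch under those assumptions:

```python
# Sketch of PoT-style answer checking; not the repo's actual implementation.
# Assumes the generated program prints a single numeric answer and that
# --tolerance bounds the allowed absolute difference.
import subprocess

def run_generated_program(code: str, timeout: float = 5.0) -> str:
    """Run model-generated Python in a subprocess and capture its stdout."""
    result = subprocess.run(
        ["python", "-c", code], capture_output=True, text=True, timeout=timeout
    )
    return result.stdout.strip()

def is_correct(predicted: str, gold: str, tolerance: float = 1e-4) -> bool:
    """Accept the prediction if it matches the gold answer within tolerance."""
    try:
        return abs(float(predicted) - float(gold)) <= tolerance
    except ValueError:
        # Fall back to exact string match for non-numeric answers.
        return predicted.strip() == gold.strip()

generated_code = "price = 12\nnum_books = 7\nprint(price * num_books)"
print(is_correct(run_generated_program(generated_code), "84"))  # True
```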
prompt_templates.json: prompt templates used during evaluation (a hypothetical layout is sketched below)
training_data/: example datasets
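The schema of prompt_templates.json isn't shown in this section. A plausible layout, where every key and placeholder below is an assumption, maps a template name (selectable via `--prompt`, e.g. `pot`) to a prompt string:

```python
# Hypothetical prompt_templates.json layout; the real file's keys and
# placeholder names may differ. "pot" mirrors the --prompt pot example above.
import json

templates = {
    "pot": (
        "Solve the following math problem by writing Python code that "
        "prints the final answer.\n\nProblem: {question}\n\nPython code:"
    ),
}

with open("prompt_templates_example.json", "w") as f:
    json.dump(templates, f, indent=2)
```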
This repo does not include the ReasonAgain dataset. You can find it in the CogComp reasoning-eval repository.