- Use Alpaca-style prompt tuning
- Modify the Hugging Face dataset in finetune.py
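
As a rough illustration of the Alpaca-style prompt mentioned above, here is a minimal sketch of the widely used no-input Alpaca template. Whether finetune.py uses exactly this wording is an assumption, and `build_prompt` and `build_training_text` are hypothetical helper names, not functions from the script.

```python
# Alpaca-style template (no-input variant).
# Assumption: finetune.py may use different wording or an extra "### Input:" field.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def build_prompt(instruction: str) -> str:
    """Fill the template with one instruction (hypothetical helper)."""
    return ALPACA_TEMPLATE.format(instruction=instruction)


def build_training_text(instruction: str, response: str) -> str:
    """Concatenate the prompt and the target response into one training string."""
    return build_prompt(instruction) + response


if __name__ == "__main__":
    print(build_training_text("Write a function that adds two numbers.",
                              "def add(a, b):\n    return a + b"))
```

Mapping each dataset row through a function like this is one way to adapt a different Hugging Face dataset to the same prompt format.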
python finetune.py

git clone https://github.com/openai/human-eval
cd human-eval/
pip install -e .

Generate samples and save them in the following JSON Lines (jsonl) format, where each sample is formatted into a single line like so:
{"task_id": "Corresponding HumanEval task ID", "completion": "Completion only without the prompt"}
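
The format above can be produced with a few lines of standard-library Python. This is only a sketch: `make_sample`, `strip_prompt`, and `write_samples` are hypothetical helpers, not part of eval_humaneval.py or the human-eval package, and it assumes the model output may begin with the prompt, which has to be removed.

```python
import json
from typing import Dict, Iterable


def strip_prompt(prompt: str, generated: str) -> str:
    """Drop the prompt from the model output if the output starts with it."""
    if generated.startswith(prompt):
        return generated[len(prompt):]
    return generated


def make_sample(task_id: str, completion: str) -> Dict[str, str]:
    """Build one record with the two keys the evaluator expects."""
    return {"task_id": task_id, "completion": completion}


def write_samples(samples: Iterable[Dict[str, str]], path: str) -> None:
    """Write one JSON object per line; json.dumps escapes embedded newlines."""
    with open(path, "w", encoding="utf-8") as f:
        for sample in samples:
            f.write(json.dumps(sample) + "\n")


if __name__ == "__main__":
    prompt = "def add(a, b):\n"
    generated = prompt + "    return a + b\n"  # stand-in for real model output
    sample = make_sample("HumanEval/0", strip_prompt(prompt, generated))
    write_samples([sample], "samples.jsonl")
```

Because `json.dumps` escapes newlines inside the completion, each sample stays on a single line even when the generated code spans several lines.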
Run this script to generate the JSONL samples file:
python eval_humaneval.py

Then evaluate the samples:
evaluate_functional_correctness samples.jsonl