-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Description
Hi, I wanted to check if running launchw.sh is the command which recreates the number for table 1?
Cause I'm trying to rerun REMEMBERER for gpt-3.5-instruct-0913 due davinci-003 was no longer accessible from openai platform.
But the results I got is quite low with only 0.07 success rate
[2024-05-02 12:45:31,856 INFO webshop/186-MainProcess] END! TaskIdx: 99, TaskId: 99, #Steps: 4(0), Reward: 0.50, Succeds: False
[2024-05-02 12:45:31,856 INFO webshop/189-MainProcess] ──────────8.44──────────0.254──────────0.070──────────
[2024-05-02 12:45:31,857 INFO webshop/497-MainProcess] ━━━━━━━━━━━━━━━━━━━Epoch 0━━━━━━━━━━━━━━━━━━━━
[2024-05-02 12:45:31,857 INFO webshop/498-MainProcess] Size: 4, Avg AD Size: 1
I was wonder if there's any params I didn't get right for the launchw.sh?
This was the command found in launchw.sh:
python webshop.py --log-dir logs\
--observation-mode text_rich\
--load-replay history-pools/init_pool.wq.yaml\
--load-replay history-pools/init_pool.wq.yaml\
--save-replay history-pools/init_pool.wqu."$date_str".%d.a.yaml\
--save-replay history-pools/init_pool.wqu."$date_str".%d.b.yaml\
--item-capacity 500\
--action-capacity 20\
--matcher pgpat+insrel\
--prompt-template prompts/\
--max-tokens 200 \
--stop "Discouraged" \
--request-timeout 10.\
--starts-from 0\
--epochs 3\
--trainseta 0\
--trainsetb 10\
--testseta 0\
--testsetb 100
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels