🧠 DAVIS: Planning Agent with Knowledge Graph-Powered Inner Monologue

Note

🔥 DAVIS will be presented virtually at EMNLP 2025.
This repository contains the original code from the DAVIS paper. However, due to updates in dependencies since the time of experiments, the code may currently be unstable.
Efforts to clean and update the codebase are ongoing.

Introduction

Designing a generalist scientific agent capable of performing tasks in laboratory settings to assist researchers has become a key goal in recent Artificial Intelligence (AI) research. Unlike everyday tasks, scientific tasks are inherently more delicate and complex, requiring agents to possess a higher level of reasoning ability, structured and temporal understanding of their environment, and a strong emphasis on safety. Existing approaches often fail to address these multifaceted requirements. To tackle these challenges, we present DAVIS . Unlike traditional retrieval-augmented generation (RAG) approaches, DAVIS incorporates structured and temporal memory, which enables model-based planning. Additionally, DAVIS implements an agentic, multi-turn retrieval system, similar to a human's inner monologue, allowing for a greater degree of reasoning over past experiences. DAVIS demonstrates substantially improved performance on the ScienceWorld benchmark comparing to previous approaches on 8 out of 9 elementary science subjects. In addition, DAVIS's World Model demonstrates competitive performance on the famous HotpotQA and MusiqueQA dataset for multi-hop question answering. To the best of our knowledge, DAVIS is the first RAG agent to employ an interactive retrieval method in a RAG pipeline.

How to Reproduce

Steps

Clone the Repository

git clone https:/minhphd/DAVIS
cd DAVIS

Install Dependencies (Python 3.11.0)
```
pip install -r requirements.txt
```
Create PostgreSQL Tables

Run the following query to create the required tables:
```
psql -U your_username -d your_database -f kg_graph/kgraph.psql
```
Configure the Project
- Fill in config/config.ini.example with your API keys, PostgreSQL username, and password.
- Rename config/config.ini.example to config.ini
Populate and Construct WorldModel

Run the training script:
```
python ReasonAgentTraining.py
```
This process might take a while.
Run the Experiments (COSTLY)

Caution

Ensure you have at least $50 of OpenAI credits available. Short tasks typically take up to 5 minutes, while long tasks can take up to an hour. Full 90 variations over 30 tasks using GPT-4-Turbo model will cost over $2000 of credits.

python ExperimentRunner.py

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
baselines		baselines
config		config
figures		figures
img		img
kg_graph		kg_graph
prompts		prompts
.DS_Store		.DS_Store
.gitignore		.gitignore
ExperimentRunner.py		ExperimentRunner.py
LICENSE		LICENSE
README.md		README.md
ReasonAgentTraining.py		ReasonAgentTraining.py
ReasoningAgent.py		ReasoningAgent.py
hotpot_evaluate.py		hotpot_evaluate.py
requirements.txt		requirements.txt
setup_hotpotqa.ipynb		setup_hotpotqa.ipynb
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 DAVIS: Planning Agent with Knowledge Graph-Powered Inner Monologue

Introduction

How to Reproduce

Steps

About

Uh oh!

Releases

Packages

Languages

License

minhphd/DAVIS

Folders and files

Latest commit

History

Repository files navigation

🧠 DAVIS: Planning Agent with Knowledge Graph-Powered Inner Monologue

Introduction

How to Reproduce

Steps

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages