🧩 Build-bench: LLM-driven Cross-ISA Build Repair Framework

This repository implements the Build-bench framework — an end-to-end evaluation pipeline for large language models (LLMs) to repair software build failures across instruction set architectures (ISA), such as x86_64 → aarch64 and aarch64 → x86_64.

The system enables autonomous reasoning, tool-augmented repair, and executable verification through iterative interactions based on the Model Context Protocol (MCP). An interactive visualization of all model results (success/failure cases, tool calls, repair iterations) is available at:
👉 https://buildbench-hub.github.io/buildbench.github.io

🌐 Overview

Build-bench automatically:

Collects failed software packages and contextual artifacts (source archives, specs, logs);
Guides the LLM to diagnose the failure cause;
Applies Full File Generation or Patch Generation;
Uploads the repaired package to the open build service (OBS) for validation;
Iteratively refines the repair based on build logs until success or reaching the iteration limit.

📂 Directory Structure

.
├── config/
│ └── info.yaml # Configuration file for dataset paths and model setup
│
├── data/
│ └── case_study/
│ └── x86_succeed_pro/ # Example successful build results for reference
│ └── texmath_diff.log # Diff log used for case study visualization
│
├── prompts/
│ ├── full_file_generation.txt # Prompt for Full File Generation repair strategy
│ └── patch_generation.txt # Prompt for Patch Generation repair strategy
│
├── tools/
│ └── auto_repair/
│ ├── get_repo_structure.py # Tool: Structure Extraction
│ ├── upload_files.py # Tool: Upload to Build Service
│ ├── check_build_res.py # Tool: Check Build Result (success/failure)
│
│── result_log_res/
│ └── texmath.log # Log file for texmath package repair process
│
│── result_text_res/
│ └── texmath_result.txt # Text file for texmath package repair result
│
├── client.py # MCP client: executes repair loop (Full File Generation)
├── client_patch.py # MCP client: executes repair loop (Patch Generation)
├── server.py # MCP server coordinating LLM and tools (Full File Generation)
├── server_patch.py # MCP server coordinating LLM and tools (Patch Generation)
├── uv.lock # Environment lock file for reproducibility
└── pyproject.toml # Dependencies and environment configuration

⚙️ Installation

# Clone the repository
git clone https://github.com/zcyyc/Build-bench.git
cd Build-bench

# Recreate the virtual environment
uv sync

Ensure that your .env file includes the llm api key as well as api_base_url:

API_KEY="xxx"
API_BASE_URL="xxx"

🚀 Running the Framework

Full-File Generation Strategy

# Launch the client for iterative repair
python client.py

Patch Generation Strategy

# Launch the client for iterative repair using patch strategy
python client_patch.py

📊 Data and Case Study

The folder data/case_study/ provides example build results (e.g., x86→ARM full repair) used in Section 5 of the paper for qualitative analysis.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧩 Build-bench: LLM-driven Cross-ISA Build Repair Framework

🌐 Overview

📂 Directory Structure

⚙️ Installation

🚀 Running the Framework

📊 Data and Case Study

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
config		config
data/case_study		data/case_study
images		images
prompts		prompts
result_log_res		result_log_res
result_text_res		result_text_res
tools/auto_repair		tools/auto_repair
.gitignore		.gitignore
README.md		README.md
client.py		client.py
client_patch.py		client_patch.py
pyproject.toml		pyproject.toml
server.py		server.py
server_patch.py		server_patch.py
uv.lock		uv.lock

zcyyc/Build-bench

Folders and files

Latest commit

History

Repository files navigation

🧩 Build-bench: LLM-driven Cross-ISA Build Repair Framework

🌐 Overview

📂 Directory Structure

⚙️ Installation

🚀 Running the Framework

📊 Data and Case Study

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages