agent-hacking

This repo shows 2 agent hack scenarios that are the most critical to the 2 ways that agents are utilized today. The purpose of this repo is to run these scenarios in environments with proper tracing to see what they look like at that level, and then to add security to make these increasingly difficult to pull off.

The scenarios are:

web facing agents being convinced to break container in a deployed VM
local running agents writing a remote access trojan on a personal machine

In the case of the web facing agent, a user might be able to prompt the agent, get it to break normal protocol, reveal information about it's environment, and ultimately search the underlying VM for secrets, expose code, or simply allow the VM to be utilized for something totally different like crime or crypto mining.

In the case of the RAT, obviously a hacker would want to hide a jailbreak on the internet, have someone's local agent read it, and create the ability for the hacker to be able to control their computer remotely.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
examples		examples
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

agent-hacking

About

Uh oh!

Releases

Packages

License

Ghostlock-AI/agent-hacking

Folders and files

Latest commit

History

Repository files navigation

agent-hacking

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages