GUI interaction capture -- production-ready event streams with time-aligned media
-
Updated
Jan 19, 2026 - Python
GUI interaction capture -- production-ready event streams with time-aligned media
OpenAdapt’s open-source ML toolkit for training and evaluating general multimodal GUI-action models.
Temporal smoothing for UI element detection with OmniParser integration
Multimodal demo retrieval for GUI automation
HTML viewer components for ML dashboards and benchmarks
PII/PHI detection and redaction for GUI automation data (text, images, dicts)
Evaluation infrastructure for GUI agent benchmarks
Add a description, image, and links to the openadapt topic page so that developers can more easily learn about it.
To associate your repository with the openadapt topic, visit your repo's landing page and select "manage topics."