Skip to content

Extended System and GPU Monitoring with Collectl and NVML

Notifications You must be signed in to change notification settings

sales-lab/CudaMon

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CudaMon: GPU Monitoring for R

Extended System and GPU Monitoring with Collectl and NVML

Repo Logo

Quick Start

library(CudaMon)

# Start monitoring with GPU tracking
proc <- cl_start("my_workflow", monitor_gpu = TRUE, gpu_monitor_type = "nvml")

# Your GPU computation here
cl_timestamp(proc, "computation_start")
# ... your code ...
cl_timestamp(proc, "computation_end")

# Stop and visualize
cl_stop(proc)
cl_plot_system_metrics(proc)  # 4-panel CPU/GPU plot

Features

  • System Monitoring: CPU, memory, disk I/O via collectl
  • GPU Monitoring: Two backends:
    • collectl -sG (basic)
    • NVML (NVIDIA GPUs - detailed metrics)
  • Background Collection: Continuous GPU metrics during computation
  • Integrated Plots: Combined CPU/GPU usage visualization

Installation

devtools::install_github("aminzadenoori/CudaMon")

System Requirements: collectl utility, NVIDIA drivers (for NVML)

API

  • cl_start() / cl_stop() - Start/stop monitoring
  • cl_timestamp() - Mark workflow phases
  • cl_get_gpu_metrics() - Access real-time GPU data
  • cl_plot_system_metrics() - Generate usage plots

Example Output

CPU/GPU Monitoring Plot

Shows CPU usage, memory, GPU utilization, and GPU memory over time.


Extends Rcollectl with GPU monitoring capabilities for profiling accelerated computations.

About

Extended System and GPU Monitoring with Collectl and NVML

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 73.5%
  • C 16.4%
  • Dockerfile 10.1%