OpenSRE is built to monitor cloud batch jobs (such as AWS Batch) across multiple instances.Documentation Index
Fetch the complete documentation index at: https://opensre.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
It transparently correlates tasks even when they run on different machines. To integrate with AWS Batch, include the OpenSRE agent in your compute environment and initialize it in each job.
For example, add it to the AMI or Docker image used by your jobs
Steps to get started:
- Prepare your AWS Batch environment: Ensure the OpenSRE agent is available to your Batch compute instances (for example, install it via user data or a custom Docker image).
-
Initialize OpenSRE in jobs: In the commands that run on each node (or as part of job startup), call
tracer init --token <token>to start the agent. - Run your AWS Batch workflow: Submit your Batch jobs as usual. The OpenSRE agent will link tasks across nodes into a single pipeline trace.
- Monitor in Dashboard: View the running jobs in the OpenSRE dashboard. You’ll see aggregated metrics (CPU, memory, I/O, costs) for the entire AWS Batch run, with dependencies across instances visualized.
Tracer