Job Description
Responsibilities:
Reliability Engineering & Operations
Own and improve service reliability through SLO/SLI definition, error budgets, and operational best practices.
Design, implement, and maintain observability (monitoring, logging, tracing, alerting) to reduce MTTR and improve proactive detection.
Lead incident response practices including on-call improvements, runbooks, post-incident reviews (RCA), and preventative actions.
Partner with application teams to improve performance, capacity planning, and resiliency under failure scenarios.
Infrastructure & Cloud Architecture<...
Ready to Apply?
Submit your application today and join our talented team at Castleton Commodities International LLC.
Submit ApplicationJob Details
- Location Stamford, Connecticut
- Job Type Full time
- Category Computer Occupations
- Posted Date June 07, 2026
- Application Deadline July 17, 2026