




We are seeking an experienced **Senior Site Reliability Engineer** to join our team and contribute to building and maintaining reliable, scalable infrastructure. In this role, you will collaborate closely with software and operations engineers to bridge the gap between infrastructure and software. You will play a key role in ensuring system reliability, scalability, and operational excellence while working with modern technologies and tools.

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

**Responsibilities**

* Collaborate with software engineers to integrate infrastructure and software systems seamlessly
* Apply SRE principles and software engineering practices to build, maintain, monitor, and operate complex infrastructure
* Leverage infrastructure automation tools to streamline operations and improve reliability
* Design and maintain scalable web architectures and cloud-based technologies
* Write clean, efficient code in multiple programming languages such as Golang, Python, Ruby, and Scala
* Troubleshoot and resolve issues, driving them to completion in high-pressure scenarios
* Monitor system performance and implement solutions to ensure uptime and reliability

**Requirements**

* At least 3 years of experience in building, operating, or supporting large Linux-based web application environments
* Proficiency in UNIX systems administration with strong scripting skills in Python, PHP, or Bash
* Hands-on experience running Docker with orchestration tools like Nomad, Kubernetes, or Amazon ECS
* Familiarity with configuration management systems such as Ansible, Chef, or Puppet (experience with Puppet preferred)
* Strong communication skills and the ability to collaborate effectively with distributed teams
* Ability to write clean, well-documented, and comprehensible systems and scripts
* Passion for continuous learning and working with new technologies and programming languages
* Fluent English communication skills, both written and spoken, at a B2+ level or higher

**Nice to have**

* Experience with observability and application performance monitoring tools such as ELK, Prometheus, New Relic, Sentry, or Lightstep
* Proficiency in Ruby or Scala for scripting and development tasks

**We offer**

* Connectivity Bonus (15,000 ARS paid with the salary receipt at the end of each month as a non-wage item).
* Medicina Prepaga (private health plan covering the collaborator and direct family group).
* Paternity Leave (two additional days are added to what is established by law, for a total of 4 days).
* Discount card.
* English Training (English lessons twice per week).
* Training Program (access to multiple customized training plans according to the needs of each role within the company).
* Marriage Bonus (the company doubles the allowance established by law that ANSES offers).
* Referral Program (a referral bonus is paid when a candidate referred by a collaborator joins the company).
* External Agreements and Discounts.
* Vacations: 14 calendar days a year.

*By applying to our role, you are agreeing that your personal data may be used as set out in EPAM's Privacy Notice and Policy.*


