Senior Site Reliability Engineer

Description

Ciklum is looking for a Senior Site Reliability Engineer to join our team full-time in Poland.

We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts and product owners, we engineer technology that redefines industries and shapes the way people live.

About the role:

As a Senior Site Reliability Engineer, become a part of a cross-functional development team engineering experiences of tomorrow.

We are developing a customer engagement platform within the Data & Services organization. This platform empowers global brands to forge stronger and more profitable customer relationships. We partner with brands to drive marketing transformation through innovative technology and services.

We are seeking a Senior Site Reliability Engineer (SRE) with expertise in AWS to join our team full-time. In this role, you will ensure the stability and health of our platform by fostering developer ownership and empowering developers to build resilient products. You will support developers during the application build phase with operational design, automation, capacity planning, and monitoring, and you will create and enforce operational standards while promoting an agile and learning culture.

Responsibilities

  • Plan, manage, and oversee all aspects of the production environment for merchant loyalty use cases
  • Define strategies for all facets of observability and identify areas of improvement in production
  • Apply MTTR, SLO, and SLI definitions to services
  • Respond to incidents, improve the platform based on feedback, and measure incident reduction over time
  • Maintain services by measuring and monitoring availability, latency, and overall system health
  • Practice sustainable incident response and conduct blameless postmortems
  • Ensure batch production scheduling and processes are accurate and timely
  • Create and execute queries on big data platforms and relational databases to identify issues or perform mass updates
  • Isolate problems between hardware and software components
  • Analyze IT Service Management activities and provide feedback to development teams on operational gaps or resiliency concerns
  • Support services before they go live through system design consulting, capacity planning, and launch reviews
  • Scale systems sustainably through automation and push for changes that improve reliability and velocity
  • Work with a global team across multiple geographies and time zones

Requirements

We know that sometimes, you can’t tick every box. We would still love to hear from you if you think you’re a good fit!

  • Bachelor’s degree in Computer Science, Software Engineering, or a related field
  • 5+ years of relevant experience in DevOps, SRE, or systems engineering, including managing large production platforms, with significant AWS experience
  • Experience architecting and implementing data governance processes and tooling (data catalogs, lineage tools, role-based access control, PII handling)
  • Proficiency with Splunk and SignalFx
  • Strong coding ability in Python or other languages (Java, C#, Golang, C, C++, Perl, Ruby)
  • Ability to debug and optimize code and automate routine tasks
  • Understanding of large-scale distributed systems
  • Relevant AWS certifications are highly preferred (e.g., AWS Solutions Architect Professional, AWS DevOps Engineer Professional, AWS Security Specialty)

Desirable

  • Cloud: Deep knowledge of AWS services, architectures, and design patterns
  • Infrastructure as Code (IaC): Advanced skills in CloudFormation, Terraform, or other IaC tools
  • Programming and Scripting: Strong coding ability in Python or other languages such as Java, C#, Golang, C, C++, Perl, or Ruby
  • Containerization and Microservices: Extensive experience with Docker, Kubernetes (EKS), and microservices architectures
  • Complex Configuration Management: Expertise in configuration management tools (Ansible, Chef, Puppet, etc.)
  • Robust Networking: In-depth knowledge of networking concepts, VPC design, hybrid cloud architectures, and network security
  • Security Focus: Strong understanding of cloud security principles, penetration testing, and compliance standards (e.g., SOC 2, ISO 27001, PCI DSS)

Personal skills

  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive. Ability to support many different stakeholders
  • Experience handling difficult situations and making decisions with a sense of urgency
  • Interest in designing, analyzing and troubleshooting large-scale distributed systems
  • Appetite for change and pushing the boundaries of what can be done with automation
  • Experience working across development, operations, and product teams to prioritize needs and build relationships
  • Solid grasp of change management and release management for software

What's in it for you

  • Care: your mental and physical health is our priority. We provide comprehensive company-paid medical insurance, life insurance and a Multisport card
  • Tailored education path: boost your skills and knowledge with our regular internal events (meetups, conferences, workshops), Udemy license, language courses and company-paid certifications
  • Growth environment: share your experience and level up your expertise with a community of skilled professionals, locally and globally
  • Flexibility: own your schedule – you are the one to decide when to start your working day. Just don’t miss your regular team stand-up
  • Opportunities: we value our specialists and always find the best options for them. Our Internal Mobility Program helps you change projects when needed so you can grow, excel professionally and fulfill your potential
  • Global impact: work on large-scale projects that redefine industries with international and fast-growing clients
  • Welcoming environment: feel empowered with a friendly team, open-door policy, informal atmosphere within the company and regular team-building events

About us:

Join a well-established company and a strong team of professionals.

Seize the perks of global opportunities, local approach and start-up spirit.

Boost your skills with modern stacks and industry-leading clients!

Enjoy what you do, do what you enjoy!

Be bold, not bored!

Experiences of tomorrow. Engineered together

Interested already?

We would love to get to know you! Submit your application. Can’t wait to see you at Ciklum.
