SAMSUNG - The Future Belongs to Those Who Make It

Senior Site Reliability Engineer for Samsung Ads Project

About our Team

Samsung Ads is an advanced advertising technology company in rapid growth that focuses on enabling brands to connect with Samsung TV audiences as they are exposed to digital media by using the industry’s most comprehensive data to build the world’s smartest advertising platform. Being part of an international company such as Samsung and doing business around the world means that we get to work on the most challenging projects with stakeholders and teams located around the globe.

The Engineering Platform (EP) team is a team that builds, operates, and offers products that benefit multiple engineering teams within the Samsung Ads project. These are typically foundational services and include runtime environments, scheduling, observability, monitoring, and more.

As an embedded Site Reliability Engineering (SRE) you’ll be part of a software development team and act as a subject matter expert on the challenges of usability, performance, reliability, scalability and observability.

The ideal candidate has deep knowledge and a strong interest in process automation, observability, software-defined infrastructure, and approaches it from the perspective of a software engineer. Challenges of globally distributed services, deciding what and when state should be shared, and simulating failure scenarios should drive you.

You will work with some incredibly talented and passionate developers with a solid technical background to bring products and services to a market with unique technical challenges.

Role and Responsibilities

  • Co-architect new services, including failure tolerance and self-healing by-design, as well as establishing clear scaling-out paths
  • Act as a subject matter expert for the challenges of infrastructure and operation within your teamTranslate Product Owner requirements into actionable technical tasks
  • Advise on tuning observability systems to represent the health of the systems your team is responsible for and glean insights to plan for growth
  • Contribute to the global SRE practice
  • Empower your development team with tooling and automation, including CI/CD
  • Continuously improve internal services for ease of packaging, configuration, and deployment
  • Participate in shared on-call rotation
  • Evaluating and benchmarking new solutions, establishing capacity and growth plans
  • Developing and supporting usable and maintainable tooling for the engineering organization
  • Administration of services, whether built in-house or from external vendors
  • Continuous optimization of services on all layers (hardware, software) for high performance
  • Monitoring of all critical services, sharing pager duty, troubleshooting, and addressing problems as they arise (including any needed changes in code, topology, resources, or configuration)
  • Backup/DR implementation, plans, documentation, and exercises
  • Co-own technical relationships with several service providers and vendors

Technologies in use

  • AWS
  • Kubernetes
  • Terraform
  • EKS, Rancher
  • HashiCorp Vault, Prometheus, Okta
  • GitHub Actions, ArgoCD, Argo Rollout
  • Grafana, Sloth, Loki, Tempo

Skills and Qualifications

  • Strong expertise administrating and scaling Kubernetes on AWS (CKA, CKAD, CKS are nice to have)
  • Strong understanding of distributed systems and client-server architectures
  • Strong Linux system administration and troubleshooting skills, including solid knowledge of how the various components work (kernel, CPU, memory, disk, network)
  • Experience with Infrastructure as Code tools (Terraform and custom modules)
  • Experience working in microservices environments
  • Capacity and willingness to work in an agile multi-team environment
  • Demonstrated ability to prioritize tasks and promptly resolve problems
  • Ability to work autonomously, multi-task, and work in a fast-paced environment
  • You have a track record of making things better and leading solutions that remove technical pain points and facilitate growth
  • You enjoy working with others who are intelligent and passionate about building practical, reliable, high-performance products
  • Excellent communication skills in English
  • Experience in Observability Platforms.
  • Relevant software engineering experience with at least one language (Go, Ruby, Python, Erlang or Java)

We offer

  • Team:
    • Friendly working atmosphere
    • Wide range of trainings
    • Opportunity to work in multiple projects
    • Working with the latest technologies on the market
    • Monthly integration budget
    • Possibility to attend local and foreign conferences
    • Start of work between 7 a.m. and 10 a.m.
  • Equipment:
    • PC workstation/Laptop + 2 external monitors
  • Benefits:
    • Private medical care (possibility to add family members for free)
    • Multisport card
    • Life insurance
    • Lunch card
    • A partial reimbursement of the cost of an English language course
    • Possibility to learn Korean for free
    • Variety of discounts (Samsung products, theaters, restaurants)
    • Unlimited free access to Copernicus Science Center for you and your friends
    • Possibility to test new Samsung products
  • Location:
    • Office in Warsaw Spire near metro station
    • Hybrid model – 3 days from the office per week
    • Attractive relocation package

The administrator of your personal data is SAMSUNG ELECTRONICS POLSKA Sp. z o.o., with its registered office in Warsaw, at: ul. Postępu 14, 02-676 Warsaw. You will find more information about the processing of personal data after clicking the "Apply" button.

____________________________________________________________________________________

Administratorem Pana/Pani danych osobowych jest SAMSUNG ELECTRONICS POLSKA Sp. z o.o., z siedzibą w Warszawie, adres: ul. Postępu 14, 02-676 Warszawa. Więcej informacji na temat przetwarzania danych osobowych znajdzie Pan/Pani po kliknięciu w przycisk „Aplikuj”.

×