Senior Site Reliability Engineer (m/f/d) - full-time
Do you love stories? If so, please keep reading, because we certainly do. We believe the ability to tell stories is what makes us human. Joyn is your streaming app with over 50 live TV channels, exclusive previews, originals and collections. We understand Joyn as a partnership: an invitation to content providers and users alike to make entertainment more meaningful and fun. Our app aggregates global and especially local content in a relevant way for Germany, both live TV and on-demand content. All kinds of stories and more to come, every day.
We hire the best, because we need people who are as customer-focused as we are. We are looking for champions to help us further connect with our audience. It’s not a small or easy task, but it’s a fun and rewarding one. Do you think you’re up for it? Great. Then send us your application!
About the Job
We are looking for a Senior Site Reliability Engineer to help build and operate the next-generation streaming platform for the German market. Together with the team, you will build the new glue code and tools for our AWS-based platform, which is the core of all our services. Our mission is to provide our Engineering teams with an up-to-date and easy-to-use toolset that follows industry best practices.
What you tell people at parties
"My services run in a large-scale cloud environment, making sure the audience can enjoy live streaming and video on demand in the Joyn app on any device, anywhere."
What you will do
Design and provide best practices for cloud-infrastructure provisioning and the usage of several cloud services
Foster excellence in development teams in terms of security, scalability, and reliability
Set up and maintain monitoring, metrics, and alerting systems for fine-grained observability of the entire infrastructure and the Joyn product
Automate the build and deployment processes for our engineering teams
Provide technical guidance and tools for our engineering teams for faster releases
Support the engineering teams on their microservice principles and architectural approaches
Actively participate in architecture discussions and propose solutions to system and product changes across teams
Be part of an on-call schedule (24x7) handling incident management for the engineering teams
Develop documentation and training to onboard and level up the infrastructure knowledge within the organization
How you will do it
You enjoy solving difficult technical problems as part of the team
We would like you to take ownership of the tools that you are building and work with your colleagues to deliver a reliable, monitored, and highly available solution
You develop code for solving complex infrastructure problems, and you find solutions that are configurable, easy to maintain, and sustainable
We care about our consumers, engineering teams, and end users, and we listen to and reflect on their needs when we design a solution
You learn from both success and failure, actively coach, and get coached by the team
What we are looking for
5+ years of experience in DevOps, Site Reliability Engineering, or Software Development roles
2+ years of experience with one of the major cloud providers, AWS preferred
A passionate Linux enthusiast with a good understanding of CS fundamentals
Experience with Infrastructure as Code in at least one of AWS CloudFormation, Terraform, Terragrunt
Scripting knowledge in any modern language like Python, Golang, Java, Node.js, Ruby
Experience in building and maintaining CI/CD pipelines
Experience with monitoring systems like Prometheus, Dynatrace, and the ELK stack
Experience in container technologies like Docker, Kubernetes
AWS ECS experience is a plus
Solid analytical and problem-solving skills with an appreciation of technical risks
University degree in computer science, information technology, media engineering, or equivalent
Good written and verbal communication skills - English is our team language