Join Gamingtec as a Site Reliability Engineer and power high-performance iGaming systems with automation, observability, and 24/7 reliability — all in a fully remote, flexible, and rewarding environment.
Are you ready to take your reliability engineering skills to the next level — in the high-stakes world of iGaming?
We are expanding the engineering team responsible for ensuring the stability and predictable behaviour of our distributed services and platforms. The role involves working with production infrastructure, analysing system behaviour, and implementing practices that improve reliability across multiple platforms.
This position is intended for engineers who clearly understand the difference between SRE and DevOps practices, and for whom SLOs, error budgets, and availability targets such as 99.85–99.95% are practical tools rather than abstract concepts.
You will work as part of an SRE shift schedule covering late-evening and night hours (17:00–01:00 and 00:00–08:00 CET, in rotation) to ensure end-to-end ownership of incidents, from user impact to root cause and follow-up improvements.
All you need is:
Core skills:
Strong Linux skills in production environments (debugging, performance, system services);
Solid understanding of networking (TCP/IP, DNS, HTTP, load balancing, TLS);
Hands-on experience operating Kubernetes in production (not just local clusters);
Experience with AWS cloud services (for example: EC2, ALB/NLB, RDS, S3, IAM, EKS or self-managed Kubernetes);
Confident use of Terraform and Ansible in real environments (multi-environment IaC, reusable modules/roles);
Experience with observability tools:
metrics and alerting (Prometheus/Alertmanager or similar),
dashboards (Grafana or similar),
logging (ELK stack, Loki or comparable solutions).
Ability to troubleshoot across application, network, and infrastructure layers, using scripting and tools (Python/Go/Bash, curl, tcpdump, log analysis, etc.);
Experience with containers and image lifecycle (Docker or compatible runtimes).
Experience:
Participation in production incidents and technical post-incident reviews (not just on-call escalation);
2–5 years of practical experience in SRE, infrastructure, platform or production-focused DevOps engineering;
Experience working within CI/CD pipelines (for example: Jenkins, GitLab CI, GitHub Actions, ArgoCD or similar);
Exposure to environments with high availability requirements (e.g. low tolerance to downtime, strict SLAs/SLOs).
Availability to work between 5 PM and 8 AM CET, in one of the following shifts: 17:00–01:00 or 00:00–08:00.
Also, it will be great if you have:
Experience with high-load or real-time systems (payments, finance, gaming, streaming);
Experience with CDNs or real-time log aggregation/analytics;
Familiarity with databases and message systems (for example: PostgreSQL, MySQL, MongoDB, Kafka, Redis, RabbitMQ);
Experience with involving external integrations and third-party APIs (payment providers, KYC, risk/anti-fraud, content providers);
Experience with service meshes, API gateways or ingress controllers (Istio, Linkerd, NGINX, Envoy, etc.).
Your daily adventures will look like:
Contributing to architectural changes affecting the reliability and scalability of services and platforms;
Working with AWS-based environments (networking, storage, compute, managed services);
Managing infrastructure using Terraform and configuration management with Ansible;
Developing and refining monitoring and observability across platforms (Prometheus, Alertmanager, Grafana, and log aggregation such as ELK / Loki);
Participating in incident handling: initial classification, technical investigation, coordination with product/engineering teams, and following-up improvements;
Reducing operational toil and building tools that support reliability and efficiency (internal utilities, automation, CI/CD improvements);
Collaborating with development teams to embed SRE practices into the lifecycle of services (SLIs/SLOs, error budgets, readiness for production).
Success Metrics:
Maintain and improve SLOs for key services in the 99.85–99.95% availability range, with clear SLIs and error budgets;
Keep unplanned downtime below 1% for critical user-facing functionality;
Ensure that the majority of infrastructure and platform configuration (target ≥ 90–95%) is managed as code (Terraform, Ansible, Kubernetes manifests/Helm charts);
Systematically reduce MTTR (Mean Time To Recovery) for incidents by improving detection, diagnostics and standard operating procedures;
Prevent repeated high-severity incidents by driving post-incident reviews and concrete follow-up actions (configuration changes, automation, runbooks, architectural adjustments);
Maintain up-to-date operational documentation and runbooks for core services, so that incidents can be handled consistently across the team.
So, why Gamingtec?
If you are a person with passion, ideas, and a thirst to advance your career, you will love our corporate culture.
We are an international team that treats each other with respect and moves towards the same goals. We believe in freedom and flexibility and trust our employees to do their jobs in a way that works for them.
We have an ambitious and rewarding work environment, a flat organisational structure and almost zero bureaucracy. Our employees’ ideas are what move the company forward. Everyone has equal opportunities in every aspect of work, learning and development!
Why you will love working here:
Being a part of an international team, where everyone treats each other with respect and moves towards the same goal;
Freedom and responsibility. You do not need to be told what to do;
Competitive salaries. We want only the top performers, so we offer the appropriate remuneration for their experience and knowledge;
Fully remote work. If you are in one of the areas where one of our offices is located, you will also have the option to go to the office;
Paid vacation and sick leave days. We believe that everyone should have a good work-life balance and no one should burn out;
Constant career development & learning opportunities!
Enjoy the corporate atmosphere with awesome parties and team-building events throughout the year;
Refer your friends and get rewarded with a bonus after they pass their probation period;
Find the right private medical insurance that works for you and receive compensation for it. Compensation (full/partial) depends on the cost;
Flexible Benefits plan. Decide which of your activities/expenses you want the company to compensate you for. For example: gym subscription, language courses, Netflix subscription, a spa day, etc;
Education foundation for learning something new. Be part of our biannual raffle that gives you the chance to learn something new, unrelated to your job.
And this is how our interview process goes:
A 30-minute interview with a member of our HR team to get to know you and your experience;
1st stage of technical interview (1 h) with our DevOps team to assess your theoretical skills;
2nd stage of technical interview (1 h) to assess your hard skills;
A final 1-hour interview to gauge your fit with our culture and working style.
Sounds interesting? Do not hesitate to apply or contact us if you have any questions!
Gamingtec is waiting for you!
Advantages of Our Company
Zero bureaucracy
Unlimited holidays
Medical insurance
Only top talent and top salaries
Challenging projects and tasks
Smartest colleagues in the industry
Our Principles
Celebrate diversity
Celebrate diversity
Innovation matters
Innovation matters
Ambitious goals
Ambitious goals
This website uses cookies
We use third-party cookies in order to personalize your site experience.
Learn more
The Gamingtec website utilizes cookies to store and access visitor information with the purpose of enhancing security and improving the browsing experience. If you do not wish for the collection of such information, you can toggle these off:
Necessary
Necessary cookies are essential for the website to function properly. This category only includes cookies that ensure basic functionalities and security features of the website. These cookies do not store any personal information.
Marketing
Marketing cookies track your online activity to help advertisers deliver more relevant advertising or to limit how many times you see an ad. Said information can be shared with other organizations or advertisers. These are permanent cookies and almost always of third-party provenance.
Analytics & Statistics
Analytical and statistical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, traffic sources, etc.