Junior DevOps Engineer
Work Model: Remote
- Support CI/CD pipelines and deployment processes
- Help automate infrastructure and routine operational tasks
- Work with cloud platforms such as AWS, Azure, and GCP
- Assist with containerized environments using Docker and Kubernetes
- Monitor systems, investigate issues, and improve reliability
- Collaborate with development, operations, and security teams
- Contribute to documentation and continuous improvement
- Bachelor’s degree in Computer Engineering, Computer Science, Software Engineering, or a related field
- Proficiency in English
- Recent graduate or early-career engineer interested in DevOps and cloud technologies
- Basic knowledge of Linux, Git, scripting, and cloud concepts
- Familiarity with tools like Docker, Kubernetes, Terraform, Ansible, or Jenkins is a plus
- Strong problem-solving skills and willingness to learn
- Good communication and teamwork skills
- Internship, academic, or personal project experience in DevOps, cloud, or automation
- An active GitHub profile, open-source contributions, technical blog posts, or a personal website/portfolio
- Remote-first work with flexible hours
- Work with experienced engineers on real-world projects
- Gain hands-on experience with modern DevOps and cloud tools
- Learn in a collaborative and transparent team environment
- Strong engineering culture with room for experimentation
- Build a strong foundation for your career in DevOps
- Internal tech talks and shared learning sessions
- Opportunities to attend local and Europe-based meetups and conferences
- Home office allowance for your setup
- Mentoring and paid cloud and platform learning paths
- Support for AWS and Kubernetes certification exams
Job Description
We are looking for Monitoring & Operations Engineers at Junior, Mid, and Senior levels to operate and monitor hybrid environments, including AWS, Azure, on-premises infrastructure, Windows/Linux servers, databases, cloud services, and Kubernetes platforms.
This role focuses on 24/7 monitoring, incident detection, first-level troubleshooting, and operational support, working closely with DevOps, SRE, Platform, Infrastructure, and Application development teams to ensure high availability and system reliability.
Responsibilities
- Monitor cloud, on-premises, and Kubernetes-based systems in a 24/7 shift-based environment
- Track system health, performance, and availability using:
  - AWS CloudWatch, Azure Monitor
  - Grafana, Prometheus
  - ELK
- Monitor Windows and Linux servers (CPU, memory, disk, services, events)
- Monitor Kubernetes clusters (EKS / AKS / On-Prem K8s):
  - Nodes, pods, deployments, services
  - Cluster events and resource usage
- Analyze alarms and alerts, identify potential root causes, and take first-level actions
- Escalate incidents to relevant teams with clear technical findings and evidence
- Perform end-to-end system checks during incidents (infra, application, network, security, platform)
- Execute operational procedures using runbooks / SOPs
- Log incidents, events, and actions accurately in ticketing systems
- Support maintenance, change, and release activities
- Contribute to improving monitoring coverage, alert quality, and operational processes
Required Skills & Qualifications
Core Technical Skills
- Experience with, or strong interest in, hybrid environments:
  - Cloud (AWS, Azure)
  - On-Prem infrastructure
- Knowledge of Windows Server and Linux fundamentals
- Hands-on experience with monitoring & observability tools:
  - CloudWatch, Azure Monitor
  - Grafana, Prometheus
  - ELK
- Kubernetes monitoring and troubleshooting knowledge
- Understanding of:
  - Networking basics (DNS, TCP/IP, Load Balancers)
  - Application metrics, logs, and events
- Ability to distinguish false alerts from real incidents
- Experience with ticketing and incident management tools (Jira, ServiceNow, Opsgenie, PagerDuty, etc.)
Interested?
To apply, send your resume to hr@hepapi.com