Role Overview
As a Linux System Administrator, you will ensure the reliability, efficiency, and security of Linux-based systems deployed across on-premises and cloud platforms. You'll play a key role in maintaining infrastructure stability, responding to incidents, and supporting a wide range of client environments under defined service agreements.
Key Responsibilities
- Deploy, configure, and maintain Linux servers including Ubuntu, Debian, CentOS, and RHEL distributions
- Perform system updates, security patching, and hardening in line with industry standards
- Manage user access, authentication, permissions, and privilege escalation controls
- Monitor system performance, plan capacity, and ensure high availability of critical services
- Administer shared infrastructure components such as DNS, NTP, SMTP relays, web servers, and file storage
- Support application teams relying on shared Linux platforms with timely troubleshooting and guidance
- Operate and optimize virtualization and container technologies including VMware, KVM, Docker, and Kubernetes
- Configure and maintain monitoring solutions using tools like Zabbix, Prometheus, Nagios, and Grafana
- Respond to system alerts and incidents as part of a rotating on-call schedule, adhering to SLA requirements
- Lead initial diagnosis, containment, and escalation of critical system issues
- Document root causes and contribute to post-incident improvement plans
- Enforce security policies, manage firewalls and VPNs, and secure access via SSH and jump hosts
- Support compliance with regulatory and organizational standards through audits and controls
- Test disaster recovery procedures and help define recovery time objectives (RTO) and recovery point objectives (RPO)
- Automate routine tasks using Bash, Python, or similar scripting languages
- Use configuration management tools such as Ansible, Puppet, Chef, or SaltStack for consistent deployments
- Contribute to infrastructure as code practices using Terraform or CloudFormation
- Develop and maintain technical documentation, runbooks, and on-call procedures
- Collaborate with network, security, and application teams to resolve cross-functional issues
- Share expertise within the team and assist in onboarding new members
- Participate in operational reviews, planning sessions, and capacity assessments
Required Qualifications
- 3–5 years of hands-on experience managing Linux systems in production environments
- Proven ability to diagnose and resolve complex system issues
- Strong communication and collaboration skills
- Fluency in English
- Experience with Linux distributions including Ubuntu, Debian, CentOS, and RHEL
- Familiarity with system configuration, user management, SSH, and package management
- Understanding of networking fundamentals: TCP/IP, DNS, routing, and firewall concepts
- Experience with web servers such as Apache and Nginx, and log analysis
- Working knowledge of monitoring tools (e.g., Zabbix, Prometheus, Nagios), backup systems, and Bash scripting
- Ability to perform under pressure during production incidents
Preferred Qualifications
- Experience with virtualization platforms like VMware or KVM
- Familiarity with containerization and orchestration tools such as Docker and Kubernetes
- Proficiency in configuration management tools including Ansible, Puppet, Chef, or SaltStack
- Exposure to cloud platforms such as AWS, Azure, or GCP
- Knowledge of CI/CD pipelines and automation practices
- Understanding of security frameworks and standards such as CIS, ISO, or NIST
Work Environment
This role includes participation in a rotating on-call schedule to ensure continuous service availability. Although the position is primarily based locally, remote troubleshooting is supported, which allows a flexible response to system events outside business hours.