We are seeking a talented Lead Linux Systems Engineer responsible for operating, maintaining, and improving the services offered by the Managed Linux OS team.
Responsible for the overall reliability of these infrastructure services supporting critical business applications.
Supporting a high-volume 24x7 production environment, as well as development and pre-production environments.
Maintain and improve OS focused infrastructure and ensure the services which support the thousands of Linux servers in the various environments are functioning well.
Write tools to automate routine and complex tasks and troubleshoot application and infrastructure issues.
The position includes collaboration with development teams to design a scalable and supportable service-oriented architecture.
This role will be expected to be a subject matter expert on core Linux technologies and help drive efforts to improve automation and tooling.
ROLE RESPONSIBILITIES:
Provide expertise for Linux OS platform, including OS lifecycle management, configuration management, system hardening, patching strategy, inventory management, system configuration
Proactively identify and address stability, capacity, and performance concerns
Implement provisioning and deployment automation / IaC solutions
Identify & implement solutions to other needs for process automation
Checking in code for infrastructure build, automation, & tests to version control repository (GitLab)
Support PCI / security compliance requirements (upgrades, defect management, etc)
Regularly work with Jira, GitLab, Prometheus, Grafana, Splunk
Liaise with other teams (Networking, Data Center Ops, Security, Software Development, etc) as required
Respond to, recover from, and prevent future occurrences of failures & outages
Participate in on-call and potentially some after-hours support as required
Various other products in our tool set you will need to use regularly: git, GitLab, Ansible, Chef, Prometheus, Grafana, Jira, Confluence, Splunk, Kibana
Provide guidance to junior level engineers while also evangelizing the technology set and implementation choices to the rest of the organization
Working closely with direct line manager during discovery phase of projects and strategies lending expertise in the domain
TECHNICAL REQUIREMENTS:
8+ years of relevant experience
Bachelor's Degree in Engineering, Technology or Related Field
Good knowledge of Linux and TCP/IP networking fundamentals
Ability to Automate Infrastructure operations through scripting and configuration management systems (Python, Terraform, GoLang preferred)
Experience with troubleshooting server-side applications in Linux
Experience managing large-scale Linux infrastructure across public cloud and private data centers
Experience with software development life cycle (SDLC) and agile/iterative methodologies
Experience with IT compliance and risk management requirements (eg. security, PCI compliance, privacy etc.)
Experience with configuration and maintenance of common applications such as Apache, mod-perl, Nginx, MySQL, Memcached, DHCP, NFS, SSH, NTP
Experience with coding in at least one of programming languages - Python, Perl, GoLang, Java
Experience with Linux virtualization and containerization technologies such as Xen, KVM, Docker, LXC
Experience with common Linux software packaging and distribution such as GNU Autotools, RPM, YUM, CPAN, NPM
Experience creating extremely consumable, yet complete, technical assets such as documentation and architecture diagram describing existing systems as well as new solutions
ROLE DESCRIPTION:
We are seeking a talented Lead Linux Systems Engineer responsible for operating, maintaining, and improving the services offered by the Managed Linux OS team.
Responsible for the overall reliability of these infrastructure services supporting critical business applications.
Supporting a high-volume 24x7 production environment, as well as development and pre-production environments.
Maintain and improve OS focused infrastructure and ensure the services which support the thousands of Linux servers in the various environments are functioning well.
Write tools to automate routine and complex tasks and troubleshoot application and infrastructure issues.
The position includes collaboration with development teams to design a scalable and supportable service-oriented architecture.
This role will be expected to be a subject matter expert on core Linux technologies and help drive efforts to improve automation and tooling.
ROLE RESPONSIBILITIES:
Provide expertise for Linux OS platform, including OS lifecycle management, configuration management, system hardening, patching strategy, inventory management, system configuration
Proactively identify and address stability, capacity, and performance concerns
Implement provisioning and deployment automation / IaC solutions
Identify & implement solutions to other needs for process automation
Checking in code for infrastructure build, automation, & tests to version control repository (GitLab)
Support PCI / security compliance requirements (upgrades, defect management, etc)
Regularly work with Jira, GitLab, Prometheus, Grafana, Splunk
Liaise with other teams (Networking, Data Center Ops, Security, Software Development, etc) as required
Respond to, recover from, and prevent future occurrences of failures & outages
Participate in on-call and potentially some after-hours support as required
Various other products in our tool set you will need to use regularly: git, GitLab, Ansible, Chef, Prometheus, Grafana, Jira, Confluence, Splunk, Kibana
Provide guidance to junior level engineers while also evangelizing the technology set and implementation choices to the rest of the organization
Working closely with direct line manager during discovery phase of projects and strategies lending expertise in the domain
TECHNICAL REQUIREMENTS:
8+ years of relevant experience
Bachelor's Degree in Engineering, Technology or Related Field
Good knowledge of Linux and TCP/IP networking fundamentals
Ability to Automate Infrastructure operations through scripting and configuration management systems (Python, Terraform, GoLang preferred)
Experience with troubleshooting server-side applications in Linux
Experience managing large-scale Linux infrastructure across public cloud and private data centers
Experience with software development life cycle (SDLC) and agile/iterative methodologies
Experience with IT compliance and risk management requirements (eg. security, PCI compliance, privacy etc.)
Experience with configuration and maintenance of common applications such as Apache, mod-perl, Nginx, MySQL, Memcached, DHCP, NFS, SSH, NTP
Experience with coding in at least one of programming languages - Python, Perl, GoLang, Java
Experience with Linux virtualization and containerization technologies such as Xen, KVM, Docker, LXC
Experience with common Linux software packaging and distribution such as GNU Autotools, RPM, YUM, CPAN, NPM
Experience creating extremely consumable, yet complete, technical assets such as documentation and architecture diagram describing existing systems as well as new solutions