Job Responsibilities:
Develop back-end services, workflows and automation that handle fleet management and the full server lifecycle (network boot, firmware updates, OS provisioning, failure detection, data erasure and decommissioning)
Develop out-of-band server management tooling in a multi-vendor environment, including state management and telemetry collection
Write and review code, test changes on hardware, automate testing
Troubleshoot provisioning, firmware update and network boot issues end to end
Skills:
3+ years of experience in the following areas:
Experience in TCP/IP network fundamentals
Experience with Linux systems and server systems management
Experience in troubleshooting the server network boot process from both the network and the server point of view
Experience in coding and scripting automation (Python, Go, Rust, Bash, Ruby)
Experience in automating management of large fleets of server hardware (Redfish, IPMI)
Proven experience in troubleshooting and analyzing server/system architecture and components
Experience with Kubernetes, Docker and/or cloud deployment technologies
Education/Experience:
Bachelor's degree in computer science, software engineering or a relevant field required. Experience can be used in lieu of education.
Linux or TCP/IP certification preferred.