We are seeking a highly skilled and motivated Data Center Operations Technician 3 to join our dynamic infrastructure operations team. In this role, you will be responsible for diagnosing, troubleshooting, and repairing servers. The ideal candidate will have a deep understanding of data center environments, hands-on hardware repair experience, and proficiency in Linux-based operating systems. You will play a critical role in ensuring the stability, reliability, and efficiency of our data center infrastructure.
Responsibilities & Tasks
Hardware Troubleshooting & Repair:
Diagnose and resolve complex hardware failures on a variety of servers, including motherboards, CPUs, RAM, and storage devices
Perform component-level repairs and replacements on servers and other data center hardware
Manage and execute the hardware break/fix process, ensuring minimal downtime and adherence to service level agreements (SLAs)
Conduct root cause analysis of hardware failures and provide recommendations for preventative measures
Linux System Administration:
Use the Linux command-line interface (CLI) for system monitoring, troubleshooting, and configuration
Assist in the deployment and provisioning of new servers running various Linux distributions
Troubleshoot boot-level and OS-level issues on Linux servers
Work with engineering teams to resolve escalated technical issues related to the interaction between hardware and the Linux OS
Data Center Operations:
Manage data center inventory, including spare parts and retired assets
Maintain detailed documentation of all hardware repairs, changes, and configurations
Participate in on-call rotations to respond to after-hours emergencies (if applicable)
Adhere to all safety and security protocols within the data center environment
Mentorship & Collaboration:
Provide training and guidance to team members on best practices for hardware repair and troubleshooting
Collaborate with network, storage, and other infrastructure teams to resolve complex, cross-functional issues
Experience, Skills, and Qualifications Required
Experience:
3-5 years of experience in a data center environment, with a significant focus on hardware troubleshooting and repair
Minimum of 2 years of hands-on experience with Linux operating systems (e.g., RHEL, CentOS, Ubuntu) in a server environment (mandatory)
Technical Skills:
Expert knowledge of x86 server architecture and components
Proficiency in diagnosing and repairing server hardware
Strong understanding of network hardware, including switches, routers, and firewalls
Solid command of Linux/Unix command-line tools for diagnostics and troubleshooting
Familiarity with data center infrastructure management (DCIM) and ticketing systems
Experience with structured cabling and fiber optic connectivity
Certifications (preferred but not required):
CompTIA A+
CompTIA Server+
CompTIA Linux+ or LPI certification
Vendor-specific hardware certifications
Physical Requirements:
Ability to lift and move equipment up to 50 lbs.
Ability to work in a temperature-controlled environment with moderate noise levels
Ability to stand, walk, bend, and kneel for extended periods