We are seeking a highly skilled Operations Engineer to join our team and take ownership of monitoring and managing a lab environment consisting of servers. The Operations Engineer will be responsible for ensuring the smooth operation and optimal performance of the lab infrastructure. Additionally, the candidate should have a demonstrated ability to diagnose and debug faults in complex services or systems. The ideal candidate should possess strong server troubleshooting skills, a solid understanding of networking fundamentals, and experience using diagnostic tools. Candidates with understanding of Azure IaaS, especially its administration, will have a strong preference.
Responsibilities:
· Manage and maintain the lab infrastructure consisting of servers, ensuring its availability and performance.
· Diagnose and debug faults in complex services or systems, utilizing troubleshooting techniques and diagnostic tools effectively.
· Demonstrate expertise in server troubleshooting, including identifying and mitigating faults promptly to minimize downtime.
· Possess a strong understanding of networking fundamentals to effectively diagnose network-related issues and optimize performance.
· Utilize diagnostic tools to analyze network traffic and diagnose faults accurately.
· Collaborate with feature teams to identify and drive recovery levers, ensuring timely resolution of system issues and minimizing impact.
· Analyze incoming volume and proactively drive improvements to optimize system performance and scalability.
Qualifications:
· Bachelor’s degree in computer science, Information Technology, or a related field.
· Proven experience diagnosing and debugging faults in complex services or systems.
· Strong working knowledge of server troubleshooting techniques and best practices.
· Solid understanding of networking fundamentals, including TCP/IP, DNS, and routing protocols.
· Experience using diagnostic tools such as Netmon, WinDBG, and Wireshark to diagnose and mitigate faults.
· Familiarity with online systems and experience diagnosing/debugging service faults in such environments.
· Ability to analyze incoming volume and implement improvements to optimize system performance.
· Excellent problem-solving and troubleshooting skills with a keen attention to detail.
· Ability to work independently, prioritize tasks, and adapt to changing priorities in a dynamic environment.
Job Type: Full-time
Pay: From $60,000.00 per year
Benefits:
- 401(k)
- 401(k) matching
- Dental insurance
- Health insurance
- Life insurance
- Paid time off
- Parental leave
- Retirement plan
- Vision insurance
Compensation package:
Schedule:
Application Question(s):
- Do you require sponsorship to work in the United States?
- This is a full time hire, no C2C employment. Are you okay with this?
Experience:
- Server Monitoring Systems: 1 year (Preferred)
- IT Systems Management: 2 years (Required)
- Hardware Configuration and Debugging: 1 year (Required)
- PowerShell Execution: 1 year (Required)
- Windows Server Configuration and Debugging: 1 year (Required)
- VMware Administration: 2 years (Preferred)
- Azure IaaS: 1 year (Preferred)
Work Location: In person